- Amazon (Seattle, WA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more
- Amazon (Cupertino, CA)
- …and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more
- Amazon (Seattle, WA)
- …Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role ... Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Amazon (Seattle, WA)
- …Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more
- Amazon (Sunnyvale, CA)
- Description The Sensory Inference team at AGI is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize ... interact with the world. We push the limits of inference performance to provide the best possible experience for...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Microsoft Corporation (Redmond, WA)
- …- so that everyone can realize its benefits. We're looking for an experienced **Site Reliability Engineer (SRE)** to join our infrastructure team. In ... workflows. **Qualifications** **Required Qualifications** + 4+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.… more
- Amazon (Palo Alto, CA)
- …the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more
- Amazon (New York, NY)
- …the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more
- Warner Bros. Discovery (Atlanta, GA)
- …products include popular, related and personalized content recommendations, contextual ad targeting, and site search - serving millions of CNN users via CNN web and ... with bandits for online ranking of recommendations * **Optimize Site Performance:** Dynamically deliver personalized content alongside cached assets, improving… more
- The Walt Disney Company (Nicasio, CA)
- …a related field. Master's Degree is preferred + 5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2+ years ... The Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our machine learning and AI… more
- CGI Technologies and Solutions, Inc. (Knoxville, TN)
- **Full Stack AI Engineer** **Category:** Analytics and Emerging Digital Technologies **Main location:** United States, Tennessee, Knoxville **Position ID:** ... **Employment Type:** Full Time **Position Description:** CGI is seeking a Full-Stack AI Engineer to join our dynamic team, where you'll be responsible for building… more
- Evident Scientific (Needham, MA)
- …whole-slide imaging software systems. This role bridges field deployment engineering and site reliability, ensuring that every customer system is installed ... Senior Software Deployment & Customer Operations Engineer (Digital Pathology, Evident MIS) Job ID #:...and refine deployment playbooks and qualification checklists. 2. System Reliability & Upgrades * Manage version rollouts, patch upgrades,… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration:… more
- Amazon (Boston, MA)
- …of robotics. We are looking for a talented Senior Machine Learning Engineer to help us develop state-of-the-art, next generation web search capabilities within ... systems as well. (iv) push model performance to limits; optimize model inference to maximize hardware utilization, reducing GPU inference latency, balancing… more
- PennyMac (Westlake Village, CA)
- …equivalent experience). + 5+ years of experience in a Platform Engineering, DevOps or Site Reliability Engineering (SRE) role. + 1+ year(s) of experience with AI ... through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and manage scalable and resilient infrastructure on… more
- JPMorgan Chase (Wilmington, DE)
- …adventure where you can push the limits of what's possible. As a Lead Software Engineer at JPMorgan Chase within the Chief Data and Analytics Office, you are an ... high-quality production code; review and debug ML pipeline, data processing, and inference code. + Identify opportunities to eliminate or automate remediation of… more
- Amazon (San Francisco, CA)
- …breakthrough foundation models run at production scale. As a Software Development Engineer embedded in our science team, you'll be instrumental in transforming novel ... applications, leveraging your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll balance deep… more
- Amazon (San Francisco, CA)
- …foundation models run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental in transforming cutting-edge ... your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Amazon (Cambridge, MA)
- …Imagine your future with us. ABOUT THIS ROLE As a Lead Software Development Engineer - AI/ML, it's up to you to design, architect, and implement machine learning ... intelligent systems that learn, adapt, and evolve. ABOUT YOU As a Lead Software Engineer - AI/ML, you will - Design and develop machine learning architectures and… more