Site Reliability Engineer Inference Jobs

61 jobs (page 1)

Categories

All Categories

Software/IT (10)

Senior Software Development Engineer…

Amazon (Seattle, WA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more

Amazon (01/06/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer - AI/ML, AWS…

Amazon (Seattle, WA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more

Amazon (12/31/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI/ML,…

Amazon (Cupertino, CA)

…and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more

Amazon (12/21/25)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer , AWS Neuron…

Amazon (Seattle, WA)

…Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role ... Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (12/24/25)
- Save Job - Related Jobs - Block Source
Software Engineer -AI/ML, AWS Neuron…

Amazon (Seattle, WA)

…Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more

Amazon (12/21/25)
- Save Job - Related Jobs - Block Source
AGI Inference Software Development…

Amazon (Sunnyvale, CA)

Description The Sensory Inference team at AGI is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize ... interact with the world. We push the limits of inference performance to provide the best possible experience for...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (12/12/25)
- Save Job - Related Jobs - Block Source
MTS - Site Reliability…

Microsoft Corporation (Redmond, WA)

…- so that everyone can realize its benefits. We're looking for an experienced ** Site Reliability Engineer (SRE)** to join our infrastructure team. In ... workflows. **Qualifications** **Required Qualifications** + 4+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.… more

Microsoft Corporation (12/17/25)
- Save Job - Related Jobs - Block Source
Sr Software Dev Engineer , Machine…

Amazon (Palo Alto, CA)

…the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more

Amazon (11/04/25)
- Save Job - Related Jobs - Block Source
Sr. Software Dev Engineer , Sponsored…

Amazon (New York, NY)

…the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more

Amazon (10/23/25)
- Save Job - Related Jobs - Block Source
Senior Staff Data Engineer

Warner Bros. Discovery (Atlanta, GA)

…products include popular, related and personalized content recommendations, contextual ad targeting, and site search - serving millions of CNN users via CNN web and ... with bandits for online ranking of recommendations * **Optimize Site Performance:** Dynamically deliver personalized content alongside cached assets, improving… more

Warner Bros. Discovery (11/06/25)
- Save Job - Related Jobs - Block Source
Sr ML Ops Engineer

The Walt Disney Company (Nicasio, CA)

…a related field. Master's Degree is preferred + 5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2+ years ... The Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our machine learning and AI… more

The Walt Disney Company (12/19/25)
- Save Job - Related Jobs - Block Source
Full Stack AI Engineer

CGI Technologies and Solutions, Inc. (Knoxville, TN)

**Full Stack AI Engineer ** **Category:** Analytics and Emerging Digital Technologies **Main location:** United States, Tennessee, Knoxville **Position ID:** ... **Employment Type:** Full Time **Position Description:** CGI is seeking a Full-Stack AI Engineer to join our dynamic team, where you'll be responsible for building… more

CGI Technologies and Solutions, Inc. (12/17/25)
- Save Job - Related Jobs - Block Source
Senior Software Deployment & Customer Operations…

Evident Scientific (Needham, MA)

…whole-slide imaging software systems. This role bridges field deployment engineering and site reliability , ensuring that every customer system is installed ... Senior Software Deployment & Customer Operations Engineer (Digital Pathology, Evident MIS) Job ID #:...and refine deployment playbooks and qualification checklists. 2. System Reliability & Upgrades * Manage version rollouts, patch upgrades,… more

Evident Scientific (12/16/25)
- Save Job - Related Jobs - Block Source
AI Senior Staff Systems Engineer

Cadence Design Systems, Inc. (San Jose, CA)

…world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration:… more

Cadence Design Systems, Inc. (12/29/25)
- Save Job - Related Jobs - Block Source
Senior Machine Learning Engineer , WebIR…

Amazon (Boston, MA)

…of robotics. We are looking for a talented Senior Machine Learning Engineer to help us develop state-of-the-art, next generation web search capabilities within ... systems as well. (iv) push model performance to limits; optimize model inference to maximize hardware utilization, reducing GPU inference latency, balancing… more

Amazon (11/27/25)
- Save Job - Related Jobs - Block Source
Senior AI Platform Engineer

PennyMac (Westlake Village, CA)

…equivalent experience). + 5+ years of experience in a Platform Engineering, DevOps or Site Reliability Engineering (SRE) role. + 1+ year(s) of experience with AI ... through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and manage scalable and resilient infrastructure on… more

PennyMac (01/07/26)
- Save Job - Related Jobs - Block Source
Lead Software Engineer - GCP ML…

JPMorgan Chase (Wilmington, DE)

…adventure where you can push the limits of what's possible. As a Lead Software Engineer at JPMorgan Chase within the Chief Data and Analytics Office, you are an ... high-quality production code; review and debug ML pipeline, data processing, and inference code. + Identify opportunities to eliminate or automate remediation of… more

JPMorgan Chase (12/26/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer , Frontier AI…

Amazon (San Francisco, CA)

…breakthrough foundation models run at production scale. As a Software Development Engineer embedded in our science team, you'll be instrumental in transforming novel ... applications, leveraging your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll balance deep… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
Sr. Software Development Engineer , FAR…

Amazon (San Francisco, CA)

…foundation models run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental in transforming cutting-edge ... your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
Lead Software Development Engineer - AI/ML,…

Amazon (Cambridge, MA)

…Imagine your future with us. ABOUT THIS ROLE As a Lead Software Development Engineer - AI/ML, it's up to you to design, architect, and implement machine learning ... intelligent systems that learn, adapt, and evolve. ABOUT YOU As a Lead Software Engineer - AI/ML, you will - Design and develop machine learning architectures and… more

Amazon (01/02/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search