Site Reliability Engineer Inference Jobs in Fremont, CA

Senior Software Development Engineer…

Amazon (Cupertino, CA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more

Amazon (01/06/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer…

Amazon (Cupertino, CA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more

Amazon (12/10/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI/ML,…

Amazon (Cupertino, CA)

…and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more

Amazon (12/21/25)
- Save Job - Related Jobs - Block Source
AGI Inference Software Development…

Amazon (Sunnyvale, CA)

Description The Sensory Inference team at AGI is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize ... interact with the world. We push the limits of inference performance to provide the best possible experience for...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (12/12/25)
- Save Job - Related Jobs - Block Source
Sr Software Dev Engineer , Machine…

Amazon (Palo Alto, CA)

…the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more

Amazon (11/04/25)
- Save Job - Related Jobs - Block Source
AI Senior Staff Systems Engineer

Cadence Design Systems, Inc. (San Jose, CA)

…world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration:… more

Cadence Design Systems, Inc. (12/29/25)
- Save Job - Related Jobs - Block Source
Sr. Software Engineer (ML), AGI…

Amazon (Sunnyvale, CA)

…(AGI) team is looking for a passionate, talented, and inventive Sr. ML Engineer with a strong machine learning background, to build customization capabilities such ... as fine tuning and distillation. As a Sr. ML engineer with the AGI team, you will be responsible...responsible for leading the development of novel LLM training, inference techniques and optimizations to advance the state of… more

Amazon (12/03/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer , Neuron…

Amazon (Cupertino, CA)

…and the Trn1 and Inf1 servers that use them. As the Software Development Engineer for the Neuron Foundation Tools Team, you will be responsible for working alongside ... development life cycle of the Neuron Profiler/Tools toolchain, ensuring scalability, reliability , and usability. You will collaborate with cross-functional teams to… more

Amazon (11/19/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer - Circuit…

Cadence Design Systems, Inc. (San Jose, CA)

…innovative results in real-world settings. Core Expertise + Statistical inference : significance testing (p-values, confidence intervals), Bayesian statistics, design ... Carlo methods (random sampling, density estimation). + Rare-event and reliability analysis (a plus): importance sampling, subset simulation, cross-entropy methods,… more

Cadence Design Systems, Inc. (01/08/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer

Amazon (Cupertino, CA)

…Web Services (AWS) is building a central pipeline of Software Development Engineer (SDE) talent for anticipated roles in 2026. This requisition supports hiring ... and AWS CloudFront. Key job responsibilities As an AWS Software Development Engineer , you will: - Design, develop, and maintain efficient, reusable, and reliable… more

Amazon (12/20/25)
- Save Job - Related Jobs - Block Source
Sr. Software Development Engineer…

Amazon (Cupertino, CA)

…compiler teams. * Collect requirements from various other teams including training, inference and runtime. * Collaborate with the compiler performance team to ensure ... Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to our AWS… more

Amazon (01/09/26)
- Save Job - Related Jobs - Block Source
Sr. System Development Engineer…

Amazon (Cupertino, CA)

…the foundation of the world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale. Here, you'll design, ... complex problems. You will decompose big difficult server system testability, reliability and diagnosis problems into straightforward tasks, components or features… more

Amazon (10/25/25)
- Save Job - Related Jobs - Block Source
ASIC Design Engineer , Cloud-Scale Machine…

Amazon (Cupertino, CA)

…data centers including AWS Inferentia, our custom designed machine learning inference datacenter server. Our success depends on our world-class server ... making the right trade-offs. Key job responsibilities As an ASIC Design Engineer , you will: * Develop and implement high-performance, area and power-efficient RTL… more

Amazon (12/18/25)
- Save Job - Related Jobs - Block Source
Sr. Machine Learning - Compiler Engineer…

Amazon (Cupertino, CA)

…building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium delivers the best-in-class ... quantum leap in performance. As a Machine Learning Compiler Engineer II in the AWS Neuron Compiler team, you...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (12/17/25)
- Save Job - Related Jobs - Block Source
Sr ML Compiler Engineer , Annapurna Labs

Amazon (Cupertino, CA)

…for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (11/12/25)
- Save Job - Related Jobs - Block Source
Sr. Machine Learning - Compiler Engineer…

Amazon (Cupertino, CA)

…for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more

Amazon (10/29/25)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer PhD (Full Time)…

Cisco (Milpitas, CA)

…distillation, and generative adversarial networks (GANs). Performance, scalability, and reliability are front and center as models are trained, fine-tuned, ... academic or professional projects. **Preferred Qualifications** + Experience working with inference engines (eg, vLLM, Triton, TorchServe). + Knowledge of GPU… more

Cisco (12/20/25)
- Save Job - Related Jobs - Block Source
AI Machine Learning Engineer II (Intern)…

Cisco (San Jose, CA)

…distillation, and generative adversarial networks (GANs). Performance, scalability, and reliability are front and center as models are trained, fine-tuned, ... or academic/research project documentation. **Preferred Qualifications** + Experience with inference engines such as vLLM, Triton, or TorchServe. + Knowledge… more

Cisco (12/01/25)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer , Model…

Amazon (Mountain View, CA)

…experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Bachelor's degree ... learning and/or machine learning methods (eg for training, fine tuning, and inference ) - Hands-on experience with generative AI technology Preferred Qualifications -… more

Amazon (12/20/25)
- Save Job - Related Jobs - Block Source
ML Kernel Performance Engineer , AWS…

Amazon (Cupertino, CA)

…seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron Compiler ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more

Amazon (11/15/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search