Staff Ml Engineer Inference Jobs | Juju

Staff ML Engineer…

General Motors (Warren, MI)

…more) while maintaining reliability and cost efficiency. **About the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust ... **This job is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within… more

General Motors (10/03/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer , AI/…

Amazon (Cupertino, CA)

…integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more

Amazon (08/29/25)
- Save Job - Related Jobs - Block Source
ML Acceleration / Framework Engineer…

Amazon (Seattle, WA)

…breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely with hardware designers, software ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...solutions for Inferentia chips. Proficiency in deploying and optimizing ML models for inference using frameworks like… more

Amazon (07/15/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI/ ML…

Amazon (Cupertino, CA)

…of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 ... and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to… more

Amazon (09/21/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer , AI/…

Amazon (Cupertino, CA)

…learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron ... as stable diffusion, vision transformers and many more. The Inference Model Enablement team works side by side with...silicon and servers. Strong software development using Python and ML knowledge are both critical to this role. A… more

Amazon (09/13/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer - AI/…

Amazon (Cupertino, CA)

…and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more

Amazon (09/19/25)
- Save Job - Related Jobs - Block Source
Staff Engineer , Inference…

MongoDB (Palo Alto, CA)

…AI-powered applications. **About the Role** We're looking for a Staff Engineer to join our team building the inference platform for embedding ... building the infrastructure that enables real-time, high-scale, and low-latency inference - all deeply integrated into Atlas and optimized...into Atlas and optimized for developer experience. As a Staff Engineer , you'll be hands-on with design… more

MongoDB (08/27/25)
- Save Job - Related Jobs - Block Source
Staff Software Engineer , Generative…

Google (Kirkland, WA)

Staff Software Engineer , Generative AI Inference _corporate_fare_ Google _place_ Seattle, WA, USA; Kirkland, WA, USA **Advanced** Experience owning outcomes ... or cross-business projects. + 3 years of experience with AI/ ML inference stack. **About the job** Google's...leading cost-effective, simplified, and fastest platform for running GenAI inference workloads. As a Software Engineer , you… more

Google (10/01/25)
- Save Job - Related Jobs - Block Source
Staff Software Engineer , ML…

DoorDash (San Francisco, CA)

…Logistics, Fraud, and Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation ... where it makes sense - to accelerate innovation. As Staff Software Engineer , you'll pair deep technical...ML serving systems. + Bring deep familiarity with ML inference and serving ecosystems. + Know… more

DoorDash (08/26/25)
- Save Job - Related Jobs - Block Source
Staff AI/ ML Computer Vision…

TP-Link North America, Inc. (Irvine, CA)

We are seeking for a Staff AI/ ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... problem-solving and collaborative skills. Responsibilities + Architect and implement ML -based computer vision pipelines for real-time object detection, tracking, and… more

TP-Link North America, Inc. (08/22/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer (Data/…

Amazon (Bellevue, WA)

…data into intelligent, interconnected information at scale - Release and maintain ML model infrastructure to enable high-throughput, low-latency inference in ... etc, - Experience with building agent based on LLM, prompt engineering, and ML model inference optimization - Experience working alongside applied scientists… more

Amazon (09/13/25)
- Save Job - Related Jobs - Block Source
Sr. Staff ML Engineer…

Zscaler (San Jose, CA)

…speed and agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Machine Learning Engineer to join our Digital Experience team. This ... and integrate state-of-the-art GenAI advances (eg, LLMs/SLMs, retrieval, fine-tuning, inference optimization) to deliver reliable and cost-efficient production features… more

Zscaler (09/26/25)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - AI/…

Zoom (San Jose, CA)

…you can expect We are seeking a highly experienced Staff Software Engineer with deep expertise in AI/ ML , Generative AI, and full-stack application ... in software engineering, with at least 2 years leading complex AI/ ML or GenAI-driven applications. + Possess experience building full-stack applications and… more

Zoom (09/09/25)
- Save Job - Related Jobs - Block Source
ML Compiler Engineer I, Annapurna…

Amazon (Cupertino, CA)

…for advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through ... scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna...leap in performance. You: As a Machine Learning Compiler Engineer I on the AWS Neuron Compiler team, you… more

Amazon (09/06/25)
- Save Job - Related Jobs - Block Source
ML Kernel Performance Engineer , AWS…

Amazon (Cupertino, CA)

…seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more

Amazon (08/16/25)
- Save Job - Related Jobs - Block Source
Sr. ML Kernel Performance Engineer…

Amazon (Cupertino, CA)

…seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more

Amazon (08/15/25)
- Save Job - Related Jobs - Block Source
Sr Principal ML Engineer - eCommerce…

Autodesk (Juneau, AK)

…customer insights and faster decision-making. The Role We're looking for a Principal ( Staff ) Machine Learning Engineer to help shape the future of Autodesk's ... If you're passionate about solving ambiguous business problems and building scalable ML systems-even if you don't meet every qualification-we encourage you to apply.… more

Autodesk (09/26/25)
- Save Job - Related Jobs - Block Source
Sr ML Compiler Engineer , Annapurna…

Amazon (Cupertino, CA)

…tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more

Amazon (08/13/25)
- Save Job - Related Jobs - Block Source
Cleared - AI/ ML Modeler/ Engineer…

Noblis (Mclean, VA)

Responsibilities **Noblis is seeking a Cleared AI/ ML Modeler/ Engineer (All Levels) with IC experience and ACTIVE Top Secret with SCI and Polygraph in Bethesda, ... MD and McLean, VA** The AI/ ML Modeler/ Engineer will be responsible for and...accuracy degradation and implementing automated retraining pipelines. + Optimizing inference processes for scalability, latency, and resource efficiency while… more

Noblis (08/28/25)
- Save Job - Related Jobs - Block Source
Sr. Software Engineer - AI/ ML , AWS…

Amazon (Cupertino, CA)

…the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and… more

Amazon (07/18/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search