- General Motors (Warren, MI)
- …more) while maintaining reliability and cost efficiency. **About the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust ... **This job is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within… more
- Amazon (Cupertino, CA)
- …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
- Amazon (Seattle, WA)
- …breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely with hardware designers, software ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...solutions for Inferentia chips. Proficiency in deploying and optimizing ML models for inference using frameworks like… more
- Amazon (Cupertino, CA)
- …of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 ... and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to… more
- Amazon (Cupertino, CA)
- …learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron ... as stable diffusion, vision transformers and many more. The Inference Model Enablement team works side by side with...silicon and servers. Strong software development using Python and ML knowledge are both critical to this role. A… more
- Amazon (Cupertino, CA)
- …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more
- MongoDB (Palo Alto, CA)
- …AI-powered applications. **About the Role** We're looking for a Staff Engineer to join our team building the inference platform for embedding ... building the infrastructure that enables real-time, high-scale, and low-latency inference - all deeply integrated into Atlas and optimized...into Atlas and optimized for developer experience. As a Staff Engineer , you'll be hands-on with design… more
- Google (Kirkland, WA)
- Staff Software Engineer , Generative AI Inference _corporate_fare_ Google _place_ Seattle, WA, USA; Kirkland, WA, USA **Advanced** Experience owning outcomes ... or cross-business projects. + 3 years of experience with AI/ ML inference stack. **About the job** Google's...leading cost-effective, simplified, and fastest platform for running GenAI inference workloads. As a Software Engineer , you… more
- DoorDash (San Francisco, CA)
- …Logistics, Fraud, and Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation ... where it makes sense - to accelerate innovation. As Staff Software Engineer , you'll pair deep technical...ML serving systems. + Bring deep familiarity with ML inference and serving ecosystems. + Know… more
- TP-Link North America, Inc. (Irvine, CA)
- We are seeking for a Staff AI/ ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... problem-solving and collaborative skills. Responsibilities + Architect and implement ML -based computer vision pipelines for real-time object detection, tracking, and… more
- Amazon (Bellevue, WA)
- …data into intelligent, interconnected information at scale - Release and maintain ML model infrastructure to enable high-throughput, low-latency inference in ... etc, - Experience with building agent based on LLM, prompt engineering, and ML model inference optimization - Experience working alongside applied scientists… more
- Zscaler (San Jose, CA)
- …speed and agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Machine Learning Engineer to join our Digital Experience team. This ... and integrate state-of-the-art GenAI advances (eg, LLMs/SLMs, retrieval, fine-tuning, inference optimization) to deliver reliable and cost-efficient production features… more
- Zoom (San Jose, CA)
- …you can expect We are seeking a highly experienced Staff Software Engineer with deep expertise in AI/ ML , Generative AI, and full-stack application ... in software engineering, with at least 2 years leading complex AI/ ML or GenAI-driven applications. + Possess experience building full-stack applications and… more
- Amazon (Cupertino, CA)
- …for advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through ... scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna...leap in performance. You: As a Machine Learning Compiler Engineer I on the AWS Neuron Compiler team, you… more
- Amazon (Cupertino, CA)
- …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
- Amazon (Cupertino, CA)
- …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
- Autodesk (Juneau, AK)
- …customer insights and faster decision-making. The Role We're looking for a Principal ( Staff ) Machine Learning Engineer to help shape the future of Autodesk's ... If you're passionate about solving ambiguous business problems and building scalable ML systems-even if you don't meet every qualification-we encourage you to apply.… more
- Amazon (Cupertino, CA)
- …tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more
- Noblis (Mclean, VA)
- Responsibilities **Noblis is seeking a Cleared AI/ ML Modeler/ Engineer (All Levels) with IC experience and ACTIVE Top Secret with SCI and Polygraph in Bethesda, ... MD and McLean, VA** The AI/ ML Modeler/ Engineer will be responsible for and...accuracy degradation and implementing automated retraining pipelines. + Optimizing inference processes for scalability, latency, and resource efficiency while… more
- Amazon (Cupertino, CA)
- …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and… more