• Staff ML Engineer

    General Motors (Warren, MI)
    …more) while maintaining reliability and cost efficiency. **About the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust ... **This job is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within… more
    General Motors (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer , AI/…

    Amazon (Cupertino, CA)
    …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
    Amazon (08/29/25)
    - Save Job - Related Jobs - Block Source
  • ML Acceleration / Framework Engineer

    Amazon (Seattle, WA)
    …breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely with hardware designers, software ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...solutions for Inferentia chips. Proficiency in deploying and optimizing ML models for inference using frameworks like… more
    Amazon (07/15/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer AI/ ML

    Amazon (Cupertino, CA)
    …of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 ... and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to… more
    Amazon (09/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer , AI/…

    Amazon (Cupertino, CA)
    …learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron ... as stable diffusion, vision transformers and many more. The Inference Model Enablement team works side by side with...silicon and servers. Strong software development using Python and ML knowledge are both critical to this role. A… more
    Amazon (09/13/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer - AI/…

    Amazon (Cupertino, CA)
    …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more
    Amazon (09/19/25)
    - Save Job - Related Jobs - Block Source
  • Staff Engineer , Inference

    MongoDB (Palo Alto, CA)
    …AI-powered applications. **About the Role** We're looking for a Staff Engineer to join our team building the inference platform for embedding ... building the infrastructure that enables real-time, high-scale, and low-latency inference - all deeply integrated into Atlas and optimized...into Atlas and optimized for developer experience. As a Staff Engineer , you'll be hands-on with design… more
    MongoDB (08/27/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer , Generative…

    Google (Kirkland, WA)
    Staff Software Engineer , Generative AI Inference _corporate_fare_ Google _place_ Seattle, WA, USA; Kirkland, WA, USA **Advanced** Experience owning outcomes ... or cross-business projects. + 3 years of experience with AI/ ML inference stack. **About the job** Google's...leading cost-effective, simplified, and fastest platform for running GenAI inference workloads. As a Software Engineer , you… more
    Google (10/01/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer , ML

    DoorDash (San Francisco, CA)
    …Logistics, Fraud, and Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation ... where it makes sense - to accelerate innovation. As Staff Software Engineer , you'll pair deep technical...ML serving systems. + Bring deep familiarity with ML inference and serving ecosystems. + Know… more
    DoorDash (08/26/25)
    - Save Job - Related Jobs - Block Source
  • Staff AI/ ML Computer Vision…

    TP-Link North America, Inc. (Irvine, CA)
    We are seeking for a Staff AI/ ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... problem-solving and collaborative skills. Responsibilities + Architect and implement ML -based computer vision pipelines for real-time object detection, tracking, and… more
    TP-Link North America, Inc. (08/22/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer (Data/…

    Amazon (Bellevue, WA)
    …data into intelligent, interconnected information at scale - Release and maintain ML model infrastructure to enable high-throughput, low-latency inference in ... etc, - Experience with building agent based on LLM, prompt engineering, and ML model inference optimization - Experience working alongside applied scientists… more
    Amazon (09/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Staff ML Engineer

    Zscaler (San Jose, CA)
    …speed and agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Machine Learning Engineer to join our Digital Experience team. This ... and integrate state-of-the-art GenAI advances (eg, LLMs/SLMs, retrieval, fine-tuning, inference optimization) to deliver reliable and cost-efficient production features… more
    Zscaler (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer - AI/…

    Zoom (San Jose, CA)
    …you can expect​ We are seeking a highly experienced Staff Software Engineer with deep expertise in AI/ ML , Generative AI, and full-stack application ... in software engineering, with at least 2 years leading complex AI/ ML or GenAI-driven applications. + Possess experience building full-stack applications and… more
    Zoom (09/09/25)
    - Save Job - Related Jobs - Block Source
  • ML Compiler Engineer I, Annapurna…

    Amazon (Cupertino, CA)
    …for advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through ... scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna...leap in performance. You: As a Machine Learning Compiler Engineer I on the AWS Neuron Compiler team, you… more
    Amazon (09/06/25)
    - Save Job - Related Jobs - Block Source
  • ML Kernel Performance Engineer , AWS…

    Amazon (Cupertino, CA)
    …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
    Amazon (08/16/25)
    - Save Job - Related Jobs - Block Source
  • Sr. ML Kernel Performance Engineer

    Amazon (Cupertino, CA)
    …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
    Amazon (08/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr Principal ML Engineer - eCommerce…

    Autodesk (Juneau, AK)
    …customer insights and faster decision-making. The Role We're looking for a Principal ( Staff ) Machine Learning Engineer to help shape the future of Autodesk's ... If you're passionate about solving ambiguous business problems and building scalable ML systems-even if you don't meet every qualification-we encourage you to apply.… more
    Autodesk (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Sr ML Compiler Engineer , Annapurna…

    Amazon (Cupertino, CA)
    …tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more
    Amazon (08/13/25)
    - Save Job - Related Jobs - Block Source
  • Cleared - AI/ ML Modeler/ Engineer

    Noblis (Mclean, VA)
    Responsibilities **Noblis is seeking a Cleared AI/ ML Modeler/ Engineer (All Levels) with IC experience and ACTIVE Top Secret with SCI and Polygraph in Bethesda, ... MD and McLean, VA** The AI/ ML Modeler/ Engineer will be responsible for and...accuracy degradation and implementing automated retraining pipelines. + Optimizing inference processes for scalability, latency, and resource efficiency while… more
    Noblis (08/28/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer - AI/ ML , AWS…

    Amazon (Cupertino, CA)
    …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and… more
    Amazon (07/18/25)
    - Save Job - Related Jobs - Block Source