• Staff ML Engineer

    General Motors (Mountain View, CA)
    …more) while maintaining reliability and cost efficiency. **About the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust ... **This job is eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within… more
    General Motors (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer , AI/…

    Amazon (Cupertino, CA)
    …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
    Amazon (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer , AI/…

    Amazon (Cupertino, CA)
    …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
    Amazon (08/29/25)
    - Save Job - Related Jobs - Block Source
  • ML Acceleration / Framework Engineer

    Amazon (Cupertino, CA)
    …breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely with hardware designers, software ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...solutions for Inferentia chips. Proficiency in deploying and optimizing ML models for inference using frameworks like… more
    Amazon (07/15/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer AI/ ML

    Amazon (Cupertino, CA)
    …of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 ... and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to… more
    Amazon (09/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer , AI/…

    Amazon (Cupertino, CA)
    …learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron ... as stable diffusion, vision transformers and many more. The Inference Model Enablement team works side by side with...silicon and servers. Strong software development using Python and ML knowledge are both critical to this role. A… more
    Amazon (09/13/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer - AI/…

    Amazon (Cupertino, CA)
    …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more
    Amazon (09/19/25)
    - Save Job - Related Jobs - Block Source
  • Staff Engineer , Inference

    MongoDB (Palo Alto, CA)
    …AI-powered applications. **About the Role** We're looking for a Staff Engineer to join our team building the inference platform for embedding ... building the infrastructure that enables real-time, high-scale, and low-latency inference - all deeply integrated into Atlas and optimized...into Atlas and optimized for developer experience. As a Staff Engineer , you'll be hands-on with design… more
    MongoDB (08/27/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer , Cloud…

    Google (Sunnyvale, CA)
    Staff Software Engineer , Cloud ML Compute Services _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA; +2 more; +1 more **Advanced** ... on and is growing every day. As a software engineer , you will work on a specific project critical... infrastructure customers with large-scale, cloud-based access to Google's ML supercomputers to run training and inference more
    Google (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer (Data/…

    Amazon (Sunnyvale, CA)
    …data into intelligent, interconnected information at scale - Release and maintain ML model infrastructure to enable high-throughput, low-latency inference in ... etc, - Experience with building agent based on LLM, prompt engineering, and ML model inference optimization - Experience working alongside applied scientists… more
    Amazon (09/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Staff ML Engineer

    Zscaler (San Jose, CA)
    …speed and agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Machine Learning Engineer to join our Digital Experience team. This ... and integrate state-of-the-art GenAI advances (eg, LLMs/SLMs, retrieval, fine-tuning, inference optimization) to deliver reliable and cost-efficient production features… more
    Zscaler (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer - AI/…

    Zoom (San Jose, CA)
    …you can expect​ We are seeking a highly experienced Staff Software Engineer with deep expertise in AI/ ML , Generative AI, and full-stack application ... in software engineering, with at least 2 years leading complex AI/ ML or GenAI-driven applications. + Possess experience building full-stack applications and… more
    Zoom (09/09/25)
    - Save Job - Related Jobs - Block Source
  • ML Compiler Engineer I, Annapurna…

    Amazon (Cupertino, CA)
    …for advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through ... scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna...leap in performance. You: As a Machine Learning Compiler Engineer I on the AWS Neuron Compiler team, you… more
    Amazon (09/06/25)
    - Save Job - Related Jobs - Block Source
  • ML Kernel Performance Engineer , AWS…

    Amazon (Cupertino, CA)
    …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
    Amazon (08/16/25)
    - Save Job - Related Jobs - Block Source
  • Sr. ML Kernel Performance Engineer

    Amazon (Cupertino, CA)
    …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance… more
    Amazon (08/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr ML Compiler Engineer , Annapurna…

    Amazon (Cupertino, CA)
    …tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more
    Amazon (08/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer - AI/ ML , AWS…

    Amazon (Cupertino, CA)
    …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...will help lead the efforts building distributed training and inference support into Pytorch, Tensorflow, Jax using XLA and… more
    Amazon (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    …you want to build the future of the cloud for AI training and inference ? Want to do industry leading work delivering continuous price performance improvements in the ... operating AWS cloud offerings that enable high performance and scalability in AI/ ML and HPC workloads. AWS Infrastructure Services owns the design, planning,… more
    Amazon (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    …+ Knowledge of ML converters/compilers and run times, and hardware-accelerated ML inference techniques. + Understanding of Generative AI model architectures ... Senior Staff Software Engineer , On-Device Machine Learning...+ 7 years of experience leading technical project strategy, ML design, and working with industry-scale ML more
    Google (10/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff AI Engineer

    Palo Alto Networks (Santa Clara, CA)
    …create an environment where we all win with precision. **Your Career** As a Senior Staff AI Engineer for Enterprise AI Solutions, you will be a critical ... + Platform Development: Develop and implement core components of the enterprise AI/ ML platform, ensuring scalability and security. Contribute to the lifecycle of… more
    Palo Alto Networks (09/18/25)
    - Save Job - Related Jobs - Block Source