• Senior Software Development Engineer

    Amazon (Cupertino, CA)
    …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more
    Amazon (01/06/26)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer

    Amazon (Cupertino, CA)
    …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and… more
    Amazon (12/10/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer AI/ML,…

    Amazon (Cupertino, CA)
    …and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to ... and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a… more
    Amazon (12/21/25)
    - Save Job - Related Jobs - Block Source
  • AGI Inference Software Development…

    Amazon (Sunnyvale, CA)
    Description The Sensory Inference team at AGI is a group of innovative developers working on groundbreaking multi-modal inference solutions that revolutionize ... interact with the world. We push the limits of inference performance to provide the best possible experience for...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
    Amazon (12/12/25)
    - Save Job - Related Jobs - Block Source
  • Sr Software Dev Engineer , Machine…

    Amazon (Palo Alto, CA)
    …the future of advertising. Key job responsibilities As a Software Development Engineer in Machine Learning, you will: * Enhance the scalability, automation, and ... efficiency of large-scale training and real-time inference systems. * Pioneer the development of LLM ...for action. We are looking for a talented Software Engineer with a strong background in machine learning engineering… more
    Amazon (11/04/25)
    - Save Job - Related Jobs - Block Source
  • AI Senior Staff Systems Engineer

    Cadence Design Systems, Inc. (San Jose, CA)
    …world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... clusters, storage solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration:… more
    Cadence Design Systems, Inc. (12/29/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer (ML), AGI…

    Amazon (Sunnyvale, CA)
    …(AGI) team is looking for a passionate, talented, and inventive Sr. ML Engineer with a strong machine learning background, to build customization capabilities such ... as fine tuning and distillation. As a Sr. ML engineer with the AGI team, you will be responsible...responsible for leading the development of novel LLM training, inference techniques and optimizations to advance the state of… more
    Amazon (12/03/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer , Neuron…

    Amazon (Cupertino, CA)
    …and the Trn1 and Inf1 servers that use them. As the Software Development Engineer for the Neuron Foundation Tools Team, you will be responsible for working alongside ... development life cycle of the Neuron Profiler/Tools toolchain, ensuring scalability, reliability , and usability. You will collaborate with cross-functional teams to… more
    Amazon (11/19/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer - Circuit…

    Cadence Design Systems, Inc. (San Jose, CA)
    …innovative results in real-world settings. Core Expertise + Statistical inference : significance testing (p-values, confidence intervals), Bayesian statistics, design ... Carlo methods (random sampling, density estimation). + Rare-event and reliability analysis (a plus): importance sampling, subset simulation, cross-entropy methods,… more
    Cadence Design Systems, Inc. (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer

    Amazon (Cupertino, CA)
    …Web Services (AWS) is building a central pipeline of Software Development Engineer (SDE) talent for anticipated roles in 2026. This requisition supports hiring ... and AWS CloudFront. Key job responsibilities As an AWS Software Development Engineer , you will: - Design, develop, and maintain efficient, reusable, and reliable… more
    Amazon (12/20/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    …compiler teams. * Collect requirements from various other teams including training, inference and runtime. * Collaborate with the compiler performance team to ensure ... Trainium and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the cloud to our AWS… more
    Amazon (01/09/26)
    - Save Job - Related Jobs - Block Source
  • Sr. System Development Engineer

    Amazon (Cupertino, CA)
    …the foundation of the world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale. Here, you'll design, ... complex problems. You will decompose big difficult server system testability, reliability and diagnosis problems into straightforward tasks, components or features… more
    Amazon (10/25/25)
    - Save Job - Related Jobs - Block Source
  • ASIC Design Engineer , Cloud-Scale Machine…

    Amazon (Cupertino, CA)
    …data centers including AWS Inferentia, our custom designed machine learning inference datacenter server. Our success depends on our world-class server ... making the right trade-offs. Key job responsibilities As an ASIC Design Engineer , you will: * Develop and implement high-performance, area and power-efficient RTL… more
    Amazon (12/18/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Machine Learning - Compiler Engineer

    Amazon (Cupertino, CA)
    …building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium delivers the best-in-class ... quantum leap in performance. As a Machine Learning Compiler Engineer II in the AWS Neuron Compiler team, you...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
    Amazon (12/17/25)
    - Save Job - Related Jobs - Block Source
  • Sr ML Compiler Engineer , Annapurna Labs

    Amazon (Cupertino, CA)
    …for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
    Amazon (11/12/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Machine Learning - Compiler Engineer

    Amazon (Cupertino, CA)
    …for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will...5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
    Amazon (10/29/25)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer PhD (Full Time)…

    Cisco (Milpitas, CA)
    …distillation, and generative adversarial networks (GANs). Performance, scalability, and reliability are front and center as models are trained, fine-tuned, ... academic or professional projects. **Preferred Qualifications** + Experience working with inference engines (eg, vLLM, Triton, TorchServe). + Knowledge of GPU… more
    Cisco (12/20/25)
    - Save Job - Related Jobs - Block Source
  • AI Machine Learning Engineer II (Intern)…

    Cisco (San Jose, CA)
    …distillation, and generative adversarial networks (GANs). Performance, scalability, and reliability are front and center as models are trained, fine-tuned, ... or academic/research project documentation. **Preferred Qualifications** + Experience with inference engines such as vLLM, Triton, or TorchServe. + Knowledge… more
    Cisco (12/01/25)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer , Model…

    Amazon (Mountain View, CA)
    …experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Bachelor's degree ... learning and/or machine learning methods (eg for training, fine tuning, and inference ) - Hands-on experience with generative AI technology Preferred Qualifications -… more
    Amazon (12/20/25)
    - Save Job - Related Jobs - Block Source
  • ML Kernel Performance Engineer , AWS…

    Amazon (Cupertino, CA)
    …seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron Compiler ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
    Amazon (11/15/25)
    - Save Job - Related Jobs - Block Source