• Capital One Bank (Newport News, VA)
    …AWS) The Capital One machine learning platform organization manages our cloud-based enterprise ML + AI system providing users with development tools and runtime ... environments necessary to build and run machine learning and AI systems with large scale real-time and batch processing...support our diverse offerings ranging from developer notebooks to model training to model inference and feature… more
    Talent (10/02/25)
  • AI / ML Model Runtime

    Broadcom (Palo Alto, CA)
    …team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more
    Broadcom (07/25/25)
  • Sr. Product Manager - Runtime Infra,…

    Amazon (Cupertino, CA)
    …an experienced Technical Product Manager to define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the AWS ... ML performance in the cloud. You will lead runtime and infrastructure requirements working backward from customer needs,...to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the… more
    Amazon (09/02/25)
  • Technical Lead, ML Frameworks,…

    Google (Mountain View, CA)
    Technical Lead, ML Frameworks, Runtime, Devices, Numerical Acceleration; Google; Mountain View, CA, USA; **Advanced**. Experience owning ... project strategy, ML design, and optimizing industry-scale ML infrastructure (e.g., model deployment, model...into priorities and projects for the broader group. The ML, Systems, & Cloud AI (MSCA) organization… more
    Google (10/04/25)
  • Machine Learning Engineer, ML

    pony.ai (Fremont, CA)
    …evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more
    pony.ai (08/02/25)
  • Senior Software Development Engineer, AI

    Amazon (Cupertino, CA)
    …machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, ... ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that...of the most interesting and impactful infrastructure challenges in AI / ML today. Basic Qualifications - Bachelor's degree… more
    Amazon (08/29/25)
  • Senior Software Development Engineer, AI

    Amazon (Cupertino, CA)
    …use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for ... and performance tuning of a wide variety of LLM model families, including massive scale large language models like...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
    Amazon (09/13/25)
  • Software Engineer II - AI / ML , AWS…

    Amazon (Seattle, WA)
    …for development, enablement and performance tuning of a wide variety of ML model families, including state-of-the-art GEN-AI models and massive scale large ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...debug and resolve accuracy issues arising from migration of model to AI accelerators. The team develops… more
    Amazon (09/13/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Seattle, WA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... to and experience with Amazon's growing suite of generative AI services and other cutting-edge cloud computing offerings across...a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is… more
    Amazon (08/24/25)
  • Sr. AI / ML Computer Vision Engineer

    TP-Link North America, Inc. (Irvine, CA)
    …enable consumers to enjoy a seamless, effortless lifestyle. We are seeking a Senior AI / ML Computer Vision Engineer to drive the development and deployment of ... AI-powered features across our smart home automation product lines,...video processing with hands-on experience in deploying and optimizing ML models on constrained edge devices. Responsibilities + Lead… more
    TP-Link North America, Inc. (09/19/25)
  • Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe, are...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (09/05/25)
  • Staff AI / ML Computer Vision…

    TP-Link North America, Inc. (Irvine, CA)
    We are seeking a Staff AI / ML Computer Vision Engineer to design and develop cutting-edge AI-powered features for our next-generation smart home ... project execution, mentoring engineers, setting standards for deploying efficient, real-time AI at the edge, and ensuring seamless integration with cloud… more
    TP-Link North America, Inc. (08/22/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Seattle, WA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (09/05/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (07/18/25)
  • Software Development Engineer - AI

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, ... for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference… more
    Amazon (09/19/25)
  • Software Engineer- AI / ML , AWS…

    Amazon (Seattle, WA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like GPT2, ... for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed… more
    Amazon (08/08/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Cupertino, CA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such ... Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
    Amazon (10/01/25)
  • Sr. Software Engineer- AI / ML , AWS…

    Amazon (Seattle, WA)
    …responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such ... Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with...side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training… more
    Amazon (10/01/25)
  • Senior ML Research Engineer - LLM…

    Microsoft Corporation (Mountain View, CA)
    …Bing, MSN, Office 365, OneDrive, Skype, Teams and Xbox Live. We are looking for a **Senior ML Research Engineer - LLM Quantization & Model Optimization** to join our ... with companywide AI teams. + Work cross-functionally with data scientists and ML researchers/engineers to align on model accuracy and performance goals. +… more
    Microsoft Corporation (09/27/25)
  • ML Acceleration / Framework Engineer…

    Amazon (Seattle, WA)
    …to this and extending all of this for the Neuron based system is key. - ML Frameworks partners with compiler, runtime, and research experts to make AWS Trainium ... side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training...with vLLM, Triton, and TensorRT, turning breakthrough ideas into production-ready AI for millions of customers. - The ML… more
    Amazon (07/15/25)