• Machine Learning Engineer

    Red Hat (Raleigh, NC)
    …provides a stable platform for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the ... LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings...solving challenging technical problems at the forefront of deep learning in the open source way, this is the… more
    Red Hat (12/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Machine Learning

    Red Hat (Boston, MA)
    …on Github. As a Machine Learning ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...challenges in model performance and efficiency. Your work with machine learning and high performance computing will… more
    Red Hat (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Machine Learning

    Red Hat (Boston, MA)
    …for enterprises to build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...and guide fellow engineers, fostering a culture of continuous learning and innovation. **What you will bring** + Strong… more
    Red Hat (01/08/26)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer

    Amazon (Seattle, WA)
    …and the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the... Technology team works side by side with the Inference Model Enablement, compiler runtime engineers to create, build… more
    Amazon (12/24/25)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer

    Amazon (Seattle, WA)
    …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. ... compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trn1. Experience optimizing inference performance for… more
    Amazon (12/13/25)
    - Save Job - Related Jobs - Block Source
  • Principal Machine Learning Platform…

    Palo Alto Networks (Santa Clara, CA)
    …while ensuring a formidable security posture from development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as ... and long-term strategy of our AI platform - ML inference . Beyond individual contribution, you will lead complex technical...a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. + Expert-level… more
    Palo Alto Networks (11/18/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer -AI/ML, AWS Neuron…

    Amazon (Seattle, WA)
    … accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development ... at least one software programming language - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work… more
    Amazon (12/21/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer - AI/ML, AWS…

    Amazon (Seattle, WA)
    …scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more
    Amazon (12/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer

    Amazon (Seattle, WA)
    …scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with ... learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more
    Amazon (01/06/26)
    - Save Job - Related Jobs - Block Source
  • Senior GenAI Algorithms Engineer - Model…

    NVIDIA (Santa Clara, CA)
    …out from the crowd: + Contributions to PyTorch, JAX, vLLM, SGLang, or other machine learning training and inference frameworks. + Hands-on experience ... strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Staff ML Engineer , Inference

    General Motors (Sunnyvale, CA)
    …use cases. Our platform supports the serving of state-of-the-art (SOTA) machine learning models for experimental and bulk inference , with a focus on ... eligible for relocation assistance.** **About the Team:** The ML Inference Platform is part of the AI Compute Platforms...+ 8+ years of industry experience, with focus on machine learning systems or high performance backend… more
    General Motors (10/21/25)
    - Save Job - Related Jobs - Block Source
  • AI Inference Engineer

    quadric.io, Inc (Burlingame, CA)
    …network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and ... conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM models and Quadric unique… more
    quadric.io, Inc (11/25/25)
    - Save Job - Related Jobs - Block Source
  • Lead AI Engineer (FM Hosting, LLM…

    Capital One (San Francisco, CA)
    …for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in ... Lead AI Engineer (FM Hosting, LLM Inference ) **Overview**...world-class talent - along with our deep experience in machine learning - position us to be… more
    Capital One (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer AI/ML,…

    Amazon (Cupertino, CA)
    …is the software stack powering AWS Inferentia and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. ... The Neuron Serving team develops infrastructure to serve modern machine learning models-including large language models (LLMs) and multimodal workloads-reliably… more
    Amazon (12/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Inference

    NVIDIA (Durham, NC)
    We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer who ... to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does...years or relevant experience + Strong mathematical foundation in machine learning and deep learning more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - vLLM…

    Red Hat (Boston, MA)
    …you will do + Collaborate with research and product development teams to scale machine learning products for internal and external applications + Create and ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...learning products and software. As an ML Ops engineer , you will work closely with our technical and… more
    Red Hat (12/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Machine

    Google (Sunnyvale, CA)
    Senior Software Engineer , Machine Learning , Kernel...+ 3 years of experience in software development for machine learning model inference or ... training, and 1 year of experience with ML model inference and training optimization on modern GPU/TPU architectures. **Preferred...Cloud is searching for a highly skilled and motivated engineer to optimize machine learning more
    Google (12/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior, Software Engineer ( Machine

    Walmart (Sunnyvale, CA)
    **Position Summary ** **What you'll do ** As a **Senior Software Engineer - Machine Learning ** , you are a technical leader working at the intersection of ... machine learning and software engineering. You have...scalable model serving platforms for both batch and real-time inference + Build model deployment pipelines with automated testing… more
    Walmart (01/09/26)
    - Save Job - Related Jobs - Block Source
  • Senior AI Machine Learning Research…

    CACI International (Florham Park, NJ)
    Senior AI Machine Learning Research Engineer Job Category: Engineering Time Type: Full time Minimum Clearance Required to Start: None Employee Type: Regular ... in Florham Park, NJ for a Senior AI and Machine Learning Research Engineer . Apply...and experience with 3D edge object identification, tracking, and inference . + Experience with computer vision frameworks, software &… more
    CACI International (11/12/25)
    - Save Job - Related Jobs - Block Source
  • Artificial Intelligence & Machine

    Honeywell (San Jose, CA)
    We're seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect, design, and develop advanced AI/ML systems that ... with cross-functional teams to deliver intelligent, scalable, and production-ready AI and machine learning technologies. You will be responsible for researching,… more
    Honeywell (01/07/26)
    - Save Job - Related Jobs - Block Source