Senior Ml Inference Engineer Jobs

147 jobs (page 1)

Categories

All Categories

Engineering (61)

Software/IT (16)

Management (5)

Senior Software Development Engineer…

Amazon (Seattle, WA)

…integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more

Amazon (01/06/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer…

Amazon (Seattle, WA)

…integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more

Amazon (12/26/25)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer , AWS Neuron…

Amazon (Seattle, WA)

…the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enables and performance tunes building blocks for all key ML model families, including Llama3, GPT OSS, Qwen3, DeepSeek...Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the… more

Amazon (12/24/25)
- Save Job - Related Jobs - Block Source
Software Engineer -AI/ ML , AWS…

Amazon (Seattle, WA)

…cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more

Amazon (12/21/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer…

MongoDB (Palo Alto, CA)

**About the Role** We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic ... with Atlas and designed for developer-first experiences. As a Senior Engineer , you'll focus on building core...a cloud-native environment + Work across product, infrastructure, and ML teams to ensure the inference platform… more

MongoDB (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , AI…

NVIDIA (CA)

…how you can make a lasting impact on the world. We are now looking for a Senior System Software Engineer to work on user facing tools for Dynamo Inference ... of modern ML architectures with a keen intuition for optimizing inference performance. + Take full ownership of problems end-to-end, proactively acquiring any… more

NVIDIA (11/29/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , AI…

NVIDIA (Santa Clara, CA)

…systems, deep learning theories. + Knowledgeable and passionate about performance engineering in ML frameworks (eg, PyTorch) and inference engines (eg, vLLM and ... motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency....that pushes the pareto frontier for the field of ML Systems; survey recent publications and find a way… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Principal Machine Learning…

Red Hat (Boston, MA)

…bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) infrastructure in the LLM-D… more

Red Hat (01/08/26)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Technical Marketing Engineer…

NVIDIA (Santa Clara, CA)

…full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product ... ensure a consistent, high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers and partners understand how to best… more

NVIDIA (11/06/25)
- Save Job - Related Jobs - Block Source
Senior System Software Engineer…

NVIDIA (CA)

We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference Server! NVIDIA is hiring software engineers for its ... role, you will develop open source software to serve inference of trained AI models running on GPUs. You...design. + Experience with high scale distributed systems and ML systems Ways to stand out from the crowd:… more

NVIDIA (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer - vLLM…

Red Hat (Boston, MA)

…for enterprises to build, optimize, and scale LLM deployments. We are seeking an experienced Senior ML Ops engineer to work closely with our product and ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...scale SOTA deep learning products and software. As an ML Ops engineer , you will work closely… more

Red Hat (12/06/25)
- Save Job - Related Jobs - Block Source
Senior Engineer -AI Inference

Bank of America (Addison, TX)

Senior Engineer -AI Inference Addison, Texas;Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/ Senior - Engineer -AI- Inference \_25029879) **Job Description:** At Bank… more

Bank of America (12/22/25)
- Save Job - Related Jobs - Block Source
Senior Principal Machine Learning…

Red Hat (Boston, MA)

…bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... (https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as-ai-leads-typescript-to-1/#the-top-open-source-projects-by-contributors) on Github. As a Machine Learning Engineer focused on vLLM, you will be… more

Red Hat (01/08/26)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer , AWS Neuron…

Amazon (Seattle, WA)

…and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more

Amazon (12/13/25)
- Save Job - Related Jobs - Block Source
Senior AI/ ML Engineer

General Motors (Austin, TX)

**Job Description** ** Senior AI/ ML Engineer , AV ML Infra** We're General Motors (GM), a company driving the future of mobility with advanced self-driving ... performance by running large-scale simulation workloads and managing reliable ML inference pipelines. + ** ML ...expedite our path to commercialization. **Position Overview:** As a ** Senior AI/ ML Engineer ** , you… more

General Motors (10/31/25)
- Save Job - Related Jobs - Block Source
Senior AI/ ML Engineer

General Motors (Mountain View, CA)

…more connected, shaping the future of transportation on a global scale. **Role:** As a Senior AI/ ML Engineer within the Onboard Embodied AI organization, you ... will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions...ML models, delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. + Lead and… more

General Motors (11/04/25)
- Save Job - Related Jobs - Block Source
Matterport - Senior ML Ops…

CoStar Realty Information, Inc. (Sunnyvale, CA)

Matterport - Senior ML Ops Engineer Job Description CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate ... documentation, appraisal, and marketing. About the Role: As a Senior MLOps Engineer at Matterport, a part...efficient models into production. You will work closely with ML R&D Engineers and other engineering teams to analyze… more

CoStar Realty Information, Inc. (11/26/25)
- Save Job - Related Jobs - Block Source
Senior AI/ ML Tooling…

General Motors (Sunnyvale, CA)

**Job Description** ** Senior AI/ ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize ... distillation, training, and inference of ML models. You will develop and enhance GM's internal ML tooling for high performance software by leveraging state… more

General Motors (11/04/25)
- Save Job - Related Jobs - Block Source
Sr. Software Engineer - AI/ ML , AWS…

Amazon (Seattle, WA)

…stack powering AWS's next-generation AI accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll ... AI models at unprecedented scale. What You'll Impact: * Pioneer distributed inference solutions for industry-leading LLMs such as GPT, Llama, Qwen * Optimize… more

Amazon (10/31/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search