- Amazon (Seattle, WA)
- …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
- Amazon (Seattle, WA)
- …integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and ... learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
- Amazon (Seattle, WA)
- …the Trn2 and future Trn3 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This ... enables and performance tunes building blocks for all key ML model families, including Llama3, GPT OSS, Qwen3, DeepSeek...Llama3, GPT OSS, Qwen3, DeepSeek and beyond. The Neuron Inference Technology team works side by side with the… more
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more
- MongoDB (Palo Alto, CA)
- **About the Role** We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic ... with Atlas and designed for developer-first experiences. As a Senior Engineer , you'll focus on building core...a cloud-native environment + Work across product, infrastructure, and ML teams to ensure the inference platform… more
- NVIDIA (CA)
- …how you can make a lasting impact on the world. We are now looking for a Senior System Software Engineer to work on user facing tools for Dynamo Inference ... of modern ML architectures with a keen intuition for optimizing inference performance. + Take full ownership of problems end-to-end, proactively acquiring any… more
- NVIDIA (Santa Clara, CA)
- …systems, deep learning theories. + Knowledgeable and passionate about performance engineering in ML frameworks (eg, PyTorch) and inference engines (eg, vLLM and ... motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency....that pushes the pareto frontier for the field of ML Systems; survey recent publications and find a way… more
- Red Hat (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... build, optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/vllm-project/) infrastructure in the LLM-D… more
- NVIDIA (Santa Clara, CA)
- …streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more
- NVIDIA (Santa Clara, CA)
- …full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product ... ensure a consistent, high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers and partners understand how to best… more
- NVIDIA (CA)
- We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference Server! NVIDIA is hiring software engineers for its ... role, you will develop open source software to serve inference of trained AI models running on GPUs. You...design. + Experience with high scale distributed systems and ML systems Ways to stand out from the crowd:… more
- Red Hat (Boston, MA)
- …for enterprises to build, optimize, and scale LLM deployments. We are seeking an experienced Senior ML Ops engineer to work closely with our product and ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...scale SOTA deep learning products and software. As an ML Ops engineer , you will work closely… more
- Bank of America (Addison, TX)
- Senior Engineer -AI Inference Addison, Texas;Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/ Senior - Engineer -AI- Inference \_25029879) **Job Description:** At Bank… more
- Red Hat (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... (https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as-ai-leads-typescript-to-1/#the-top-open-source-projects-by-contributors) on Github. As a Machine Learning Engineer focused on vLLM, you will be… more
- Amazon (Seattle, WA)
- …and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications ( ML Apps) team for AWS Neuron. This role ... for development, enablement and performance tuning of a wide variety of ML model families, including massive scale large language models like Llama2, GPT2,… more
- General Motors (Austin, TX)
- **Job Description** ** Senior AI/ ML Engineer , AV ML Infra** We're General Motors (GM), a company driving the future of mobility with advanced self-driving ... performance by running large-scale simulation workloads and managing reliable ML inference pipelines. + ** ML ...expedite our path to commercialization. **Position Overview:** As a ** Senior AI/ ML Engineer ** , you… more
- General Motors (Mountain View, CA)
- …more connected, shaping the future of transportation on a global scale. **Role:** As a Senior AI/ ML Engineer within the Onboard Embodied AI organization, you ... will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions...ML models, delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. + Lead and… more
- CoStar Realty Information, Inc. (Sunnyvale, CA)
- Matterport - Senior ML Ops Engineer Job Description CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate ... documentation, appraisal, and marketing. About the Role: As a Senior MLOps Engineer at Matterport, a part...efficient models into production. You will work closely with ML R&D Engineers and other engineering teams to analyze… more
- General Motors (Sunnyvale, CA)
- **Job Description** ** Senior AI/ ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize ... distillation, training, and inference of ML models. You will develop and enhance GM's internal ML tooling for high performance software by leveraging state… more
- Amazon (Seattle, WA)
- …stack powering AWS's next-generation AI accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll ... AI models at unprecedented scale. What You'll Impact: * Pioneer distributed inference solutions for industry-leading LLMs such as GPT, Llama, Qwen * Optimize… more