Ai Research Engineer Vllm Jobs | Juju

Senior MLOps Engineer , vLLM…

Red Hat (Boston, MA)

…seeking an experienced ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...AI ! **What you will do** + Collaborate with research and product development teams to scale machine learning… more

Red Hat (09/19/25)
- Save Job - Related Jobs - Block Source
Intern 2026: AI Inference Optimization…

IBM (San Jose, CA)

…thrive. **Your role and responsibilities** As a software engineer with IBM Research , you'll bridge the gap between groundbreaking AI research and ... **Introduction** IBM Research takes responsibility for technology and its role...and contributing to leading open-source libraries and frameworks in AI , such as PyTorch, TensorFlow, vLLM , and… more

IBM (09/22/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI…

Amazon (Cupertino, CA)

…and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a ... particular focus on large-scale generative AI applications. Key job responsibilities * Architect and lead...workloads * Lead integration efforts with frameworks such as vLLM , SGLang, Torch XLA, TensorRT, and Triton * Develop… more

Amazon (09/21/25)
- Save Job - Related Jobs - Block Source
AI Senior Staff Systems Engineer

Cadence Design Systems, Inc. (San Jose, CA)

…an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You will be responsible for the entire lifecycle of our AI… more

Cadence Design Systems, Inc. (09/30/25)
- Save Job - Related Jobs - Block Source
Senior Engineer - AI Inference

Bank of America (Addison, TX)

Senior Engineer - AI Inference Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you must be at ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI -Inference\_25029879) **Job Description:** At Bank of America,… more

Bank of America (09/11/25)
- Save Job - Related Jobs - Block Source
Software Engineer III -Gen AI…

Bank of America (Kennesaw, GA)

Software Engineer III -Gen AI Inferencing Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Software- Engineer -III--Gen- AI -Inferencing-\_25032986) **Job Description:** At Bank of America,… more

Bank of America (09/11/25)
- Save Job - Related Jobs - Block Source
Senior AI System Engineer

NVIDIA (Santa Clara, CA)

…capabilities of artificial intelligence. We are seeking an ambitious and forward-thinking AI /ML System Performance Engineer to contribute to the development of ... communication overlap), and hardware-level enhancements. As NVIDIA makes significant strides in AI datacenters, our team holds a central role in maximizing the… more

NVIDIA (08/29/25)
- Save Job - Related Jobs - Block Source
Senior AI Software Engineer , GenAI…

NVIDIA (Santa Clara, CA)

…Performance tuning and optimizations of deep learning framework and software components. + Research , prototype, and develop robust and scalable AI tools and ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron...PyTorch, JAX), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design,… more

NVIDIA (09/23/25)
- Save Job - Related Jobs - Block Source
Software engineer - AI /ML, AWS…

Amazon (Seattle, WA)

…cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... so on. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from… more

Amazon (09/09/25)
- Save Job - Related Jobs - Block Source
ML Acceleration / Framework Engineer…

Amazon (Seattle, WA)

…with vLLM , Triton, and TensorRT-turning breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...key. - ML Frameworks partners with compiler, runtime, and research experts to make AWS Trainium and Inferentia feel… more

Amazon (07/15/25)
- Save Job - Related Jobs - Block Source
Sr. Research Engineer , Machine…

Amazon (New York, NY)

…Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior Research Engineer with a strong hands-on machine learning background, to lead ... Large Language Models (LLMs) and Generative Artificial Intelligence (Gen AI ). You will have significant influence on our overall...upon industry leading frameworks (NeMo, Megatron Core, PyTorch, Jax, vLLM , TRT, etc) - Work with other team members… more

Amazon (09/23/25)
- Save Job - Related Jobs - Block Source
Senior DL Algorithms Engineer - Cosmos

NVIDIA (Santa Clara, CA)

…across diverse GPU platforms, particularly for physical AI and generative AI applications. You will collaborate with research scientists, software engineers, ... We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning...and hardware specialists to bring cutting-edge AI models from prototype to production. What you will… more

NVIDIA (08/08/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in… more

NVIDIA (09/23/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in ... NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such… more

NVIDIA (09/18/25)
- Save Job - Related Jobs - Block Source
Staff ML Engineer , Inference Platform

General Motors (Warren, MI)

…**About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, ... reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve as the ...the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms… more

General Motors (10/03/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...deep learning, enabling breakthroughs in areas like LLM, Generative AI , Recommenders and Vision that have put DL into… more

NVIDIA (07/29/25)
- Save Job - Related Jobs - Block Source
Senior Principal Software Engineer

Red Hat (Boston, MA)

…Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will work with a ... diverse team of highly motivated engineers on designing and implementing AI /ML workflows and solutions and integrating Partners solutions. You will also be working… more

Red Hat (09/19/25)
- Save Job - Related Jobs - Block Source
Lead Engineer , Inference Platform

MongoDB (Palo Alto, CA)

…the world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge research into production at scale + Solve ... applications by helping them modernize legacy workloads, embrace innovation, and unleash AI . Our industry-leading developer data platform, MongoDB Atlas, is the only… more

MongoDB (09/27/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer , TensorRT-LLM

NVIDIA (Santa Clara, CA)

…design to keep pace + Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams What we need to ... We are now looking for a Principal Software Engineer , TensorRT-LLM ! NVIDIA is hiring experienced principal...world are using GPUs to power a revolution in AI , enabling breakthroughs in areas like content creation, code… more

NVIDIA (09/17/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Engineer , LLM…

NVIDIA (Santa Clara, CA)

…and finetuning with mixed precision recipes on next-gen NVIDIA GPU architectures. + Research , prototype, and develop robust and scalable AI tools and pipelines. ... highly optimized solutions. What you'll be doing: + Develop algorithms for AI /DL, data analytics, machine learning, or scientific computing + Contribute and advance… more

NVIDIA (08/21/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search