- Red Hat (Boston, MA)
- …seeking an experienced ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...AI! **What you will do** + Collaborate with research and product development teams to scale machine learning…
- IBM (San Jose, CA)
- …thrive. **Your role and responsibilities** As a software engineer with IBM Research, you'll bridge the gap between groundbreaking AI research and ... **Introduction** IBM Research takes responsibility for technology and its role...and contributing to leading open-source libraries and frameworks in AI, such as PyTorch, TensorFlow, vLLM, and…
- Amazon (Cupertino, CA)
- …and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a ... particular focus on large-scale generative AI applications. Key job responsibilities * Architect and lead...workloads * Lead integration efforts with frameworks such as vLLM, SGLang, Torch XLA, TensorRT, and Triton * Develop…
- Cadence Design Systems, Inc. (San Jose, CA)
- …an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You will be responsible for the entire lifecycle of our AI…
- Bank of America (Addison, TX)
- Senior Engineer - AI Inference Addison, Texas; Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you must be at ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior-Engineer-AI-Inference\_25029879) **Job Description:** At Bank of America,…
- Bank of America (Kennesaw, GA)
- Software Engineer III -Gen AI Inferencing Addison, Texas; Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Software-Engineer-III--Gen-AI-Inferencing-\_25032986) **Job Description:** At Bank of America,…
- NVIDIA (Santa Clara, CA)
- …capabilities of artificial intelligence. We are seeking an ambitious and forward-thinking AI/ML System Performance Engineer to contribute to the development of ... communication overlap), and hardware-level enhancements. As NVIDIA makes significant strides in AI datacenters, our team holds a central role in maximizing the…
- NVIDIA (Santa Clara, CA)
- …Performance tuning and optimizations of deep learning framework and software components. + Research, prototype, and develop robust and scalable AI tools and ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron...PyTorch, JAX), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang). + Proficient in Python programming, software design,…
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... so on. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from…
- Amazon (Seattle, WA)
- …with vLLM, Triton, and TensorRT, turning breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...key. - ML Frameworks partners with compiler, runtime, and research experts to make AWS Trainium and Inferentia feel…
- Amazon (New York, NY)
- …Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior Research Engineer with a strong hands-on machine learning background, to lead ... Large Language Models (LLMs) and Generative Artificial Intelligence (Gen AI). You will have significant influence on our overall...upon industry leading frameworks (NeMo, Megatron Core, PyTorch, Jax, vLLM, TRT, etc.) - Work with other team members…
- NVIDIA (Santa Clara, CA)
- …across diverse GPU platforms, particularly for physical AI and generative AI applications. You will collaborate with research scientists, software engineers, ... We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning...and hardware specialists to bring cutting-edge AI models from prototype to production. What you will…
- NVIDIA (Santa Clara, CA)
- …strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in…
- NVIDIA (Santa Clara, CA)
- …of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in ... NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such…
- General Motors (Warren, MI)
- …**About the Team:** The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, ... reliable, and cost-efficient platform that powers GM's AI efforts. We're proud to serve as the ...the Role:** We are seeking a Staff ML Infrastructure engineer to help build and scale robust Compute platforms…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...deep learning, enabling breakthroughs in areas like LLM, Generative AI, Recommenders and Vision that have put DL into…
- Red Hat (Boston, MA)
- …Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will work with a ... diverse team of highly motivated engineers on designing and implementing AI/ML workflows and solutions and integrating Partners solutions. You will also be working…
- MongoDB (Palo Alto, CA)
- …the world's most popular developer data platform + Collaborate with ML experts from Voyage.ai to bring cutting-edge research into production at scale + Solve ... applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only…
- NVIDIA (Santa Clara, CA)
- …design to keep pace + Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams What we need to ... We are now looking for a Principal Software Engineer, TensorRT-LLM! NVIDIA is hiring experienced principal...world are using GPUs to power a revolution in AI, enabling breakthroughs in areas like content creation, code…
- NVIDIA (Santa Clara, CA)
- …and finetuning with mixed precision recipes on next-gen NVIDIA GPU architectures. + Research, prototype, and develop robust and scalable AI tools and pipelines. ... highly optimized solutions. What you'll be doing: + Develop algorithms for AI/DL, data analytics, machine learning, or scientific computing + Contribute and advance…