- Amazon (Cupertino, CA)
- …with vLLM , Triton, and TensorRT-turning breakthrough ideas into production‑ready AI for millions of customers. - The ML Inference team collaborates closely ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...key. - ML Frameworks partners with compiler, runtime, and research experts to make AWS Trainium and Inferentia feel… more
- Red Hat (Boston, MA)
- **Job Summary** At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red ... Hat Inference team accelerates AI for the enterprise and brings operational simplicity to...to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model… more
- FocusKPI Inc. (Mountain View, CA)
- …join one of our clients, a high-tech SaaS company. The client is seeking a Frontend AI Engineer (web) with research experience to join their Web Platform ... FocusKPI is looking for an AI /ML Engineer (web) with research...Experience with integrating large language models (LLM), finetuning on LLM/ VLLM , self-hosting LLMs, and on-device (edge) LLM + Some… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...deep learning, enabling breakthroughs in areas like LLM, Generative AI , Recommenders and Vision that have put DL into… more
- Red Hat (Boston, MA)
- … AI landscape (models, techniques),especially emerging threats, mitigation techniques, AI safety research , and relevant regulatory landscapes, alongside ... a mission to bring the power of open-source LLMs and vLLM to every enterprise on the planet. As an ML Engineer , you will work closely with our product and … more
- Red Hat (Boston, MA)
- **Software Quality Engineer , InstructLab. Boston or Raleigh** **Company Description** At Red Hat, we connect an innovative community of customers, partners, and ... high-performing solutions. We offer cloud, Linux, middleware, virtualization, and AI technologies, together with award-winning global customer support, consulting,… more
- MongoDB (San Francisco, CA)
- …the world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge research into production at scale + Solve ... applications by helping them modernize legacy workloads, embrace innovation, and unleash AI . Our industry-leading developer data platform, MongoDB Atlas, is the only… more
- Red Hat (Boston, MA)
- …open source! Red Hat's Global Engineering Team is looking for a Principal Software Engineer to join our newly formed AI Engineering organization. This role will ... a bridge between Red Hat and our peer IBM Research team - this will involve participating in the...Responsibilities (what you'll do)** + Design, implement, and optimize AI tooling and systems to improve the quality and… more
- NVIDIA (Santa Clara, CA)
- …design to keep pace + Collaborate across the company to guide the direction of AI Inferencing, working with software, research and product teams What we need to ... We are now looking for a Principal Software Engineer , TensorRT-LLM ! NVIDIA is hiring experienced principal...world are using GPUs to power a revolution in AI , enabling breakthroughs in areas like content creation, code… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and ... and fast inference across diverse GPU platforms. You will collaborate with research scientists, software engineers, and hardware specialists to bring cutting-edge … more
- Amazon (Bellevue, WA)
- …to work in a highly technical domain at the boundary between fundamental AI research and production engineering such as Quantization, Speculative Decoding, and ... solutions for their cloud services. Looking for a highly-skilled Senior Machine Learning Engineer , to lead the development and delivery of technologies to push the… more
- Amazon (Cupertino, CA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distribute Training team for AWS Neuron, responsible for development, ... in popular machine learning frameworks (PyTorch and JAX) using AWS's specialized AI hardware. Working with our compiler and runtime teams, you'll learn how… more
- NVIDIA (Santa Clara, CA)
- …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... MPI, NCCL, UCX, UCC, NVSHMEM). + Exploring innovative communication technologies: Research and evaluate new communication technologies and techniques to enhance the… more