- MongoDB (Palo Alto, CA)
- … Engineer , you'll focus on building core systems and services that power model inference at scale. You'll own key components of the infrastructure, work ... **About the Role** We're looking for a Senior Engineer to help build the next-generation inference...multi-tenant service design + Familiar with concepts in ML model serving and inference runtimes, even if… more
- MongoDB (Palo Alto, CA)
- We're looking for a Lead Engineer , Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform, you'll be hands-on with...project **Nice to Have** + Prior experience working with model teams on inference -optimized architectures + Background… more
- quadric.io, Inc (Burlingame, CA)
- …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...Electric Engineering. + 5+ years of experience in AI/LLM model inference and deployment frameworks/tools + experience… more
- Capital One (San Francisco, CA)
- …develop, test, deploy, and support AI software components including foundation model training, large language model inference , similarity search, ... Lead AI Engineer (FM Hosting, LLM Inference ) **Overview**...developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency,… more
- Snap Inc. (San Francisco, CA)
- …ranking and recommendation systems more efficient and impactful. We're looking for a Software Engineer , ML Infrastructure to join Snap Inc! What you'll do: ... science or equivalent experience + 6+ years of post-Bachelor's software development experience; or Master's degree in a technical...Experience working with ML Training platforms or optimizing AI model inference + Familiarity with ML frameworks… more
- DoorDash (San Francisco, CA)
- …Logistics, Fraud, and Search. About the Role We're looking for a Staff Software Engineer with deep expertise in ML model serving to drive the next generation ... of our inference platform. This is a highly technical, hands-on role:...it makes sense - to accelerate innovation. As Staff Software Engineer , you'll pair deep technical execution… more
- Meta (Menlo Park, CA)
- …hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels ... be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of...highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production… more
- Amazon (San Francisco, CA)
- …you'll contribute to breakthrough foundation models run at production scale. As a Software Development Engineer embedded in our science team, you'll be ... applications, leveraging your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll balance deep… more
- Amazon (San Francisco, CA)
- …foundation models run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental in transforming cutting-edge ... applications, leveraging your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll balance deep… more
- Genentech (South San Francisco, CA)
- …independent, and constantly evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack ... and optimise workflows. We also work on scaling up model training and inference , evaluating the quality...field, and 4+ years of professional experience in full-stack software development (Senior Software Engineer );… more
- Oracle (San Francisco, CA)
- …agents that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation ... - Contribute to the development and optimization of distributed systems for model inference and agent execution. - Implement features and enhancements… more
- Oracle (Redwood City, CA)
- …the next generation of AI accelerators and hardware solutions. As a Senior Principal software engineer , part of our growing team, you will be involved in ... engineering team is seeking a highly driven GPU platform software & system development engineer at the...with industry-standard hardware (eg, NVIDIA GPUs) for training and inference workloads. + Develop tools and processes for evaluating… more
- Amazon (San Francisco, CA)
- …help users find content hyper-personalized for them. Twitch is looking for a Senior Software Engineer to join our Machine Learning Infrastructure team. You will ... Software Development Engineer @ Twitch San...and you will have hands-on experience building and launching model -based experiments to improve products. **You Will:** + Architect… more
- Amazon (San Francisco, CA)
- …help users find content hyper-personalized for them. Twitch is looking for a Senior Software Engineer to join our Machine Learning Infrastructure team. You will ... work with software engineers, applied scientists and product managers in our...and you will have hands-on experience building and launching model -based experiments to improve products. You Will - Architect… more
- Amazon (San Francisco, CA)
- …and help users find content hyper-personalized for them. Twitch is looking for a Software Engineer II to join our Machine Learning Infrastructure team. You will ... work with software engineers, applied scientists and product managers in our...and you will have hands-on experience building and launching model -based experiments to improve products. You Will - Design… more
- ServiceNow, Inc. (San Francisco, CA)
- …+ Experience with AI-powered tools or workflows, including validation of datasets, model predictions, and inference consistency. FD21 For positions in this ... of builders, thinkers, and problem-solvers dedicated to delivering scalable, AI-powered software products that elevate how organizations work. We value clean… more
- Amazon (San Francisco, CA)
- …generative AI applications at scale. Additionally, we work closely with foundational model providers to optimize AI models for Amazon Silicon, enhancing performance ... continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF) * Model Optimization on AWS Silicon: Optimize AI models for deployment on… more
- Meta (Menlo Park, CA)
- …on the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - Scaling / Performance Responsibilities: 1. ... role, you will be a member of the Network.AI Software team and part of the bigger DC networking...and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed… more
- Amazon (San Francisco, CA)
- …experience - Experience with Machine Learning and Large Language Model fundamentals, including architecture, training/ inference lifecycles, and optimization ... state-of-the-art data filtering techniques including deduplication, quality scoring, and model -based filtering methods. Collaborate directly with science teams to… more
- Rubrik (Palo Alto, CA)
- …platforms.** + **Experience with modern AI and LLM infrastructure** - including ** model gateways** (like LiteLLM or MCP), fine-tuning, inference optimization, or ... telemetry, audit trails, and behavioral analytics across thousands of agents. ** Engineer scalable, resilient AI platforms:** + Architect and scale systems that… more