AI Inference Engineer Jobs in San Francisco, CA

36 jobs (page 1)

Categories

All Categories

Engineering (8)

Software/IT (6)

Senior Machine Learning Engineer , Ads…

Unity Technologies (San Francisco, CA)

**San Francisco, CA, USA** **Senior Machine Learning Engineer , Ads Demand Optimization** Location San Francisco, CA, USA Department Engineering Requisition ID ... advertisers' experience. We are seeking skilled MLEs to design and implement AI -native demand optimization algorithms and systems. You will own end-to-end solutions… more

DirectEmployers Association (12/13/25)
- Save Job - Related Jobs - Block Source
AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... of AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Lead AI Engineer (FM Hosting, LLM…

Capital One (San Francisco, CA)

Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700… more

Capital One (11/04/25)
- Save Job - Related Jobs - Block Source
Distinguished AI Engineer (Agentic…

Capital One (San Francisco, CA)

Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...worked. San Francisco, CA: $293,600 - $335,100 for Distinguished AI Engineer San Jose, CA: $293,600 -… more

Capital One (12/18/25)
- Save Job - Related Jobs - Block Source
AI Kernel Engineer

quadric.io, Inc (Burlingame, CA)

…GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number ... of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Senior/Principal Software Engineer…

Genentech (South San Francisco, CA)

…scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack engineering, you will be… more

Genentech (12/06/25)
- Save Job - Related Jobs - Block Source
Principal/Senior Principal Machine Learning…

Genentech (South San Francisco, CA)

…scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely with folks that span the… more

Genentech (12/06/25)
- Save Job - Related Jobs - Block Source
Senior Principal Software Engineer…

Oracle (Redwood City, CA)

… at the Principal Engineer level. We are at the forefront of AI innovation, exploring the next generation of AI accelerators and hardware solutions. As ... a Senior Principal software engineer , part of our growing team, you will be...will be involved in evaluation, prototyping, and optimizing cutting-edge AI hardware, AI accelerators, including custom-designed … more

Oracle (11/25/25)
- Save Job - Related Jobs - Block Source
Sr. Software Development Engineer , FAR…

Amazon (San Francisco, CA)

Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers like Pieter ... run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
AI Applications Engineer

quadric.io, Inc (Burlingame, CA)

…both NN graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands-on ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...users in the field. The AI Application Engineer will [1] integrate Quadric… more

quadric.io, Inc (11/25/25)
- Save Job - Related Jobs - Block Source
Senior AI /ML Tooling Engineer

General Motors (San Francisco, CA)

**Job Description** **Senior AI /ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... training, and inference of ML models. You will develop and enhance...toolchain and stack, to leverage the latest advancements in AI + Influence model architecture decisions and strategy within… more

General Motors (11/04/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer , Frontier…

Amazon (San Francisco, CA)

Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production ... scale. As a Software Development Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more

Amazon (12/02/25)
- Save Job - Related Jobs - Block Source
Research Engineer , Language - Generative…

Meta (Menlo Park, CA)

… inference ; and/or multilingual and multimodal modeling. **Required Skills:** Research Engineer , Language - Generative AI Responsibilities: 1. Design methods, ... **Summary:** Meta is seeking a Research Engineer to join our Large Language Model (LLM)...for strong engineers who have a background in generative AI and NLP, with experience in areas like language… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source
AI /HPC System Performance Engineer

Meta (Menlo Park, CA)

**Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI . This results in a dramatic ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI /HPC System Performance Engineer Responsibilities: 1. Lead… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source
Software Development Engineer…

Amazon (San Francisco, CA)

…Build and maintain scalable data infrastructure to support cutting-edge AI robotics research. Design dataset management systems including automated pipelines ... hands-on technical contribution to data preparation workflows. About the team At Frontier AI & Robotics, we're not just advancing robotics - we're reimagining it… more

Amazon (01/11/26)
- Save Job - Related Jobs - Block Source
Software Engineer , Systems ML - Frameworks…

Meta (Menlo Park, CA)

…authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms at Meta.You will be working on ... help in driving next generation hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer...core compilers to support new state of the art inference and training AI hardware accelerators and… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source
Software Engineer , ML Infrastructure,…

Snap Inc. (San Francisco, CA)

…forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations that make ... more efficient and impactful. We're looking for a Software Engineer , ML Infrastructure to join Snap Inc! What you'll...models for content ranking and recommendations + Develop high-performance inference systems to ensure fast and efficient AI… more

Snap Inc. (01/09/26)
- Save Job - Related Jobs - Block Source
Summer Intern - Machine Learning Systems…

General Motors (San Francisco, CA)

**Job Description** **About the Team** **:** We are building a state-of-the-art AI training platform in close collaboration with model teams to deliver Autonomous ... milestones, maximizing model flop utilization and iteration speed, and enhancing inference systems. We are augmenting training, data processing, evaluation and the… more

General Motors (12/12/25)
- Save Job - Related Jobs - Block Source
Sr Staff R&D Engineer

The Walt Disney Company (Nicasio, CA)

…Skywalker Sound Development Group is seeking a highly accomplished **Sr Staff R&D Engineer ( AI /ML)** to lead the development of transformative audio intelligence ... + Own end-to-end model lifecycle management: pretraining, fine-tuning, validation, inference optimization, and CI/CD integration. + Guide the development of… more

The Walt Disney Company (11/20/25)
- Save Job - Related Jobs - Block Source
Principal Software Development Engineer

Oracle (San Francisco, CA)

…serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to agent-based AI systems or tool-based inference workflows. - Knowledge of ... future of cloud computing-designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of… more

Oracle (12/20/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search