GenAI Inference Engineer Scalable Jobs in Menlo Park, CA

15 jobs (page 1)

Categories

All Categories

Engineering (7)

Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal and ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Lead AI Engineer ( GenAI Platform…

Capital One (San Jose, CA)

Lead AI Engineer ( GenAI Platform Services) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... deliver our industry leading capabilities with breakthrough product experiences and scalable , high-performance AI infrastructure. At Capital One, you will help bring… more

Capital One (12/19/25)
- Save Job - Related Jobs - Block Source
Senior AI Software Engineer , GenAI…

NVIDIA (Santa Clara, CA)

…now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core (https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/core) and NeMo Framework ... ) team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working… more

NVIDIA (12/22/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer

DataRobot (San Francisco, CA)

…& Libraries, LLM Onboarding,Tools, Multi-Agent Evaluations, Multimodality, etc.) and GenAI systems (eg Inference optimization, Distributed Training, Finetuning, ... today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be...DataRobot, you will be the technical anchor for our GenAI Tooling and Systems teams, shaping the architecture, ensuring… more

DataRobot (01/08/26)
- Save Job - Related Jobs - Block Source
Distinguished AI Engineer (Agentic AI…

Capital One (San Jose, CA)

Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital ... our industry leading capabilities with breakthrough product experiences and scalable , high-performance AI infrastructure. At Capital One, you will...You will contribute to crafting an end to end GenAI SDK, CLI and starter kits that let AI… more

Capital One (12/18/25)
- Save Job - Related Jobs - Block Source
(USA) Principal, Software Engineer

Walmart (Sunnyvale, CA)

…and infrastructure. All of these products and services are supported by scalable and powerful infrastructure, ensuring a secure and seamless employee and customer ... agents for multi-step reasoning, knowledge grounding, and decision-making. + Architect scalable , distributed AI systems with a focus on performance, fault tolerance,… more

Walmart (12/24/25)
- Save Job - Related Jobs - Block Source
Artificial Intelligence & Machine Learning Systems…

Honeywell (San Jose, CA)

We're seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect, design, and develop advanced AI/ML systems that power our ... engineering teams, and collaborate with cross-functional teams to deliver intelligent, scalable , and production-ready AI and machine learning technologies. You will… more

Honeywell (01/07/26)
- Save Job - Related Jobs - Block Source
Software Engineer , SystemML - Scaling…

Meta (Menlo Park, CA)

… GenAI /LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - Scaling / Performance Responsibilities: 1. Enabling reliable ... products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source
Principal/Senior Principal Machine Learning…

Genentech (South San Francisco, CA)

…and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI/ML models and output, and building impactful ... the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely...everyone in between. You'll build, own, and constantly improve scalable AI/ML based systems that unlock the potential of… more

Genentech (12/06/25)
- Save Job - Related Jobs - Block Source
Sr. Staff Software Engineer - Agentic AI…

Zscaler (San Jose, CA)

…building frameworks for all products + Evaluate and integrate state-of-the-art GenAI advances (eg, LLMs/SLMs, retrieval, fine-tuning, inference optimization) to ... agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Software Engineer to join our Digital Experience team. This role is hybrid and based… more

Zscaler (12/26/25)
- Save Job - Related Jobs - Block Source
Senior, Software Engineer - MLE

Walmart (Sunnyvale, CA)

**Position Summary ** We are seeking a highly motivated Machine Learning Engineer to join our Data Science team. In this role, you will not only design, develop, and ... at scale but also play a key role in shaping next-generation AI/ML and GenAI enabled products that will help our suppliers grow their business. You will collaborate… more

Walmart (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Algorithm Engineer…

NVIDIA (Santa Clara, CA)

…and optimize diverse real world workloads. NeMo Framework is an open-source, scalable and cloud-native framework built for researchers and developers working on ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning,… more

NVIDIA (01/15/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer , AI Data,…

Google (Sunnyvale, CA)

Staff Software Engineer , AI Data, Multimodal _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and decision making, solving ... be credible with customers and engineers. + Understanding of genAI model development workflows for post-training and product fine-tuning,...on and is growing every day. As a software engineer , you will work on a specific project critical… more

Google (01/09/26)
- Save Job - Related Jobs - Block Source
(USA) Staff, Software Engineer

Walmart (Sunnyvale, CA)

…data developers and machine learning developers whose strengths are: (1) building scalable data pipelines (2) using machine learning techniques and data science (3) ... through data analysis. **What You'll Do:** Design & train GenAI models targeting CTV use cases: on device LLM...limited memory devices. + Hands on experience with edge inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar).… more

Walmart (11/07/25)
- Save Job - Related Jobs - Block Source
Research Scientist, AI Networking (PhD)

Meta (Menlo Park, CA)

…products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU ... to improve the full-stack distributed ML reliability and performance (eg Large-Scale GenAI /LLM training) from the trainer down to the inter-GPU and network… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search