AI ML Model Runtime Jobs in Pleasanton, CA

16 jobs (page 1)

AI / ML Model Runtime…

Broadcom (Palo Alto, CA)

…team of talented and enthusiastic engineers. This role will be a member of the Private AI Services' Model Runtime team, which is a Kubernetes based control ... a Systems Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...plane that operates ML inferencing services. The successful candidate must have experience… more

Broadcom (07/25/25)

Technical Lead, ML Frameworks,…

Google (Mountain View, CA)

Technical Lead, ML Frameworks, Runtime, Devices, Numerical Acceleration, Google, Mountain View, CA, USA **Advanced** Experience owning ... project strategy, ML design, and optimizing industry-scale ML infrastructure (e.g., model deployment, model...into priorities and projects for the broader group. The ML, Systems, & Cloud AI (MSCA) organization… more

Google (10/04/25)

Machine Learning Engineer, ML…

pony.ai (Fremont, CA)

…evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to ... went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a...compute platform architecture design and software infrastructure. + Apply model optimization and efficient deep learning techniques to models… more

pony.ai (08/02/25)

Software Engineer, Systems ML - Frameworks…

Meta (Menlo Park, CA)

…strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for production ... be working on one of the core areas such as PyTorch framework components, AI compiler and runtime, high-performance kernels and tooling to accelerate machine… more

Meta (09/06/25)

Research Scientist, AI & Systems Co-design…

Meta (Menlo Park, CA)

…via concurrent design and optimization of many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network ... sustained scaling and hardware efficiency during training and inference. 3. Benchmark, analyze, model and project the performance of AI workloads against a wide… more

Meta (08/08/25)

Principal AI Infrastructure Abstraction…

Cisco (San Jose, CA)

…You will bridge the gap between raw compute resources and AI / ML frameworks, allowing infrastructure teams and model developers to consume shared ... with a focus in **multi-tenant environments**. + Experience integrating with **AI / ML platforms or pipelines** (e.g., PyTorch, TensorFlow, Triton Inference Server,… more

Cisco (10/18/25)

Senior Developer Relations Manager, AI…

NVIDIA (Santa Clara, CA)

…several software ecosystem projects with external partners, technical background in AI / ML systems, fundamentals of computer systems architectures (ISAs) ... researchers and developers. Focus will be on accelerating GenAI model training and inference, which is making a major...and the peer NVIDIA team(s) + Proven understanding of AI / ML software ecosystem and GPU acceleration libraries… more

NVIDIA (10/06/25)

Senior Software Engineering Manager, Cloud…

Google (Sunnyvale, CA)

…experience leading technical project strategy, ML design, and optimizing industry-scale ML infrastructure (e.g., model deployment, model evaluation, data ... Senior Software Engineering Manager, Cloud AI, Agents, Google, Sunnyvale, CA, USA...for the team designing and building our core agentic runtime. This includes solving first-principle challenges in Large Language… more

Google (10/01/25)

Kubernetes Platform Engineer - Private AI

Broadcom (Palo Alto, CA)

…control plane that automates the lifecycle of AI Services: Model Gallery, Model Runtime and ML API Gateway, Data Indexing and Retrieval, and Agent ... Kubernetes Platform Engineer to join VMware Cloud Foundation's (VCF) AI and Advanced Services team. This position is key...key to building a best in class private cloud AI platform. You will have a high impact by… more

Broadcom (10/08/25)

Staff Software Engineer, Quantization…

Google (Sunnyvale, CA)

…leading technical project strategy, ML design, and working with industry-scale ML infrastructure (e.g., model deployment, model evaluation, data ... intersection of technical analysis, cross-organizational strategy, and execution. The ML, Systems, and Cloud AI (MSCA) organization...of solutions in specialized ML areas, optimize ML infrastructure, and guide the development of model… more

Google (10/09/25)

Senior Staff Software Engineer, On-Device Machine…

Google (Sunnyvale, CA)

…and hardware-accelerated ML inference techniques. + Understanding of Generative AI model architectures and their optimization for on-device execution. + ... strategy, ML design, and working with industry-scale ML infrastructure (e.g., model deployment, model...on-device model inference via optimizations in the model structure, on-device runtime and kernel implementation.… more

Google (10/01/25)

Lead Engineer, Inference Platform

MongoDB (Palo Alto, CA)

…routing, and model health monitoring + Collaborate with peers across ML, infra, and product teams to define architectural patterns and operational practices that ... model serving architecture using tools like vLLM, ONNX Runtime, and container orchestration in Kubernetes + Provide technical...world's most popular developer data platform + Collaborate with ML experts from Voyage.ai to bring cutting-edge… more

MongoDB (09/27/25)

Senior Machine Learning Engineer - Experience…

SAP (Palo Alto, CA)

…runtime. With Postgres, Cassandra, HANA Cloud vector engines, and SAP's Generative AI Hub for model governance, we provide an enterprise-grade substrate for ... AI services. You'll drive both design and implementation of ML pipelines, orchestrators, and agentic workflows-turning foundational model research into… more

SAP (09/04/25)

Software Engineer III, Infrastructure, Vertex…

Google (Sunnyvale, CA)

…compute technologies, storage or hardware architecture. + 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, ... + Experience developing large-scale applications on Cloud. + Experience on AI Agents, embedding models, in-context learning, evaluation and Open Source technologies… more

Google (09/30/25)

Software Engineer III, Infrastructure, Vertex…

Google (Sunnyvale, CA)

…compute technologies, storage or hardware architecture. + 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, ... an academic or industry setting. + Experience with Generative AI Agent. + Experience developing large-scale applications on Cloud...a high-code agent development SDK/Kit (aka ADK), a managed runtime (e.g., Agent Engine) with a suite of managed… more

Google (10/17/25)

Senior Architecture Energy Modeling Engineer

NVIDIA (Santa Clara, CA)

…models. + Develop and own methodologies and workflows to train models using ML and/or statistical techniques. + Improve the accuracy of trained models by using ... different model representations, objective functions, and learning algorithms. + Develop...preferably in Python, C++. + Background in machine learning, AI, and/or statistical modeling. + Background in computer architecture… more

NVIDIA (09/05/25)