Staff ML Engineer Inference Jobs in Fremont, CA

81 jobs (page 1)

Categories

All Categories

Engineering (15)

Staff ML Engineer…

General Motors (Sunnyvale, CA)

…more) while maintaining reliability and cost efficiency. About the Role We are seeking a Staff ML Infrastructure engineer to help build and scale robust ... This job is eligible for relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff AI/ ML Engineer…

General Motors (Mountain View, CA)

…California, United States of America time type Full time posted on Posted 30+ Days Ago Staff ML Engineer - Offboard Embodied AI remote type Hybrid locations ... Staff AI/ ML Engineer -... Staff AI/ ML Engineer - Onboard Embodied AI...ML models, delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. Lead and architect… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Principal AI/ ML Engineer - Onboard…

General Motors (Mountain View, CA)

…California, United States of America time type Full time posted on Posted 30+ Days Ago Staff ML Engineer - Offboard Embodied AI remote type Hybrid locations ... ML models, delivering end-to-end solutions capable of real-time inference and robust autonomous driving performance. Lead and architect...position for which you are applying. Similar Jobs (5) Staff AI/ ML Engineer - Onboard… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Backend Engineer : Real-Time…

Sanas (Palo Alto, CA)

A tech innovator in human communication is seeking a Staff Software Engineer to design and build real-time translation infrastructure. This role will focus on ... developing critical microservices that enable low-latency processing and align technical strategy across teams. Candidates should have 7+ years of experience in software engineering, particularly in distributed architectures, and proficiency in Python or Go.… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Pod Networking Software Engineer

Etched.ai, Inc. (San Jose, CA)

…role in developing, qualifying, and optimizing high-performance networking solutions for large-scale inference workloads. As a Pod Software Engineer , you will ... focus on developing and qualifying software that drives communication amongst Sohu inference nodes in multi-rack inference clusters. You will collaborate closely… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Product Engineer

Etched.ai, Inc. (San Jose, CA)

…5+ years of experience in software engineering, with a strong emphasis on ML inference infrastructure and systems Proficiency with modern web frameworks (eg, ... similar) and backend infrastructure (eg, Python, Node.js) Familiarity with inference serving stacks (vLLM, SGLang), ML frameworks...between engineering and research, and we expect all of our technical staff to contribute to both as needed.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr./ Staff Behavior Prediction…

Booster (Mountain View, CA)

…Arkansas, and Ontario, Canada. About the role: We are looking for talented ML engineers with expertise in classical and modern machine learning techniques for ... and delivery of a multi-modal prediction system. The ideal candidate will be a ML expert who has overseen a process from the R&D phase through product shipment… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Backend Engineer , Speech AI…

Inworld AI (Mountain View, CA)

…(STT), must be exceptionally fast, reliable, and cost‑effective. We are seeking a Staff Backend Engineer to build this critical infrastructure. You will be ... that deliver seamless voice experiences. Partner closely with Product Managers and ML engineers to define scope, identify technical trade‑offs, and drive the product… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer…

Sanas (Palo Alto, CA)

Staff Software Engineer : Microservice Infrastructure & Real-Time ML Inference Sanas.ai is pioneering the future of human communication. Founded by a team ... Terraform, Kubernetes, IaaC patterns and node pools (CPU/GPU). Experience in ML Inference : Triton/vLLM/TorchServe; GPU scheduling/packing, batching, A/B and… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff II Software Engineer AI/…

BlackLine (Pleasanton, CA)

…Work, Play and Grow at BlackLine! Make Your Mark: As a Machine Learning Operations Engineer , you will play a pivotal role in bridging the gap between data science ... Drift, Latency SLAs). Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments. Lead incident response… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - AI/…

GEICO (Palo Alto, CA)

…years of platform engineering or infrastructure experience* Experience with Staff Engineer or Tech Lead roles in ML /AI organizations* Background in ... GEICO . For more information, please . Staff Software Engineer - AI/ ML...resource optimization* Design, implement, and maintain feature stores for ML model training and inference pipelines* Build… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer…

Genesis AI (San Carlos, CA)

What You'll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and ... optimize distributed inference systems on GPU clusters, pushing throughput with large-batch...stacks What You'll Bring Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years) Production-grade expertise… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer AI/ ML…

Amazon (Cupertino, CA)

Software Development Engineer AI/ ML , Inference Serving, AWS Neuron AWS Neuron is the software stack powering AWS Inferentia and Trainium machine learning ... the boundaries of what's possible in large-scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 https://awsdocs-neuron.readthedocs-hosted.com/en/latest/libraries/nxd- inference… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff ML Engineer…

Woven (Palo Alto, CA)

…computer vision. WHO ARE WE LOOKING FOR? The AD/ADAS team is seeking a seasoned ML engineer to support the development of foundation models for our autonomy ... problems in 3D geometric computer vision to designing and deploying novel ML architectures for perception, prediction, and motion planning for Toyota customers. We… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer , AI/…

Amazon (San Francisco, CA)

Senior Software Development Engineer , AI/ ML , AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer , AI/…

Amazon (San Francisco, CA)

Software Development Engineer , AI/ ML , AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer

XPENG (Santa Clara, CA)

…learning, and smart connectivity. Job Responsibilities: Work on systems such as inference platform, ETL pipeline, data mining, and image search. Collaborate with ... ML algorithm engineers and data analysts to improve perception...data engineers and infra engineers to build a state-of-the-art ML engineering platform. Analyze requirements, system design, implementation, testing,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - GenAI…

Databricks Inc. (San Francisco, CA)

Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... system components and driving architectural decisions end‑to‑end. Deep understanding of ML inference internals: attention, MLPs, recurrent modules, quantization,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr. ML Kernel Performance Engineer…

Amazon (Cupertino, CA)

…seamlessly integrates with popular ML frameworks like PyTorch, enabling unparalleled ML inference and training performance. As part of the broader Neuron ... Sr. ML Kernel Performance Engineer , AWS Neuron,...work safely and cooperatively with other employees, supervisors, and staff ; adhere to standards of excellence despite stressful conditions;… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Tech Lead | Senior Staff Software…

Machinify Inc. (Palo Alto, CA)

…financial outcomes and drive down healthcare costs. We are looking for an experienced Staff or Principal level Software Engineer to join our growing engineering ... real-time data as well as batch mode ingestion, modeling, and inference Shape long-term architectural direction, technology stack, and engineering best practices… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search