GenAI Inference Engineer Scalable Jobs in Fremont, CA

30 jobs (page 1)

Categories

All Categories

Engineering (7)

GenAI Inference Engineer…

Databricks Inc. (San Francisco, CA)

A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference . In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will collaborate closely with researchers and engage in performance-critical system challenges,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Technical Lead

Spectro Cloud (San Jose, CA)

…role in shaping the future of our cutting‑edge Palette platform. As a software engineer within our organization, you will be at the forefront of building an ... will stay ahead of emerging AI trends - small models, efficient inference (vLLM/TensorRT), multimodal systems, on‑device LLMs - and recommend tools, frameworks, or… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior AI Software Engineer , GenAI…

NVIDIA Corporation (Santa Clara, CA)

…Frameworks ( and ) team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on Large ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, alignment,… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Machine Learning Ops Engineer…

Rivian (Palo Alto, CA)

… Engineer , you will be instrumental in building and maintaining a scalable training and inference platform using both Databricks and open‑source technologies. ... Scalable ML Infrastructure: Design and implement a scalable training and inference platform using Databricks and open‑source technologies to support ML/AI… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
ML Ops Engineer - GenAI Platform…

Rivian (Palo Alto, CA)

…in California seeks an ML Ops Engineer to build and maintain a scalable training and inference platform. The role involves managing ML/AI models, utilizing ... cloud technologies, and collaborating with cross-functional teams. Candidates should have 5+ years in ML Ops, experience with distributed training frameworks, proficiency in programming languages like Python, and a solid background in cloud technologies. This… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - GenAI…

Databricks Inc. (San Francisco, CA)

Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... low latency, and robust scaling. Your work will encompass the full GenAI inference stack: kernels, runtimes, orchestration, memory, and integration with… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer - GenAI…

Menlo Ventures (San Francisco, CA)

About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... our large language model (LLM) serving systems are fast, scalable , and efficient. Your work will touch the full..., and efficient. Your work will touch the full GenAI inference stack - from kernels and… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior / Staff GenAI Engineer

Apple Inc. (Cupertino, CA)

…with data, ultimately helping teams derive insights that drive product success. As a Staff GenAI Engineer on the Apple Data Platform group's GenAI Platform ... Platform. Description Join Apple's Data Platform as a Staff GenAI Engineer , where you'll be at the... optimization enabling teams across Apple to rapidly create scalable , context-aware GenAI solutions. You will collaborate… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Algorithm Engineer

NVIDIA Corporation (Santa Clara, CA)

…diverse real world workloads. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working on ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Cloud Platform Engineer - Agent Cloud

Rubrik, Inc. (Palo Alto, CA)

…Innovate at the intersection of AI, security, and distributed systems: Design scalable mechanisms for agent discovery and classification . Contribute to the design ... infrastructure . Experience You'll Need: You are passionate about building secure, scalable AI infrastructure that enables enterprises to safely harness the power of… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Altimate Al | Founding Generative AI…

EarlyStage Partners (Sunnyvale, CA)

…What are we looking for? We're in search of a Senior Generative AI Engineer who brings deep expertise in building and deploying large language models and AI ... ML/AI experience with at least 1 year of hands-on experience in GenAI /LLM projects Track record of successfully deploying ML/AI systems in production environments… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr. ML Performance Engineer , AWS Neuron,…

Amazon (Cupertino, CA)

…Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training performances. They are enabled through a state-of-the-art… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Principal/Senior Principal Machine Learning…

Genentech (San Francisco, CA)

…and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI/ML models and output, and building impactful ... the scientific needs. The Opportunity: As a machine learning engineer in AI Enablement, you will be working closely...everyone in between. You'll build, own, and constantly improve scalable AI/ML based systems that unlock the potential of… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff / Principal Engineer - Core…

Socotra, Inc. (San Francisco, CA)

Build the Future of Scalable AI at TrueFoundry At TrueFoundry , we're redefining how ML teams train, deploy, and scale their models. Our LLMOps and MLOps platform ... on Kubernetes-with the same muscle as Big Tech. We're looking for an Engineer who is passionate about scaling deep learning workloads, optimizing multi-GPU training,… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Backend Platform

harvey.ai (San Francisco, CA)

… GenAI ‑native applications - such as supporting high‑throughput model inference , managing streaming and long‑running API interactions, and designing abstractions ... today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive backend platform that… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
System Development Engineer , AGI…

Amazon (San Francisco, CA)

…multi‑lingual large language models (LLM). AGI's mission is to leverage our hyper‑ scalable , general‑purpose large model training and inference systems to build ... cluster and node management to ensure smooth operation of GenAI infrastructure. Continuously improve and automate cluster/capacity/maintenance upgrades. Troubleshoot… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal and ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior AI Software Engineer , GenAI…

NVIDIA (Santa Clara, CA)

…now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core (https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/core) and NeMo Framework ... ) team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working… more

NVIDIA (12/22/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer

DataRobot (San Francisco, CA)

…& Libraries, LLM Onboarding,Tools, Multi-Agent Evaluations, Multimodality, etc.) and GenAI systems (eg Inference optimization, Distributed Training, Finetuning, ... today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be...DataRobot, you will be the technical anchor for our GenAI Tooling and Systems teams, shaping the architecture, ensuring… more

DataRobot (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Backend and AI…

Google (Mountain View, CA)

Senior Software Engineer , Backend and AI Systems, Flow _corporate_fare_ Google _place_ Mountain View, CA, USA; New York, NY, USA **Mid** Experience driving progress, ... Kotlin, or Go). + 3 years of experience testing, maintaining, or launching scalable software products, 1 year of experience with backend software design and… more

Google (01/07/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search