Senior Engineer LLM Serving Jobs in California

14 jobs (page 1)

Categories

All Categories

Engineering (5)

Software/IT (5)

Senior Engineer - LLM…

Qualcomm (San Diego, CA)

…**Ideal candidates for this position will demonstrate the following:** + Experience in serving frameworks, like vLLM + Strong development skills in PyTorch + Strong ... understanding of LLMs, Multi-modal and reasoning models + Experience in executing, analyzing, and optimizing neural networks + Experience in writing high performance software for multicore systems + Experience with Python + Understanding of multi-core… more

Qualcomm (04/26/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate ... of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM ,… more

NVIDIA (05/14/25)
- Save Job - Related Jobs - Block Source
Senior Applied AI Software Engineer…

NVIDIA (Santa Clara, CA)

…engineers enthusiastic about building the next generation of scalable AI systems. As a Senior Applied AI Software Engineer on the Dynamo project, you will ... supporting a variety of LLM frameworks (eg, TensorRT- LLM , vLLM, SGLang). + Disaggregated Serving : Architect...are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want… more

NVIDIA (06/11/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer

The Walt Disney Company (Glendale, CA)

…and entertainment content, across all media platforms. **Job Summary:** We're looking for a Senior Software Engineer to help shape the future of Ad Technology's ... media portfolio to advance the technological foundation and consumer media touch points serving millions of people around the world. Here are a few reasons why… more

The Walt Disney Company (06/18/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software…

NVIDIA (Santa Clara, CA)

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... SGLang and vLLM, which are at the forefront of efficient large-scale model serving and inference. You will play a central role in improving these platforms,… more

NVIDIA (05/08/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Algorithm…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing ... scale in real-world applications. + Hands-on experience with model optimization and serving frameworks, such as: TensorRT, TensorRT- LLM , vLLM, SGLang. As NVIDIA… more

NVIDIA (05/02/25)
- Save Job - Related Jobs - Block Source
Senior DL Algorithms Engineer…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior DL Algorithms Engineer ! NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help ... new features, fix bugs and deliver production code to TRT- LLM , NVIDIA's open-source inference serving library. + Profile and analyze bottlenecks across the full… more

NVIDIA (04/23/25)
- Save Job - Related Jobs - Block Source
Senior AI Engineer - AI Incubation

Charles Schwab (San Francisco, CA)

…The AI Incubation and Enablement team is looking for a talented, technical, hands-on Senior Engineer to drive the development of innovative AI solutions. This ... rapid, iterative software development using Large Language Models. The Senior Engineer on the AI Incubation and...your team. + Experience working with LLMs and shipping LLM -powered applications to production is a big plus. +… more

Charles Schwab (06/20/25)
- Save Job - Related Jobs - Block Source
Senior Staff Engineer , Machine…

LinkedIn (Sunnyvale, CA)

…company. Our team works on a wide range of cutting-edge ML, like LLM fine tuning, text generation, LLM -as-a-judge, prompt engineering, embedding-based retrieval, ... or Ph.D. in Computer Science or related technical discipline * Experience with LLM , ranking, recommender systems * Full stack experience with AI systems, from… more

LinkedIn (06/04/25)
- Save Job - Related Jobs - Block Source
Senior Machine Learning Engineer

Warner Bros. Discovery (San Francisco, CA)

…you are supported, here you are celebrated, here you can thrive. **Machine Learning Engineer - Services (Video AI Platform)** **Who We Are ** At Warner Bros. ... on supporting applications of AI to video , the **M achine Learning Engineer - Services** group powers infrastructure and backend service s behind production… more

Warner Bros. Discovery (06/12/25)
- Save Job - Related Jobs - Block Source
Senior Technical Marketing Engineer…

NVIDIA (Santa Clara, CA)

…full-stack software ecosystem to power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated computing product ... JAX), and inference-specific frameworks & optimizations (Triton Inference Server, TensorRT- LLM , vLLM, SGLang). + Market Awareness - Experience conducting technical… more

NVIDIA (05/07/25)
- Save Job - Related Jobs - Block Source
Senior Artificial Intelligence (AI) Data…

Ankura (CA)

…and test cutting edge solutions in each of our respective areas. Role Overview: As a Senior AI Engineer and Data Scientist (Director) in the Data Analytics & AI ... advances to the quality and breadth of ML and LLM solutions. This role offers the chance to work...The Ankura team consists of more than 2000 professionals serving 3,000+ clients across 55 countries who are leaders… more

Ankura (05/29/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Communication…

NVIDIA (Santa Clara, CA)

…and inference workloads. + Experience in evaluating, analyzing, and optimizing LLM training and inference performance of state-of-the-art models on cutting-edge ... Tensor Parallelism, Expert Parallelism, and FSDP. + Understanding of the emerging serving architectures like Disaggregated Serving and inference servers like… more

NVIDIA (05/30/25)
- Save Job - Related Jobs - Block Source
Tax Senior Manager - Global Trade Advisory

Deloitte (Los Angeles, CA)

…trade management operations. + Supervising assignments by the Global Trade professionals serving as Consultants, Senior Consultants, and Managers. + Developing ... world in our world! What you'll do As a Senior Manager on our Global Trade Advisory team, you...+ Certified SAFe Architect + Certified SAFe Agile Software Engineer + Certified SAFe Product Owner / Product Manager… more

Deloitte (05/24/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search