Senior High Performance LLM Jobs in Pleasanton, CA

89 jobs (page 1)

Categories

All Categories

Engineering (28)

Software/IT (11)

Senior High - Performance…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior High - Performance LLM Training Engineer! NVIDIA is seeking experienced engineers specializing in performance analysis ... world's most advanced computing systems. This position focuses on optimizing NVIDIA's high - performance LLM software stack in frameworks like PyTorch and JAX… more

NVIDIA (01/07/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer,…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Software Engineer, LLM Performance ! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... analyzing and improving the performance of LLM inference! NVIDIA is rapidly... modeling, profiling, debug, and code optimization of a DL/HPC/ high - performance application + Architectural knowledge of CPU… more

NVIDIA (11/25/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer,…

NVIDIA (Santa Clara, CA)

…with performance modeling, profiling, debug, and code optimization of a DL/HPC/ high - performance application + Architectural knowledge of CPU and GPU + GPU ... We are now looking for a TensorRT- LLM Software Development Engineer! NVIDIA is hiring software...can be scaled to multiple platforms for functionality and performance + Perform benchmarking, profiling, and system-level programming for… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior AI Engineer (AI Foundations,…

Capital One (San Jose, CA)

Senior AI Engineer (AI Foundations, LLM Core and Agentic AI) **Overview:** At Capital One, we are creating responsible and reliable AI systems, changing banking ... deliver our industry leading capabilities with breakthrough product experiences and scalable, high - performance AI infrastructure. At Capital One, you will help… more

Capital One (11/06/25)
- Save Job - Related Jobs - Block Source
Principal Software Engineer - Large-Scale…

NVIDIA (Santa Clara, CA)

…equivalent experience + 15+ years of experience building large-scale distributed systems, high - performance storage, or ML systems infrastructure in C/C++ and ... NVIDIA Dynamo is a high -throughput, low-latency inference framework for serving generative AI...models across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU shards,… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior HPC and AI Networking…

NVIDIA (Santa Clara, CA)

…sounds like a fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research ... LLM training on NVIDIA supercomputers and distributed systems focusing on high - performance networking and Nvidia Collective Communications Library (NCCL). +… more

NVIDIA (12/03/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Performance…

NVIDIA (Santa Clara, CA)

We are now seeking a Senior Deep Learning Performance Architect! NVIDIA is looking for outstanding Performance Architects with a background in performance ... the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing:...+ Familiarity with advanced optimizations and SW/HW co-design in LLM training and inference + Exposure to using AI… more

NVIDIA (12/04/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Performance…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Performance Architect! NVIDIA is seeking outstanding Performance Analysis Architects with a background in ... and develop the next generation of architectures that accelerate AI and high - performance computing applications. What you'll be doing: + Develop innovative… more

NVIDIA (10/16/25)
- Save Job - Related Jobs - Block Source
Senior DGX Cloud Performance…

NVIDIA (Santa Clara, CA)

…seeking highly skilled Parallel and Distributed Systems engineers to drive the performance analysis, optimization, and modeling to define the architecture and design ... have a deep understanding of the methodology to conduct end to end performance analysis of critical AI applications running on large scale parallel and distributed… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior AI Architect

NVIDIA (Santa Clara, CA)

… LLM infrastructure accordingly. + Optimize the infrastructure for performance , scalability, and reliability, ensuring secure and efficient management of data. ... to improve LLM infrastructure. + Lead with purpose and maintain high -quality engineering practices that inspire others to achieve excellence. What we need to… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior AI Engineering Manager, Enterprise…

LinkedIn (Mountain View, CA)

…the company. Our team works on a wide range of cutting-edge ML: LLM fine tuning, text generation, LLM -as-a-judge, prompt engineering, embedding-based retrieval, ... be based in Sunnyvale, CA. **Key Responsibilities** As a senior AI Manager, you will lead a team of...applied scientists and engineers to design and deliver scalable LLM and matching solutions that improve the relevance and… more

LinkedIn (12/17/25)
- Save Job - Related Jobs - Block Source
Senior , Software Engineer

Walmart (Sunnyvale, CA)

…with DevOps, cloud, and observability platforms. We are looking for a ** Senior Software Engineer** to design, build, and scale intelligent systems and automations ... techniques such as anomaly detection, classification, recommendation systems, and LLM -based reasoning to deliver actionable insights and automate decision-making.… more

Walmart (01/13/26)
- Save Job - Related Jobs - Block Source
Senior ML Engineer (Internet Security)

Palo Alto Networks (Santa Clara, CA)

…a security-sensitive environment. + Own the end-to-end lifecycle of ML and LLM components, from problem formulation and model development to production deployment, ... monitoring, and iterative improvement. + Integrate ML and LLM -based services with backend systems and data pipelines, ensuring...and build data analysis tools to continuously improve model performance as data and threats evolve. + Partner closely… more

Palo Alto Networks (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer,…

NVIDIA (Santa Clara, CA)

…sophisticated AI applications. Our team is responsible for developing and maintaining high - performance deep learning frameworks, including SGLang and vLLM, which ... as other DL frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA… more

NVIDIA (12/05/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…Familiarity with NVIDIA's deep learning SDKs (eg, TensorRT). + Experience developing high - performance GPU kernels for machine learning workloads using CUDA, ... focuses on optimizing generative AI models such as large language models ( LLM ) and diffusion models for maximal inference efficiency using techniques ranging from… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
AI Senior Staff Systems Engineer

Cadence Design Systems, Inc. (San Jose, CA)

…in techniques such as model quantization, distillation, and using high - performance serving frameworks (eg, vLLM, TGI, TensorRT- LLM ) to maximize inference ... Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in...lifecycle of our AI systems, from architecting and building high - performance GPU clusters to deploying and optimizing… more

Cadence Design Systems, Inc. (12/29/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer,…

NVIDIA (Santa Clara, CA)

…sophisticated AI applications. Our team is responsible for developing and maintaining high - performance open-source frameworks, which are at the forefront of ... NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference...frameworks. Your work will focus on identifying and driving performance improvements for state-of-the-art LLM and Generative… more

NVIDIA (12/07/25)
- Save Job - Related Jobs - Block Source
Senior Offensive Security Engineer - Web…

FocusKPI Inc. (Mountain View, CA)

…clients, a high -tech SaaS company. Team is looking for a Senior Offensive Security Engineer to proactively identify, exploit, and help eliminate security ... FocusKPI is seeking a Senior Offensive Security Engineer (Web & AI systems)...contract with potential to convert depending on the candidate's performance Pay Range: $85 - 100/hr **No C2C resumes… more

FocusKPI Inc. (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Inference Technical Product…

NVIDIA (Santa Clara, CA)

…solutions in some of the world's most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at ... We are looking for a Senior Technical Product Marketing Manager. This role will...the latest AI models and NVIDIA's platform to maximize performance and minimize TCO + Develop crisp clear positioning,… more

NVIDIA (12/25/25)
- Save Job - Related Jobs - Block Source
Senior AI Engineer, NeMo Retriever - Model…

NVIDIA (Santa Clara, CA)

…engines from NVIDIA and the community, including NVIDIA TensorRT and TensorRT- LLM , NIM microservices optimize response latency and throughput for each combination ... NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and maximum data privacy. It delivers quick, context-aware responses… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search