Sr ML Kernel Performance Jobs in California

29 jobs (page 1)

Categories

All Categories

Engineering (14)

Sr . ML Kernel…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (11/14/25)
- Save Job - Related Jobs - Block Source
Software Engineering Manager, ML…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (10/21/25)
- Save Job - Related Jobs - Block Source
Sr . Product Manager - Kernels, AI/…

Amazon (Cupertino, CA)

…stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more

Amazon (09/02/25)
- Save Job - Related Jobs - Block Source
Senior Linux Kernel Systems Software…

NVIDIA (Santa Clara, CA)

NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more

NVIDIA (10/01/25)
- Save Job - Related Jobs - Block Source
Software Dev Engineer II - Neuron Kernel…

Amazon (Cupertino, CA)

…are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will ... deliver the best-in-class ML training performance with the most teraflops...AWS Neuron Software Development Kit (SDK), which includes an ML compiler, Neuron Kernel Interface (NKI) compiler,… more

Amazon (10/25/25)
- Save Job - Related Jobs - Block Source
Sr . Java Full Stack Developer

Deloitte (San Jose, CA)

…familiarity with Go or Rust a plus. + Strong understanding of AI/ ML frameworks (PyTorch, TensorFlow, ONNX) and performance /model optimization. + Familiarity ... Sr . Java Full Stack Developer - Project Delivery...do/Responsibilities Join our AI and Systems Co-Design team, pioneering high- performance software and hardware technologies for AI and next-generation… more

Deloitte (11/02/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer, AI/…

Amazon (Cupertino, CA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more

Amazon (11/05/25)
- Save Job - Related Jobs - Block Source
Director / Sr Program Manager, AI…

quadric.io, Inc (Burlingame, CA)

…and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems. Unlike other NPUs or neural ... Domain Knowledge: Demonstrated ability to drive complex technical projects in the AI/ ML and embedded processing domain. Highly Desired Skills and Experience (Pluses)… more

quadric.io, Inc (10/18/25)
- Save Job - Related Jobs - Block Source
Senior Computer Vision, VLM…

NVIDIA (Santa Clara, CA)

…non- ML computer vision + Strong fundamentals with system-level performance : multi-threaded, multi-process and distributed software development. + Grounding in ... pre- and post-processing. + Improve the efficiency of VLM models themselves: kernel optimization in CUDA + Upstream improvements to SDKs and libraries across… more

NVIDIA (09/03/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer,…

NVIDIA (Santa Clara, CA)

…learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has...the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing… more

NVIDIA (11/01/25)
- Save Job - Related Jobs - Block Source
Remote Senior Performance Engineer

Insight Global (Palo Alto, CA)

Job Description Insight Global is looking to hire a Senior Performance Engineer for a client in the quantum computing space. This is a fully remote contract ... machine learning models on GPU clusters. - Fine-tune GPU kernels for performance optimization. - Collaborate closely with scientists to support computational needs.… more

Insight Global (11/10/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, Profiling…

NVIDIA (Santa Clara, CA)

…solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms: Transform user ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more

NVIDIA (11/07/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, Profiling…

NVIDIA (Santa Clara, CA)

…solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms : Transform ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more

NVIDIA (10/30/25)
- Save Job - Related Jobs - Block Source
(USA) Senior , Software Engineer (Machine…

Walmart (Sunnyvale, CA)

…management + Knowledge of low-latency serving architectures + Familiarity with ML -specific security requirements + Background in performance profiling and ... **Position Summary ** As a Senior Machine Learning Engineer, you are a technical...validation + Develop monitoring, logging, and alerting systems for ML services + Create infrastructure for A/B testing and… more

Walmart (09/23/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…for customer use cases, ensuring optimal end-to-end workflows and balanced accuracy- performance trade-offs. + Conduct deep GPU kernel -level profiling to ... and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI… more

NVIDIA (09/23/25)
- Save Job - Related Jobs - Block Source
Senior Solution Engineer, AI Factory Triage

NVIDIA (Santa Clara, CA)

…for an experienced engineer to triage customers' hardware platform issues and AI/ ML workloads in huge datacenters of rack-scale platforms, solve customer problems, ... solid programming skills, and experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus. What you'll be… more

NVIDIA (11/01/25)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect, Spectrum-X Low…

NVIDIA (Santa Clara, CA)

NVIDIA networking designs and manufactures high- performance networking equipment that enable the most powerful super computers in the largest data centers in the ... or RoCE (RDMA over Converged Ethernet) we make powerful ML /AI platforms possible. We believe in our people and...in large clusters even more performant. As a networking Sr . Solutions Architect at NVIDIA you will have agency… more

NVIDIA (09/11/25)
- Save Job - Related Jobs - Block Source
Agentic AI, AI & Data Specialist Senior

Deloitte (Sacramento, CA)

…critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI ... and business applications - delivering production-grade reliability, scalability, and performance . + Engineer core solution components directly, including data… more

Deloitte (11/22/25)
- Save Job - Related Jobs - Block Source
Senior Manager, Machine Learning Engineer

Cisco (San Jose, CA)

…towards optimal results with regard to reusability, security, reliability, and performance . * Collaborate with various stakeholders to design complex systems ... comprised of multiple ML and non- ML services which meet the... services which meet the highest levels of security, performance , reliability, and scalability while satisfying requirements from within… more

Cisco (11/14/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, ... of use, compute and memory efficiency, and achieving the best accuracy- performance tradeoffs through software-hardware co-design. Your work will span multiple layers… more

NVIDIA (09/18/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search