Sr ML Kernel Performance Jobs in Santa Clara, CA

26 jobs (page 1)

Categories

All Categories

Engineering (12)

Software/IT (6)

Sr . ML Kernel…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (11/14/25)
- Save Job - Related Jobs - Block Source
Software Engineering Manager, ML…

Amazon (Cupertino, CA)

…Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more

Amazon (12/04/25)
- Save Job - Related Jobs - Block Source
Sr . Product Manager - Kernels, AI/…

Amazon (Cupertino, CA)

…stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more

Amazon (12/12/25)
- Save Job - Related Jobs - Block Source
Senior Linux Kernel Systems Software…

NVIDIA (Santa Clara, CA)

NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more

NVIDIA (10/01/25)
- Save Job - Related Jobs - Block Source
Sr . Java Full Stack Developer

Deloitte (San Jose, CA)

…familiarity with Go or Rust a plus. + Strong understanding of AI/ ML frameworks (PyTorch, TensorFlow, ONNX) and performance /model optimization. + Familiarity ... Sr . Java Full Stack Developer - Project Delivery...do/Responsibilities Join our AI and Systems Co-Design team, pioneering high- performance software and hardware technologies for AI and next-generation… more

Deloitte (11/02/25)
- Save Job - Related Jobs - Block Source
Sr Principal Machine Learning Engineer…

Palo Alto Networks (Santa Clara, CA)

…community. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers, and set the standard for performance , scalability, ... contributions in these areas are a significant plus. + Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton… more

Palo Alto Networks (12/18/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, Core ML…

Google (Sunnyvale, CA)

Senior Software Engineer, Core ML Frameworks _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... internal and external users. + Build infrastructure and tooling for kernel development, including benchmarking suites, auto-tuning frameworks, performance … more

Google (12/04/25)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer - AI/…

Amazon (Cupertino, CA)

…with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more

Amazon (12/10/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer,…

NVIDIA (Santa Clara, CA)

…learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has...the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing… more

NVIDIA (11/01/25)
- Save Job - Related Jobs - Block Source
Senior Embedded Software Engineer

Cisco (Milpitas, CA)

…to join our Diagnostic/BSP team, responsible for ensuring the reliability and performance of our world-class hardware. Our team develops software for Cisco's network ... responses to the diverse workload demands of AI and ML . This is a unique opportunity to grow your...grow your technical skill set and gain visibility and recognition across cross-functional teams within Cisco. We value motivated… more

Cisco (12/18/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, Profiling…

NVIDIA (Santa Clara, CA)

…solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms: Transform user ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more

NVIDIA (12/18/25)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect, Spectrum-X Low…

NVIDIA (Santa Clara, CA)

NVIDIA networking designs and manufactures high- performance networking equipment that enable the most powerful super computers in the largest data centers in the ... or RoCE (RDMA over Converged Ethernet) we make powerful ML /AI platforms possible. We believe in our people and...in large clusters even more performant. As a networking Sr . Solutions Architect at NVIDIA you will have agency… more

NVIDIA (12/11/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…for customer use cases, ensuring optimal end-to-end workflows and balanced accuracy- performance trade-offs. + Conduct deep GPU kernel -level profiling to ... and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI… more

NVIDIA (09/23/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer - Windows Silicon…

Microsoft Corporation (Mountain View, CA)

…teams to bring compelling new experiences to market. In addition to developing kernel and user-mode drivers, you will have the unique opportunity to work across ... teams to analyze and fix performance bottlenecks throughout the AI stack. Microsoft's mission is...on the architecture of Graphics and AI user-mode and kernel -mode drivers. + Leads by example within the team… more

Microsoft Corporation (12/06/25)
- Save Job - Related Jobs - Block Source
Senior Solution Engineer, System…

NVIDIA (Santa Clara, CA)

…for an experienced engineer to triage customers' hardware platform issues and AI/ ML workloads in huge datacenters of rack-scale platforms, solve customer problems, ... solid programming skills, and experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus. What you'll be… more

NVIDIA (12/16/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, AI Inference…

NVIDIA (Santa Clara, CA)

…of experience; or PhD degree with the thesis and top-tier publications in ML Systems, GPU architecture, or high- performance computing. + Strong programming ... distributed systems, deep learning theories. + Knowledgeable and passionate about performance engineering in ML frameworks (eg, PyTorch) and inference… more

NVIDIA (11/27/25)
- Save Job - Related Jobs - Block Source
Senior Research Engineer - Cuda AI Quality

NVIDIA (Santa Clara, CA)

NVIDIA's AI Developer Tools organization is seeking a Senior Research Engineer to join our Quality team, where we're building the definitive benchmarks and ... teams developing CUDA-focused AI tools to provide evaluation insights, identify performance gaps, and integrate novel capabilities (eg, RAG, profiling, web research)… more

NVIDIA (12/11/25)
- Save Job - Related Jobs - Block Source
Agentic AI, AI & Data Specialist Senior

Deloitte (San Jose, CA)

…critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI ... and business applications - delivering production-grade reliability, scalability, and performance . + Engineer core solution components directly, including data… more

Deloitte (11/22/25)
- Save Job - Related Jobs - Block Source
Agentic AI, AI & Data Senior Manager

Deloitte (San Jose, CA)

…critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI ... for this role ends 12/20/2025 Work you'll do A Senior Manager contributes to the firm's growth and development...for which you coach as a part of the performance management process. The team Our AI & Data… more

Deloitte (12/16/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, ... of use, compute and memory efficiency, and achieving the best accuracy- performance tradeoffs through software-hardware co-design. Your work will span multiple layers… more

NVIDIA (12/18/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search