Research Engineer Cuda Kernel Jobs | Juju

Research Engineer (Machine Learning)

Aldea Inc (San Francisco, CA)

…contextual, and intelligent human-machine interface. The Role We are hiring a Research Engineer (Machine Learning) to build the infrastructure that powers ... Aldea's multi-modal AI research . You will design, optimize, and scale the training...training or inference systems. Preferred Qualifications Experience with custom kernel development ( CUDA , Triton) or GPU optimization.… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Performance Engineer , GPU

Anthropic (San Francisco, CA)

…define the path forward Strong candidates may also have experience with GPU Kernel Development: CUDA , Triton, CUTLASS, Flash Attention, tensor core optimization ... innovations in GPU performance and systems engineering. As a GPU Performance Engineer , you'll architect and implement the foundational systems that power Claude and… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Model Inference

Apple Inc. (San Francisco, CA)

Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best map in the ... powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver...measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer

Red Hat, Inc. (Boston, MA)

…Must have two (2) years of experience with: Python and Modern C++; CUDA , Triton, or CUTLASS kernel optimization; Deep learning frameworks, including PyTorch; ... Machine Learning Engineer page is loaded## Machine Learning Engineerremote type:...on NVIDIA GPUs using tools such as Nsight, tune CUDA , Triton, or CUTLASS kernels for deep neural networks.*… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Training Performance Engineer

OpenAI (San Francisco, CA)

…the core distributed machine-learning training runtime that powers everything from early research experiments to frontier-scale model runs. With a dual mandate to ... iterate quickly and run reliably at any scale, partnering closely with model-stack, research , and platform teams. Success for us is measured by raising both training… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Machine Learning Engineer

Relace (San Francisco, CA)

…environments. Requirements Strong background in systems-level ML engineering. Experience with CUDA , GPU kernel optimization, and performance tuning. Fluency in ... you. The Role We're looking for a Machine Learning Engineer who loves getting close to the metal. This...smart systems design. The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Staff Machine Learning Engineer

kadence (San Francisco, CA)

…Nice to Have Deep understanding of training architectures (PyTorch/JAX internals, CUDA kernel optimization, TPU environments). Experience building or managing ... datasets. About the Role We're looking for a Machine Learning Engineer with hands‑on experience in model development (training, fine‑tuning, feature engineering)… more

job goal (01/14/26)
- Save Job - Related Jobs - Block Source
Senior Research Engineer…

NVIDIA (Santa Clara, CA)

NVIDIA's AI Developer Tools organization is seeking a Senior Research Engineer to join our Quality team, where we're building the definitive benchmarks and ... important parallel computing platform. Our growing team operates at the intersection of CUDA domain expertise and cutting-edge AI research . While evaluation is… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Sr. ML Kernel Performance Engineer…

Amazon (Cupertino, CA)

…or HPC such as GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU kernel optimization and GPGPU computing such as CUDA , NKI, Triton, OpenCL, SYCL, ... on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing performance for… more

Amazon (11/14/25)
- Save Job - Related Jobs - Block Source
Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…Face, vLLM, SGLang). You may also dive deeper into GPU-level optimization, including custom kernel development with CUDA and Triton. This role offers a unique ... with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Software Engineer , Systems ML - Frameworks…

Meta (Menlo Park, CA)

…hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware, directly impacts performance and deployment ... software codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels Responsibilities: 1.… more

Meta (12/20/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...and optimizations using TensorRT LLM, VLLM, SGLang, Triton and CUDA kernels. Work and collaborate with a diverse set… more

NVIDIA (11/25/25)
- Save Job - Related Jobs - Block Source
Staff Optical Image Processing Engineer

Lockheed Martin (Manassas, VA)

…feature extraction, registration\) or comparable signal processing techniques *Proven CUDA development experience \( kernel coding, memory management, performance ... program in Manassas, Virginia, is seeking a skilled Optical Image Processing Engineer to support the development and integration of advanced imaging capabilities for… more

Lockheed Martin (11/07/25)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software Engineer…

NVIDIA (Santa Clara, CA)

We're now looking for a Senior Deep Learning Software Engineer for our cuDNN team! Do you love writing fast code and crafting software systems to solve complex ... across the codebase, including API design, software architecture, testing, and GPU kernel development. + Mentoring junior engineers on the team. What we need… more

NVIDIA (11/02/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , AI Inference…

NVIDIA (Santa Clara, CA)

…build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization. + Define and ... inference deployments on GPU clusters across clouds. + Conduct and publish original research that pushes the pareto frontier for the field of ML Systems; survey… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Developer Technology Engineer

NVIDIA (Santa Clara, CA)

We're currently seeking a Senior Developer Technology Engineer ! NVIDIA's Developer Technology Engineering team is a global network of world-class experts ... will be doing: + In this role, you will research and develop techniques to accelerate top CSP workloads...algorithms. + A background that includes parallel programming, ideally CUDA C/C++. + Hands on experience doing low-level performance… more

NVIDIA (12/10/25)
- Save Job - Related Jobs - Block Source
Principal Developer Technology Engineer

NVIDIA (Santa Clara, CA)

We're currently seeking a Principal Developer Technology Engineer ! Are you interested in developing techniques to accelerate large application workloads on advanced ... will be doing: + In this role, you will research and develop techniques to accelerate top CSP workloads...algorithms. + A background that includes parallel programming, ideally CUDA C/C++. + Hands on experience doing low-level performance… more

NVIDIA (11/11/25)
- Save Job - Related Jobs - Block Source
Sr Principal Machine Learning Engineer…

Palo Alto Networks (Santa Clara, CA)

…are a significant plus. + Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus. + ... posture from development through runtime. As a Senior Principal Machine Learning Engineer , you will drive research on cutting-edge areas, including AI-Native… more

Palo Alto Networks (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Algorithm Engineer

NVIDIA (Santa Clara, CA)

…We collaborate with a broad cross section of teams at NVIDIA ranging from DL research teams to CUDA Kernel and DL Framework development teams, to ... We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning...inference across diverse GPU platforms. You will collaborate with research scientists, software engineers, and hardware specialists to bring… more

NVIDIA (11/06/25)
- Save Job - Related Jobs - Block Source
Senior Software Engineer , Deep Learning…

NVIDIA (Santa Clara, CA)

…MLIR) without forgoing performance + Stay up to date with the latest research and innovations in deep learning, implement and experiment with new insights to ... deep learning models and optimizations such as graph fusions, kernel implementation, KV Caching etc. + Domain experience in...operators + Experience with NVIDIA software libraries such as CUDA and TensorRT + In depth experience with the… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search