- Aldea Inc (San Francisco, CA)
- …contextual, and intelligent human-machine interface. The Role We are hiring a Research Engineer (Machine Learning) to build the infrastructure that powers ... Aldea's multi-modal AI research . You will design, optimize, and scale the training...training or inference systems. Preferred Qualifications Experience with custom kernel development ( CUDA , Triton) or GPU optimization.… more
- Anthropic (San Francisco, CA)
- …define the path forward Strong candidates may also have experience with GPU Kernel Development: CUDA , Triton, CUTLASS, Flash Attention, tensor core optimization ... innovations in GPU performance and systems engineering. As a GPU Performance Engineer , you'll architect and implement the foundational systems that power Claude and… more
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best map in the ... powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver...measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead… more
- Red Hat, Inc. (Boston, MA)
- …Must have two (2) years of experience with: Python and Modern C++; CUDA , Triton, or CUTLASS kernel optimization; Deep learning frameworks, including PyTorch; ... Machine Learning Engineer page is loaded## Machine Learning Engineerremote type:...on NVIDIA GPUs using tools such as Nsight, tune CUDA , Triton, or CUTLASS kernels for deep neural networks.*… more
- OpenAI (San Francisco, CA)
- …the core distributed machine-learning training runtime that powers everything from early research experiments to frontier-scale model runs. With a dual mandate to ... iterate quickly and run reliably at any scale, partnering closely with model-stack, research , and platform teams. Success for us is measured by raising both training… more
- Relace (San Francisco, CA)
- …environments. Requirements Strong background in systems-level ML engineering. Experience with CUDA , GPU kernel optimization, and performance tuning. Fluency in ... you. The Role We're looking for a Machine Learning Engineer who loves getting close to the metal. This...smart systems design. The ideal candidate is excited by CUDA kernels, memory layouts, GPU scheduling, and squeezing performance… more
- kadence (San Francisco, CA)
- …Nice to Have Deep understanding of training architectures (PyTorch/JAX internals, CUDA kernel optimization, TPU environments). Experience building or managing ... datasets. About the Role We're looking for a Machine Learning Engineer with hands‑on experience in model development (training, fine‑tuning, feature engineering)… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Developer Tools organization is seeking a Senior Research Engineer to join our Quality team, where we're building the definitive benchmarks and ... important parallel computing platform. Our growing team operates at the intersection of CUDA domain expertise and cutting-edge AI research . While evaluation is… more
- Amazon (Cupertino, CA)
- …or HPC such as GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU kernel optimization and GPGPU computing such as CUDA , NKI, Triton, OpenCL, SYCL, ... on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing performance for… more
- NVIDIA (Santa Clara, CA)
- …Face, vLLM, SGLang). You may also dive deeper into GPU-level optimization, including custom kernel development with CUDA and Triton. This role offers a unique ... with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal… more
- Meta (Menlo Park, CA)
- …hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware, directly impacts performance and deployment ... software codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels Responsibilities: 1.… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is...and optimizations using TensorRT LLM, VLLM, SGLang, Triton and CUDA kernels. Work and collaborate with a diverse set… more
- Lockheed Martin (Manassas, VA)
- …feature extraction, registration\) or comparable signal processing techniques *Proven CUDA development experience \( kernel coding, memory management, performance ... program in Manassas, Virginia, is seeking a skilled Optical Image Processing Engineer to support the development and integration of advanced imaging capabilities for… more
- NVIDIA (Santa Clara, CA)
- We're now looking for a Senior Deep Learning Software Engineer for our cuDNN team! Do you love writing fast code and crafting software systems to solve complex ... across the codebase, including API design, software architecture, testing, and GPU kernel development. + Mentoring junior engineers on the team. What we need… more
- NVIDIA (Santa Clara, CA)
- …build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization. + Define and ... inference deployments on GPU clusters across clouds. + Conduct and publish original research that pushes the pareto frontier for the field of ML Systems; survey… more
- NVIDIA (Santa Clara, CA)
- We're currently seeking a Senior Developer Technology Engineer ! NVIDIA's Developer Technology Engineering team is a global network of world-class experts ... will be doing: + In this role, you will research and develop techniques to accelerate top CSP workloads...algorithms. + A background that includes parallel programming, ideally CUDA C/C++. + Hands on experience doing low-level performance… more
- NVIDIA (Santa Clara, CA)
- We're currently seeking a Principal Developer Technology Engineer ! Are you interested in developing techniques to accelerate large application workloads on advanced ... will be doing: + In this role, you will research and develop techniques to accelerate top CSP workloads...algorithms. + A background that includes parallel programming, ideally CUDA C/C++. + Hands on experience doing low-level performance… more
- Palo Alto Networks (Santa Clara, CA)
- …are a significant plus. + Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus. + ... posture from development through runtime. As a Senior Principal Machine Learning Engineer , you will drive research on cutting-edge areas, including AI-Native… more
- NVIDIA (Santa Clara, CA)
- …We collaborate with a broad cross section of teams at NVIDIA ranging from DL research teams to CUDA Kernel and DL Framework development teams, to ... We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning...inference across diverse GPU platforms. You will collaborate with research scientists, software engineers, and hardware specialists to bring… more
- NVIDIA (Santa Clara, CA)
- …MLIR) without forgoing performance + Stay up to date with the latest research and innovations in deep learning, implement and experiment with new insights to ... deep learning models and optimizations such as graph fusions, kernel implementation, KV Caching etc. + Domain experience in...operators + Experience with NVIDIA software libraries such as CUDA and TensorRT + In depth experience with the… more