- NVIDIA (Santa Clara, CA)
- …Collaborate with model AI inference and compiler teams to iterate on kernel fusion, auto tuning, and sophisticated GPU programming techniques. + Benchmark ... speed and flexibility. + Design and build high-level DSLs and innovative compiler infrastructure to increase kernel developer productivity while achieving near… more
- NVIDIA (Santa Clara, CA)
- We're looking for a Senior Performance Compiler Engineer to join our team and work on the open-source Triton compiler project. This opportunity involves ... point) to identify new opportunities for optimization. + Designing and implementing compiler technology using MLIR to optimize high-level kernel descriptions… more
- Amazon (Cupertino, CA)
- …software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler , Neuron Kernel Interface (NKI) compiler , and runtime that ... Neuron Kernel Interface (NKI) is a bare-metal language and compiler for directly programming NeuronDevices available on AWS Trn/Inf instances. You can… more
- Amazon (Cupertino, CA)
- …ML or HPC such as GPUs, CPUs, FPGAs, or custom architectures - Experience with GPU kernel optimization and GPGPU computing such as CUDA, NKI, Triton, OpenCL, ... https://github.com/aws/aws-neuron-sdk https://www.amazon.science/how-silicon-innovation-became-the-secret-sauce-behind-awss-success Key job responsibilities Our kernel engineers collaborate across compiler… more
- Amazon (Cupertino, CA)
- …software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler , Neuron Kernel Interface (NKI) compiler , and runtime that ... Neuron Kernel Interface (NKI) is a bare-metal language and compiler for directly programming NeuronDevices available on AWS Trn/Inf instances. You can… more
- NVIDIA (Santa Clara, CA)
- Do you have expertise in CUDA kernel optimization, C++ systems programming, or compiler infrastructure? Join NVIDIA's nvFuser (https://github.com/NVIDIA/Fuser) ... with hardware architects, framework maintainers, and optimization experts to create compiler infrastructure that advances GPU performance, developing manual… more
- Meta (Menlo Park, CA)
- …high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for ... codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels...one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and… more
- NVIDIA (Santa Clara, CA)
- …As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and ... in machine learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++,… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Software Engineer , LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about ... and serving of these DL solutions. We specialize in developing GPU -accelerated Deep learning software like TensorRT, DL benchmarking software and performant… more
- NVIDIA (Santa Clara, CA)
- …Ways to stand out from the crowd: + Tuning BLAS or deep learning library kernel code + CUDA/OpenCL GPU programming + Numerical methods and linear algebra + ... We are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning...revolution in artificial intelligence! We're always striving for peak GPU efficiency on current and future-generation GPUs. To get… more
- NVIDIA (Santa Clara, CA)
- …AI with GPU computing workflows (eg, CUDA, PyTorch, Triton, or compiler toolchains). + Knowledge of planning algorithms or program synthesis using LLMs. + ... Learning Safety team is looking for a Senior Software Engineer to build intelligent, autonomous software for the next...+ Integrate agentic AI systems with NVIDIA platforms, including GPU computing frameworks and runtime engines. + Work with… more
- Meta (Menlo Park, CA)
- …Software stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next ... way. As part of the AI acceleration software stack, we develop kernel libraries exploiting various hardware architectural features, achieving high performance for… more