• Menlo Ventures (San Francisco, CA)
    …The ideal candidate will have a strong background in CUDA and experience with distributed training. This role offers a competitive salary range of $166,000 - $225,000 ... A leading data and AI company based in San Francisco is seeking a Research Engineer to optimize GPU training models and frameworks.…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web Services ... software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia...such large models across the stack from system level optimizations through to PyTorch or JAX is a must…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators....such large models across the stack from system level optimizations through to PyTorch or JAX is a must…
    job goal (01/12/26)
  • Senior GenAI Algorithms Engineer…

    NVIDIA (Santa Clara, CA)
    …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI ... vLLM, SGLang). You may also dive deeper into GPU-level optimization, including custom kernel development with CUDA and Triton. This role offers a unique opportunity…
    NVIDIA (01/10/26)
  • Software Engineering Manager, ML Kernel

    Amazon (Cupertino, CA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... The Acceleration Kernel Library team is at the forefront of maximizing...AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators.…
    Amazon (12/04/25)
  • Senior Deep Learning Software Engineer, LLM…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing ... GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM, vLLM, SGLang, Triton and CUDA kernels. Work and…
    NVIDIA (11/25/25)