• OpenReq (Cupertino, CA)
    …with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software, LLM Compilation Software sells chips. Etched ... able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You ... issues that hurt performance. You will work with the software team to build integrations with existing libraries like…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web Services ... (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and ... enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... ML accelerators. Working across the stack from PyTorch to the hardware-software boundary, our engineers build systematic infrastructure, innovate new methods and…
    job goal (01/12/26)
  • Senior GenAI Algorithms Engineer
    NVIDIA (Santa Clara, CA)
    …AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. + Construct and curate large problem-specific datasets for post-training, ... focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from…
    NVIDIA (12/18/25)