- OpenReq (Cupertino, CA)
- …with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. Software , LLM Compilation Software sells chips. Etched ... able to run transformer models, we still need production-grade software to map existing LLMs onto our chip. You...issues that hurt performance. You will work with the software team to build integrations with existing libraries like… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer , AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web Services ... (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models… more
- Amazon (San Francisco, CA)
- Software Development Engineer , AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... ML accelerators. Working across the stack from PyTorch till the hardware- software boundary, our engineers build systematic infrastructure, innovate new methods and… more
- NVIDIA (Santa Clara, CA)
- …AI software stack, eg, TensorRT Model Optimizer, NeMo/Megatron, and TensorRT- LLM . + Construct and curate large problem specific datasets for post-training, ... focuses on optimizing generative AI models such as large language models ( LLM ) and diffusion models for maximal inference efficiency using techniques ranging from… more
Locations:
California