- Amazon (Cupertino, CA)
- …The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's ... ML accelerators. Working across the stack from PyTorch till the hardware- software boundary, our engineers build systematic infrastructure, innovate new methods and… more
- NVIDIA (Santa Clara, CA)
- …AI software stack, eg, TensorRT Model Optimizer, NeMo/Megatron, and TensorRT- LLM . + Construct and curate large problem specific datasets for post-training, ... focuses on optimizing generative AI models such as large language models ( LLM ) and diffusion models for maximal inference efficiency using techniques ranging from… more