- DataRobot (San Francisco, CA)
- …& Libraries, LLM Onboarding,Tools, Multi-Agent Evaluations, Multimodality, etc.) and GenAI systems (eg Inference optimization, Distributed Training, Finetuning, ... today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be...DataRobot, you will be the technical anchor for our GenAI Tooling and Systems teams, shaping the architecture, ensuring… more
- Meta (Menlo Park, CA)
- … GenAI /LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - Scaling / Performance Responsibilities: 1. Enabling reliable ... products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU… more
- Genentech (South San Francisco, CA)
- …and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI/ML models and output, and building impactful ... the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely...everyone in between. You'll build, own, and constantly improve scalable AI/ML based systems that unlock the potential of… more
- Meta (Menlo Park, CA)
- …products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU ... to improve the full-stack distributed ML reliability and performance (eg Large-Scale GenAI /LLM training) from the trainer down to the inter-GPU and network… more