- NVIDIA (Santa Clara, CA)
- …open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal and ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more
- Capital One (San Jose, CA)
- Lead AI Engineer (GenAI Platform Services) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring… more
- NVIDIA (Santa Clara, CA)
- …now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core (https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/core) and NeMo Framework ... ) team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native frameworks built for researchers and developers working… more
- DataRobot (San Francisco, CA)
- …& Libraries, LLM Onboarding, Tools, Multi-Agent Evaluations, Multimodality, etc.) and GenAI systems (e.g., Inference optimization, Distributed Training, Finetuning, ... today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be the technical anchor for our GenAI Tooling and Systems teams, shaping the architecture, ensuring… more
- Capital One (San Jose, CA)
- Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital ... our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will ... You will contribute to crafting an end-to-end GenAI SDK, CLI and starter kits that let AI… more
- Walmart (Sunnyvale, CA)
- …and infrastructure. All of these products and services are supported by scalable and powerful infrastructure, ensuring a secure and seamless employee and customer ... agents for multi-step reasoning, knowledge grounding, and decision-making. + Architect scalable, distributed AI systems with a focus on performance, fault tolerance,… more
- Honeywell (San Jose, CA)
- We're seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect, design, and develop advanced AI/ML systems that power our ... engineering teams, and collaborate with cross-functional teams to deliver intelligent, scalable, and production-ready AI and machine learning technologies. You will… more
- Meta (Menlo Park, CA)
- …GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - Scaling / Performance Responsibilities: 1. Enabling reliable ... products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU… more
- Genentech (South San Francisco, CA)
- …and optimise workflows. We also work on scaling up model training and inference, evaluating the quality of AI/ML models and output, and building impactful ... the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely ... everyone in between. You'll build, own, and constantly improve scalable AI/ML based systems that unlock the potential of… more
- Zscaler (San Jose, CA)
- …building frameworks for all products + Evaluate and integrate state-of-the-art GenAI advances (e.g., LLMs/SLMs, retrieval, fine-tuning, inference optimization) to ... agility with a cloud-first strategy. We're looking for an experienced Sr. Staff Software Engineer to join our Digital Experience team. This role is hybrid and based… more
- Walmart (Sunnyvale, CA)
- **Position Summary** We are seeking a highly motivated Machine Learning Engineer to join our Data Science team. In this role, you will not only design, develop, and ... at scale but also play a key role in shaping next-generation AI/ML and GenAI enabled products that will help our suppliers grow their business. You will collaborate… more
- NVIDIA (Santa Clara, CA)
- …and optimize diverse real world workloads. NeMo Framework is an open-source, scalable and cloud-native framework built for researchers and developers working on ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning,… more
- Google (Sunnyvale, CA)
- Staff Software Engineer, AI Data, Multimodal (Google, Sunnyvale, CA, USA) **Advanced** Experience owning outcomes and decision making, solving ... be credible with customers and engineers. + Understanding of genAI model development workflows for post-training and product fine-tuning, ... on and is growing every day. As a software engineer, you will work on a specific project critical… more
- Walmart (Sunnyvale, CA)
- …data developers and machine learning developers whose strengths are: (1) building scalable data pipelines (2) using machine learning techniques and data science (3) ... through data analysis. **What You'll Do:** Design & train GenAI models targeting CTV use cases: on-device LLM ... limited memory devices. + Hands-on experience with edge inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar).… more
- Meta (Menlo Park, CA)
- …products and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU ... to improve the full-stack distributed ML reliability and performance (e.g., Large-Scale GenAI/LLM training) from the trainer down to the inter-GPU and network… more