- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... of AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the… more
- Capital One (San Francisco, CA)
- Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700… more
- Meta (Menlo Park, CA)
- …the new network products that enable networking for AI training and Inference .A Network Production Engineer in this role would support leading Meta's server ... **Summary:** Meta is seeking a Production Engineer with in-depth understanding of networking, systems, automation,...network is a foundational component in achieving the company's AI goals and this role would play a key… more
- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number ... of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel… more
- Genentech (South San Francisco, CA)
- …scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack engineering, you will be… more
- Genentech (South San Francisco, CA)
- …scale and optimize workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on backend and data engineering, you will… more
- Oracle (Redwood City, CA)
- … at the Principal Engineer level. We are at the forefront of AI innovation, exploring the next generation of AI accelerators and hardware solutions. As ... a Senior Principal software engineer , part of our growing team, you will be...will be involved in evaluation, prototyping, and optimizing cutting-edge AI hardware, AI accelerators, including custom-designed … more
- Amazon (San Francisco, CA)
- Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers like Pieter ... run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
- quadric.io, Inc (Burlingame, CA)
- …both NN graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands-on ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...users in the field. The AI Application Engineer will [1] integrate Quadric… more
- General Motors (San Francisco, CA)
- **Job Description** **Senior AI /ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... training, and inference of ML models. You will develop and enhance...toolchain and stack, to leverage the latest advancements in AI + Influence model architecture decisions and strategy within… more
- Amazon (San Francisco, CA)
- Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production ... scale. As a Software Development Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
- Meta (Menlo Park, CA)
- …and FAIR orgs to produce SOTA research and results. **Required Skills:** Research Engineer , Conversational AI - Reality Labs Responsibilities: 1. Design methods, ... **Summary:** Reality Labs is seeking a Research Engineer to join our Large Language Model (LLM)...Language Model (LLM) Research team for the device driven AI Assistant effort. We conduct focused research and engineering… more
- Meta (Menlo Park, CA)
- … inference ; and/or multilingual and multimodal modeling. **Required Skills:** Research Engineer , Language - Generative AI Responsibilities: 1. Design methods, ... **Summary:** Meta is seeking a Research Engineer to join our Large Language Model (LLM)...for strong engineers who have a background in generative AI and NLP, with experience in areas like language… more
- Meta (Menlo Park, CA)
- **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI . This results in a dramatic ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI /HPC System Performance Engineer Responsibilities: 1. Lead… more
- Amazon (San Francisco, CA)
- Description We are seeking a highly skilled Machine Learning Systems Engineer to join Frontier AI Robotics team. This role focuses on building and optimizing ... scientists and engineers to deliver scalable, high-performance systems that power state-of-the-art AI research and applications. About the team At Frontier AI … more
- Elevance Health (San Francisco, CA)
- **Lead AI Platform Engineer ** **Location:** This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while ... for employment, unless an accommodation is granted as required by law. The **Lead AI Platform Engineer ** will own technical outcomes for core areas of the… more
- Meta (Menlo Park, CA)
- **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI . This results in a dramatic ... and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active member of… more
- DoorDash (San Francisco, CA)
- …systems that empower efficient machine learning at scale, especially in the Generative AI area. This is a remote opportunity, with Pacific Time working hours ... data and ML pipelines-from retrieval (eg, RAG) to batch inference -that adapt quickly to new technologies. + Develop an...fast iteration and deployment of products powered by Generative AI . + Improve the reliability, scalability, and monitoring of… more
- Meta (Menlo Park, CA)
- …MTIA (Meta Training & Inference Accelerator) Software team is part of the AI & Compute Foundation org. The team's mission is to explore, develop and help ... PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance 3. Analyze… more
- Meta (Menlo Park, CA)
- …authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms at Meta.You will be working on ... help in driving next generation hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer...core compilers to support new state of the art inference and training AI hardware accelerators and… more