- Apple Inc. (San Francisco, CA)
- Senior Software Engineer, Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best ... take end-to-end ownership, and deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead the design… more
- MongoDB (Palo Alto, CA)
- We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and ... Atlas and designed for developer-first experiences. As a Senior Engineer, you'll focus on building core systems and services... What You'll Do Design and build components of a multi-tenant inference platform integrated directly with MongoDB… more
- Together AI (San Francisco, CA)
- Role Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI ... inference systems. This role involves working with state-of-the-art large language models and ensuring they run efficiently and effectively at scale. If you… more
- Together AI (San Francisco, CA)
- About the Role Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated AI ... C/C++. Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, and storage, performance, and scale.… more
- GEICO (Palo Alto, CA)
- …Careers. GEICO AI ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus ... Design, implement, and maintain feature stores for ML model training and inference pipelines. Build and optimize LLM inference systems using frameworks… more
- NVIDIA Corporation (Santa Clara, CA)
- Principal Software Engineer - Large-Scale LLM Memory and Storage Systems ... Posted today. Job requisition ID: JR2010271. NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across … more
- Baseten (San Francisco, CA)
- Senior Software Engineer - Enterprise Platform Base Pay Range ... request routing, enterprise‑grade data and security integrations, and more. Example Initiatives Multi‑cloud capacity management Inference on B200 GPUs Multi… more
- Baseten (San Francisco, CA)
- …build the platform engineers turn to to ship AI products. The Role As a Senior Software Engineer on the Core Product team at Baseten, you will be building and ... all new product development at the company. Example Initiatives Chains for multi‑component workflows Asynchronous inference Model APIs for frontier models Model… more
- NVIDIA (Santa Clara, CA)
- NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across multi-node distributed environments. ... resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory management of large-scale… more
- Vizcom (San Francisco, CA)
- …a modern TypeScript stack, and serving real enterprise The Role As the Senior Software Engineer - Backend (Systems / Infrastructure) you'll architect and deliver ... caching, and observability Collaborate with AI engineers to integrate GPU inference pipelines into user workflows Improve reliability: lead incident reviews,… more
- Chef Robotics (San Francisco, CA)
- …and tech leaders from leading companies. About The Role As a Senior Software Engineer, Backend specializing in database architecture and AI systems, you ... and architecture for real-time robotics operations. As a senior engineer, you will mentor team members and drive technical... data storage and retrieval systems for training datasets and inference results Design and implement systems to collect and… more
- Aira Technologies (San Francisco, CA)
- Senior Software Engineer - Data Platform We build AI agents that run cellular networks. Mobile ... monolithic systems don't scale, operational complexity multiplies. We deploy multi-agent systems with fine-tuned LLMs that autonomously optimize network performance,… more
- Mundanelabs (Palo Alto, CA)
- Principal Software Engineer, Embodied Systems Mundane is a venture-backed seed-stage robot learning startup founded by a team of Stanford researchers and ... team of engineers, roboticists, and dreamers. About the Role As a Principal Software Engineer on the Embodied Systems team, you'll architect and… more
- Waymo (Seattle, WA)
- …from a diverse set of sensors, allowing ML practitioners like you to develop multi‑modal models and techniques at scale. You will report to our Head of Perception. ... large‑scale data extraction for model training, large model training pipelines, model inference systems, etc. Work closely with all the infrastructure teams that… more
- PubMatic, Inc. (Redwood City, CA)
- Senior Principal Software Engineer, Mobile App Monetization About the Role: PubMatic is seeking an experienced and technically driven Principal Software ... rewarded video, and native ads. The ideal candidate is a seasoned backend engineer with deep domain expertise in mobile app advertising, OpenRTB protocols, and… more
- Avy (San Francisco, CA)
- …talk with your recruiter to learn more. Base pay range $142,000.00/yr - $188,000.00/yr Software Engineer Location: San Francisco, CA (Hybrid) Avy's mission is to ... execute at the highest level. The Opportunity As a Software Engineer at Avy, your job is... Optimize system performance across the entire stack, from model inference to distributed agent orchestration Promote effective engineering culture… more
- Second Renaissance (San Francisco, CA)
- Overview As a Senior Staff Software Optimization Engineer you will collaborate with a specialized team to drive C++ system optimizations for enhancing LLM model ... inference and training performance. You will play a pivotal... algorithms to enhance performance. Stay at the forefront of software, hardware optimization and AI advancements, utilizing this knowledge… more
- Scale AI (San Francisco, CA)
- Senior Software Engineer, Enterprise GenAI Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge ... retrieval, inference, evaluation, and more. We are looking for a strong engineer to join our team and help us build and scale our product in a fast-paced… more
- Aldea Inc (San Francisco, CA)
- …systems that enable rapid iteration. Scale infrastructure from single-node to multi-node distributed training and deploy production inference systems for ... Aldea is a multi-modal foundational AI company reimagining the scaling laws... today's architectures create unnecessary bottlenecks for the evolution of software. Our mission is to build the next generation… more
- HP IQ (San Francisco, CA)
- …seamlessly integrating with cloud infrastructure. We are looking for a Senior Software Engineer to design and develop high‑performance, scalable services to ... edge devices. Optimize data pipelines and storage solutions for real‑time AI inference and processing. Implement security and privacy best practices for distributed… more