- Virtue AI (San Francisco, CA)
- An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires ... deep knowledge of serving LLMs and experience in designing inference APIs. Candidates should be comfortable in a fast-paced...presents an opportunity to work at the cutting edge of AI security with competitive compensation and growth potential.… more
- Baseten (San Francisco, CA)
- …is seeking a skilled individual to enhance the API infrastructure supporting AI models. The role involves designing and optimizing backend services, focusing on ... performance and reliability. Candidates should have over 3 years of experience with distributed systems and be comfortable debugging complex systems. This unique opportunity includes a competitive compensation package and a supportive culture emphasizing… more
- Databricks Inc. (San Francisco, CA)
- …leading AI -focused technology company in San Francisco is seeking a Software Engineer for GenAI inference . In this role, you'll design, develop, and optimize ... the inference engine powering the Foundation Model API. You will collaborate closely with researchers and engage in performance-critical system challenges, focusing… more
- Capital One (San Francisco, CA)
- …experience programming with Python, Go, Scala, or Java* 6 years of experience deploying scalable and responsible AI solutions on cloud platforms (eg AWS, Google ... Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java,… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... our large language model (LLM) serving systems are fast, scalable , and efficient. Your work will touch the full...and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration… more
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the architecture, ... development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production demands, ensuring… more
- Arcade (San Francisco, CA)
- …technology company in San Francisco seeks a Software Engineer to build scalable backend systems and enhance generative AI workflows. The ideal candidate will ... design efficient architecture for model execution and collaborate closely with product and AI teams. This role offers the opportunity to innovate in a fast-paced… more
- OpenAI (San Francisco, CA)
- …on delivering a world-class developer experience while pushing the boundaries of what AI can do. We're expanding into multimodal inference , building the ... About the Team OpenAI's Inference team powers the deployment of our most...Our work ensures these models are available, performant, and scalable in production, and we partner closely with Research… more
- Pantera Capital (San Francisco, CA)
- …to build, deploy, and optimize our large-scale AI training and inference clusters Responsibilities Design, deploy, and maintain scalable Kubernetes clusters ... Department AI We are looking for an AI Infra engineer to join our growing...APIs and orchestration systems for both training pipelines and inference services Implement resource scheduling and job management systems… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform. You will lead the design and implementation of core ... From day one, you'll own critical subsystems for managed AI inference , helping to serve large language...scalable , and impactful products. Bonus Points: Experience with AI /ML frameworks such as TensorFlow, PyTorch, or Hugging Face… more
- Capital One National Association (San Francisco, CA)
- …upon number of hours to be regularly worked. McLean, VA: $225,400 - $257,200 for Sr. Lead AI Engineer New York, NY: $245,900 - $280,600 for Sr. Lead AI ... our industry leading capabilities with breakthrough product experiences and scalable , high‑performance AI infrastructure. At Capital One,...San Francisco, CA: $245,900 - $280,600 for Sr. Lead AI Engineer San Jose, CA: $245,900 -… more
- Fabrion (San Francisco, CA)
- …across many tenants, models, and tasks. This is the backbone of a secure, reliable, and scalable AI -native enterprise system. If you dream about using AI to ... About the Role ML Ops Engineer - Agentic AI Lab (Founding...model governance, and security. Responsibilities Build and maintain secure, scalable , and automated pipelines for: LLM fine-tuning, SFT, LoRA,… more
- Eloquent AI (San Francisco, CA)
- …Kubernetes, and LLM and ML deployment pipelines. If you're passionate about scalable AI systems and optimizing ML models for real-world applications, ... of financial services. Your Role As a Senior Software Engineer , AIOps & Infrastructure at Eloquent AI ,...you will be responsible for designing, building, and optimizing scalable , high-performance AI infrastructure to support the… more
- Genentech (San Francisco, CA)
- …Product leaders, DevOps, and everyone in between. You'll build, own, and constantly improve scalable AI /ML based systems that unlock the potential of our diverse ... We also work on scaling up model training and inference , evaluating the quality of AI /ML models...the scientific needs. The Opportunity: As a machine learning engineer in AI Enablement, you will be… more
- Qualified (San Francisco, CA)
- Overview Qualified is the Agentic Marketing Platform for B2B companies. With Piper the AI SDR Agent, Qualified offers a new way to grow inbound pipeline. Piper ... Qualified, we are developing PipelineAI, an enterprise-grade SaaS platform powered by AI , with the goal of revolutionizing pipeline generation for our customers. Our… more
- Fabrion (San Francisco, CA)
- ML/ AI Research Engineer - Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful ... knowledge graphs, and multi‑tenant governance. We're looking for an ML/ AI Research Engineer to join our ...to optimize agent behaviors (eg RLHF, DPO, PPO) Establish scalable evaluation harnesses for LLM and agent performance, including… more
- Genentech (San Francisco, CA)
- …leaders, DevOps, and everyone in between. You\'ll build, own, and constantly improve scalable AI /ML based systems that unlock the potential of our diverse ... We also work on scaling up model training and inference , evaluating the quality of AI /ML models...the scientific needs. The Opportunity As a machine learning engineer in AI Enablement, you will be… more
- Tonal (San Francisco, CA)
- …and Power Progress for our members. Overview Tonal is looking for an AI Engineer to help expand Tonal's intelligence across movements, training modalities, ... ML best practices Who You Are A self driven AI or ML Engineer passionate about bringing...IMUs, or cameras Familiar with MLOps best practices and scalable model training pipelines Strong communicator who can collaborate… more
- Genentech (San Francisco, CA)
- …Product leaders, DevOps, and everyone in between. You'll build, own, and constantly improve scalable AI /ML based systems that unlock the potential of our diverse ... We also work on scaling up model training and inference , evaluating the quality of AI /ML models...meet the scientific needs. The Opportunity: As a software engineer in AI Enablement with a focus… more
- Synagi (San Francisco, CA)
- At Synagi, we are pushing the frontier of distributed and decentralised AI agents. Our research spans vector-driven retrieval systems, agentic swarms, and ... focus on real-world performance and human-in-the-loop alignment. We explore scalable , context-aware multi-agent designs that outperform monolithic approaches and… more