- Hamilton Barnes Associates Limited (San Francisco, CA)
- …of the most advanced AI workloads worldwide. They're now building a serverless inference platform , beginning with cost-efficient batch inference and ... want to miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency workloads. Design, build,… more
- quadric.io, Inc (Burlingame, CA)
- …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...platform ; [2] optimize the model deployment for efficient inference ; [3] profile and benchmark the model performance. This… more
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer , Model Inference San...best map in the world. In this role on ML Platform , you will help bring advanced deep learning and large ... measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead...will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models… more
- Menlo Ventures (Burlingame, CA)
- …About the Role This role is for a highly-skilled ML Research Engineer who thrives at the intersection of fundamental research and production-grade engineering. ... models that form the backbone of our drug discovery platform . Positions are available at various levels of seniority:.... Positions are available at various levels of seniority: Senior , Staff, and Principal. You will Drive the R&D… more
- Comfy (San Francisco, CA)
- A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal ... candidate will engage in building efficient AI models and tackling complex challenges. The role requires a strong background in PyTorch and a passion for pushing performance limits. Join a dynamic team focused on creating innovative AI solutions and shaping… more
- Inference (San Francisco, CA)
- A technology company in San Francisco is seeking a Senior Full-Stack Engineer to develop frontend features for its AI platform . Responsibilities include ... optimizing performance, mentoring team members, and creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered.… more
- Inference (San Francisco, CA)
- Inference .net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful, performant web experiences that give users super-powers over ... our globally distributed LLM inference platform . If you love shipping React apps that feel snappy at planet-scale, we'd love to meet you. About Inference .net… more
- Quadric Inc. (Burlingame, CA)
- …Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from ... graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands‑on users in the… more
- quadric.io, Inc (Burlingame, CA)
- …Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from ... code and conventional C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and hands-on users in the… more
- harvey.ai (San Francisco, CA)
- …being written today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive ... incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform , and deep domain expertise, we're reshaping how critical knowledge work… more
- Ellipsis Health (San Francisco, CA)
- …of the most preeminent venture capital teams. We are currently looking for an experienced Senior Data Platform Engineer , at the Staff or Principal level, ... Responsibilities Design, develop, and operation of a scalable and secure data platform to support analytics, ML Ops, and business intelligence Collaborate closely… more
- Synagi (San Francisco, CA)
- …ML tasks. Nice-to-Haves Experience with DeepSpeed or vLLM for efficient inference serving. Familiarity with LangChain or LlamaIndex for rapid agent prototyping. ... in decentralised or edge deployments (eg, WASM at the edge) for ultra-low-latency inference . Applying Send your resume-plus a short note (3-5 sentences) describing a… more
- Capital One (San Francisco, CA)
- …services* Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, ... developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and… more
- Lambda Inc. (San Francisco, CA)
- Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference . Lambda's mission is to make compute as ubiquitous as electricity ... securely and accurately. Our scope includes enterprise applications, integrations, data platform and analytics, compliance automation, and all things IT. What You'll… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform . You will lead the design and implementation of core ... computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability...Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team.… more
- Databricks Inc. (San Francisco, CA)
- …customers to operationalize models at scale with strong SLAs and cost efficiency. As a Senior Engineer , you'll play a critical role in shaping both the product ... do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform . You will lead the design and implementation of core ... About This Role: As a Senior Staff Software Engineer on the...day one, you'll own critical subsystems for managed AI inference , helping to serve large language models (LLMs) to… more
- Roblox Corporation (San Mateo, CA)
- Senior Hardware Engineer - GPU & AI...and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that ... meet Roblox's unique demands for real-time rendering and low-latency AI inference . Firmware & Systems: Lead firmware qualification (BIOS/BMC) and troubleshooting,… more
- Icon Ventures (San Francisco, CA)
- …their outcomes in the most effective and delightful way. Our $1B+ learning platform serves tens of millions of students every month, including two-thirds of US ... with Product, Data Science, and the AI & Data Platform to deliver an AI‑driven learning coach that's recognized...as best‑in‑class. About the Role As an Applied AI Engineer , you will be working at the forefront of… more
- Qualified (San Francisco, CA)
- …exceptional value to our customers and driving our organization's success. Your Opportunity as a Senior Software Engineer As a Senior Software Engineer , ... Overview Qualified is the Agentic Marketing Platform for B2B companies. With Piper the AI...optimizing these pipelines to support RAG models\' training and inference processes efficiently. Ensure the core functionality of our… more