• OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...of what AI can do. We're expanding into multimodal inference , building the infrastructure needed to serve models that… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best ... take end-to-end ownership, and deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead the design… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …GPU offtake. About the Role As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase ... This isn't SaaS anymore - application layer companies sign multi -year contracts for computer and inference , but...fit for you if You enjoy the craftsmanship of software You're a thoughtful high-agency engineer Have… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Virtue AI (San Francisco, CA)
    …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer , you will own how models are served in production. Your job ... versioning, and backward compatibility Build routing and load‑balancing logic for inference traffic Multi ‑model routing Fallback and degradation strategies vLLM… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... scenarios (autoregressive models, denoising models, hierarchical models, state machines, multi -agent systems, cloud-based inference ). Adapt optimization solutions… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Pulse (San Francisco, CA)
    …models. Own profiling, batching, and autoscaling across single-tenant and multi -tenant environments. Responsibilities Build inference services with smart ... and growing quickly. What makes our tech special is our multi -stage architecture: Layout understanding with specialized component detection models Low-latency OCR… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …with event-driven or serverless architectures. Exposure to hybrid cloud or multi -cluster environments. Contributions to open-source ML or inference systems ... and B200s, ready to go for experimentation, full-scale model training, or inference . Our client operates high-performance GPU clusters powering some of the most… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that ... to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    …a variety of benefits, including business-only pricing and selection, a multi -seller marketplace, single- or multi -user business accounts, approval workflow, ... rapid pace. We are looking for a Machine Learning Engineer (MLE) to join the team to drive key...building data pipelines to produce inputs for training and inference in both online and offline contexts; 2) Training… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Voxel (San Francisco, CA)
    …by industry leading VC's. Voxel is looking for a Staff Machine-Learning Infrastructure Engineer to drive the next wave of our computer-vision platform for workplace ... stay within cost constraints. Build and operate training infrastructure - create multi -GPU / multi -node training frameworks (Ray, Spark, Kubernetes), optimize… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Vizcom (San Francisco, CA)
    …a modern TypeScript stack, and serving real enterprise The Role As the Senior Software Engineer - Backend (Systems / Infrastructure) you'll architect and deliver ... caching, and observability Collaborate with AI engineers to integrate GPU inference pipelines into user workflows Improve reliability: lead incident reviews,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Lodestar (San Francisco, CA)
    …end-to-end, autonomous in-space bodyguarding service. About the Job At Lodestar, as a Software Engineer - Localization, State Estimation & Prediction , you'll be ... future trajectories and behavioral patterns of targets Implement intent inference models to identify actions and dynamically rank threat...more about the ITARhere . Pay Range: (E1) Junior Software Engineer : $120,000 - $140,000 / year… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Baseten (San Francisco, CA)
    ABOUT BASETEN Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and ... operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …we make: a liquid market for GPU offtake. About the Role As a Principal Software Engineer - Networking, you'll be responsible for designing and operating the ... This isn't SaaS anymore - application layer companies sign multi -year contracts for computer and inference , but...footprint of GPU clusters. Your scope will include system software , orchestration, and distributed automation. This is a … more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • The San Francisco Compute Company (San Francisco, CA)
    …risk, there's a bubble. This isn't SaaS anymore - application layer companies sign multi -year contracts for computer and inference , but sell to customers on ... make: a liquid market for GPU offtake. About the Role As a Product Engineer at SFC, you'll build products which together de-risk the largest infrastructure buildout… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Virtue AI (San Francisco, CA)
    …essential guardrails and red‑teaming tools that enable organizations to deploy multi ‑modal AI applications confidently and responsibly. We are a well‑funded, ... for passionate builders to join our core team. Are you a high‑performing, motivated engineer ready to make a significant impact in the AI security space? Virtue AI… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Specter (San Francisco, CA)
    …the US Special Forces. Role + Responsibilities Specter is hiring an infrastructure software engineer to design, deploy, and scale distributed systems that power ... over their physical assets. To do so, we are creating a connected hardware- software ecosystem on top of multi -modal wireless mesh sensing technology. This… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Replicate, Inc. (San Francisco, CA)
    Staff Software Engineer - Machine Learning Platform (San Francisco) Replicate makes it easy for software engineers to run and customize machine learning ... utilization and reliability of our Kubernetes clusters and GPUs, including multi -regional traffic shifting and failover capabilities. Owning and optimizing fair and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …risk, there's a bubble. This isn't SaaS anymore - application layer companies sign multi -year contracts for computer and inference , but sell to customers on ... About the Role We're looking for a high agency engineer to help build the compute delivery platform that...systems that integrate our compute market with the orchestration software managing virtual machines running on cutting-edge HPC hardware.… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Decagon AI, Inc. (San Francisco, CA)
    …and on‑prem environments. ML Infra: GPU and model‑serving platforms for LLM inference with multi ‑provider routing and support for on‑prem/air‑gapped deployments. ... and accurately. About the Role We're hiring a Senior Infrastructure Engineer to design, build, and operate production infrastructure for high‑scale, low‑latency… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source