• OpenAI (San Francisco, CA)
    …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Databricks Inc. (San Francisco, CA)
    Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...with research. Are comfortable dealing with systems that span networking , distributed compute, and high-throughput data handling. Have familiarity… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …we make: a liquid market for GPU offtake. About the Role As a Principal Software Engineer - Networking , you'll be responsible for designing and operating ... distributed systems that can withstand node or cluster-wide failures Architect software -defined networking solutions that integrate with underlay switches and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Rockstar (San Francisco, CA)
    …promise is simple: they make your AI system better. They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core ... or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference , and long-term model maintenance. Their customers are Series A-C AI… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • DatologyAI (Redwood City, CA)
    …are in office 4 days a week. About the Role We're looking for an engineer with deep experience building and operating large-scale training and inference systems. ... infrastructure that powers both our internal ML research workflows and the high-performance inference pipelines that deliver curated data to our customers. As one of… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Lodestar (San Francisco, CA)
    …end-to-end, autonomous in-space bodyguarding service. About the Job At Lodestar, as a Software Engineer - Localization, State Estimation & Prediction , you'll be ... future trajectories and behavioral patterns of targets Implement intent inference models to identify actions and dynamically rank threat...more about the ITARhere . Pay Range: (E1) Junior Software Engineer : $120,000 - $140,000 / year… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Decagon AI, Inc. (San Francisco, CA)
    …The Infrastructure team builds and operates the foundations that power Decagon: networking , data, ML serving, developer platform, and real‑time voice. We partner ... around five focus areas: Core Infra: The foundational cloud stack- networking , compute, storage, security, and infrastructure‑as‑code-to ensure reliability, scale,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Baseten (San Francisco, CA)
    ABOUT BASETEN Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and ... operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    Sr. Software Development Engineer , Annapurna Labs In this role you will be responsible for leading a technical team that provides profiling and optimization ... fleet. You will work closely with the hardware and software teams to ensure that the right tools are...teams. Collect requirements from various other teams including training, inference , and runtime. Collaborate with the compiler performance team… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Sierra (San Francisco, CA)
    …Clay led the product and design teams for Google Workspace. What you'll do As a Software Engineer on our Site Reliability team at Sierra, you will be responsible ... Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture). Strong background in observability… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …with Linux kernel development, system programming, or related low-level software engineering Understand virtualization technologies (KVM, Xen, QEMU, etc.) and ... optimization for ML/AI specific workloads Network stack optimization and high-performance networking Experience with TPUs, custom ASICs, or other ML accelerators… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    …(our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by ... engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our teams breadth of talent, we… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Epsilon (San Francisco, CA)
    …(potentially infinite) external sites Qualifications 7+ years of experience as a software engineer , demonstrably delivering on time, at quality Full stack ... This is a unique opportunity for a strong generalist engineer who thrives in ambiguity, loves solving complex problems,...of terabytes of data Build and optimize ML model inference pipelines for both live traffic and offline data.… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Eridu Corporation (San Francisco, CA)
    …an RTL Engineer to help define and implement our industry‑leading Networking IC. If you're a highly motivated self‑starter eager to solve real‑world problems, ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Eridu Corporation (San Francisco, CA)
    …or FPGA prototyping platforms for pre-silicon validation. Exposure to hardware/ software co-validationfor networking protocols or control-plane software ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Roblox Corporation (San Mateo, CA)
    Senior Hardware Engineer - GPU & AI Infrastructure Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in ... meet Roblox's unique demands for real-time rendering and low-latency AI inference . Firmware & Systems: Lead firmware qualification (BIOS/BMC) and troubleshooting,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Eridu Corporation (San Francisco, CA)
    …an RTL Engineer to help define and implement our industry-leading Networking IC. If you're a highly motivated self-starter eager to solve real-world problems, ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source