- OpenAI (San Francisco, CA)
- …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
- Menlo Ventures (San Francisco, CA)
- …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with… more
- OpenAI (San Francisco, CA)
- …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...with research. Are comfortable dealing with systems that span networking , distributed compute, and high-throughput data handling. Have familiarity… more
- San Francisco Compute Co. (San Francisco, CA)
- …we make: a liquid market for GPU offtake. About the Role As a Principal Software Engineer - Networking , you'll be responsible for designing and operating ... distributed systems that can withstand node or cluster-wide failures Architect software -defined networking solutions that integrate with underlay switches and… more
- Rockstar (San Francisco, CA)
- …promise is simple: they make your AI system better. They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core ... or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference , and long-term model maintenance. Their customers are Series A-C AI… more
- Lodestar (San Francisco, CA)
- …end-to-end, autonomous in-space bodyguarding service. About the Job At Lodestar, as a Software Engineer - Localization, State Estimation & Prediction , you'll be ... future trajectories and behavioral patterns of targets Implement intent inference models to identify actions and dynamically rank threat...more about the ITARhere . Pay Range: (E1) Junior Software Engineer : $120,000 - $140,000 / year… more
- Decagon AI, Inc. (San Francisco, CA)
- …The Infrastructure team builds and operates the foundations that power Decagon: networking , data, ML serving, developer platform, and real‑time voice. We partner ... around five focus areas: Core Infra: The foundational cloud stack- networking , compute, storage, security, and infrastructure‑as‑code-to ensure reliability, scale,… more
- Baseten (San Francisco, CA)
- ABOUT BASETEN Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and ... operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling… more
- Amazon (San Francisco, CA)
- Sr. Software Development Engineer , Annapurna Labs In this role you will be responsible for leading a technical team that provides profiling and optimization ... fleet. You will work closely with the hardware and software teams to ensure that the right tools are...teams. Collect requirements from various other teams including training, inference , and runtime. Collaborate with the compiler performance team… more
- Sierra (San Francisco, CA)
- …Clay led the product and design teams for Google Workspace. What you'll do As a Software Engineer on our Site Reliability team at Sierra, you will be responsible ... Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture). Strong background in observability… more
- Menlo Ventures (San Francisco, CA)
- …with Linux kernel development, system programming, or related low-level software engineering Understand virtualization technologies (KVM, Xen, QEMU, etc.) and ... optimization for ML/AI specific workloads Network stack optimization and high-performance networking Experience with TPUs, custom ASICs, or other ML accelerators… more
- Amazon (San Francisco, CA)
- …(our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by ... engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our teams breadth of talent, we… more
- DatologyAI (Redwood City, CA)
- …are in office 4 days a week. About the Role We're looking for an engineer with deep experience building and operating large-scale training and inference systems. ... infrastructure that powers both our internal ML research workflows and the high-performance inference pipelines that deliver curated data to our customers. As one of… more
- Epsilon (San Francisco, CA)
- …(potentially infinite) external sites Qualifications 7+ years of experience as a software engineer , demonstrably delivering on time, at quality Full stack ... This is a unique opportunity for a strong generalist engineer who thrives in ambiguity, loves solving complex problems,...of terabytes of data Build and optimize ML model inference pipelines for both live traffic and offline data.… more
- Eridu Corporation (San Francisco, CA)
- …an RTL Engineer to help define and implement our industry‑leading Networking IC. If you're a highly motivated self‑starter eager to solve real‑world problems, ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
- Eridu Corporation (San Francisco, CA)
- …or FPGA prototyping platforms for pre-silicon validation. Exposure to hardware/ software co-validationfor networking protocols or control-plane software ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
- Eridu Corporation (San Francisco, CA)
- …an RTL Engineer to help define and implement our industry-leading Networking IC. If you're a highly motivated self-starter eager to solve real-world problems, ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
- CompScience (San Francisco, CA)
- About CompScience At CompScience, we're not just building software , we're saving lives. We're a high-growth startup on a mission to prevent 1 million workplace ... and engineering teams are composed of distinguished computer vision engineers, software architects, data scientists and product and design leaders from Amazon… more