- OpenAI (San Francisco, CA)
- …and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... About the Team Our Inference team brings OpenAI's most capable research and.... About the Role We're looking for a senior engineer to design and build the load balancer that… more
- Databricks Inc. (San Francisco, CA)
- Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more
- Menlo Ventures (San Francisco, CA)
- About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more
- Menlo Ventures (San Francisco, CA)
- …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our robots operate with… more
- OpenAI (San Francisco, CA)
- …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...with research. Are comfortable dealing with systems that span networking , distributed compute, and high-throughput data handling. Have familiarity… more
- Etched.ai, Inc. (San Jose, CA)
- … networking solutions for large-scale inference workloads. As a Pod Software Engineer , you will focus on developing and qualifying software ... Overview We are seeking highly motivated and skilled Pod Networking Software Engineers to join our System...communication amongst Sohu inference nodes in multi-rack inference clusters. You will collaborate closely with kernel, platform,… more
- San Francisco Compute Co. (San Francisco, CA)
- …we make: a liquid market for GPU offtake. About the Role As a Principal Software Engineer - Networking , you'll be responsible for designing and operating ... distributed systems that can withstand node or cluster-wide failures Architect software -defined networking solutions that integrate with underlay switches and… more
- Rockstar (San Francisco, CA)
- …promise is simple: they make your AI system better. They are hiring a Backend Software Engineer (ML Infrastructure) to help design, build, and scale the core ... or raw compute, but a full-stack backend for fine-tuning, reinforcement learning, inference , and long-term model maintenance. Their customers are Series A-C AI… more
- Lodestar (San Francisco, CA)
- …end-to-end, autonomous in-space bodyguarding service. About the Job At Lodestar, as a Software Engineer - Localization, State Estimation & Prediction , you'll be ... future trajectories and behavioral patterns of targets Implement intent inference models to identify actions and dynamically rank threat...more about the ITARhere . Pay Range: (E1) Junior Software Engineer : $120,000 - $140,000 / year… more
- Decagon AI, Inc. (San Francisco, CA)
- …The Infrastructure team builds and operates the foundations that power Decagon: networking , data, ML serving, developer platform, and real‑time voice. We partner ... around five focus areas: Core Infra: The foundational cloud stack- networking , compute, storage, security, and infrastructure‑as‑code-to ensure reliability, scale,… more
- Baseten (San Francisco, CA)
- ABOUT BASETEN Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and ... operate the Model APIs surface with focus on advanced inference capabilities: structured outputs (JSON mode, grammar-constrained generation), tool/function calling… more
- Amazon (San Francisco, CA)
- Sr. Software Development Engineer , Annapurna Labs In this role you will be responsible for leading a technical team that provides profiling and optimization ... fleet. You will work closely with the hardware and software teams to ensure that the right tools are...teams. Collect requirements from various other teams including training, inference , and runtime. Collaborate with the compiler performance team… more
- Sierra (San Francisco, CA)
- …Clay led the product and design teams for Google Workspace. What you'll do As a Software Engineer on our Site Reliability team at Sierra, you will be responsible ... Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture). Strong background in observability… more
- Menlo Ventures (San Francisco, CA)
- …with Linux kernel development, system programming, or related low-level software engineering Understand virtualization technologies (KVM, Xen, QEMU, etc.) and ... optimization for ML/AI specific workloads Network stack optimization and high-performance networking Experience with TPUs, custom ASICs, or other ML accelerators… more
- Ll Oefentherapie (San Jose, CA)
- …agents that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation ... will work in a collaborative environment with applied scientists, ML engineers, and software teams to deliver performant and reliable AI infrastructure. This is a… more
- Amazon (San Francisco, CA)
- …(our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by ... engineers. Our team covers multiple disciplines including silicon engineering, hardware design, software and operations. Because of our teams breadth of talent, we… more
- Aeva Inc. (Mountain View, CA)
- …to make more intelligent and safe decisions. Role Overview We're looking for an engineer who can own data collection, scalable data systems, MLOps workflows and ML ... debug, and tune GPU performance; support CUDA kernels, TensorRT integration, and inference optimization. Build and maintain CI/CD and automated MLOps pipelines for… more
- Epsilon (San Francisco, CA)
- …(potentially infinite) external sites Qualifications 7+ years of experience as a software engineer , demonstrably delivering on time, at quality Full stack ... This is a unique opportunity for a strong generalist engineer who thrives in ambiguity, loves solving complex problems,...of terabytes of data Build and optimize ML model inference pipelines for both live traffic and offline data.… more
- Eridu Corporation (San Francisco, CA)
- …an RTL Engineer to help define and implement our industry‑leading Networking IC. If you're a highly motivated self‑starter eager to solve real‑world problems, ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more
- Eridu Corporation (San Francisco, CA)
- …or FPGA prototyping platforms for pre-silicon validation. Exposure to hardware/ software co-validationfor networking protocols or control-plane software ... hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently… more