- San Francisco Compute Co. (San Francisco, CA)
- …located in San Francisco is looking for a dedicated professional to manage GPU training clusters. You will ensure the smooth operation of high-performance computing ... for hardware management. Applicants must have experience with Linux, networking fundamentals, and GPU clusters. The role offers a salary ranging from $170k to $300k…
- Datacrunch (San Francisco, CA)
- …(DNS/TCP), and infrastructure‑as‑code tools (Terraform, Ansible). Experience managing Slurm-based HPC GPU clusters, diagnosing performance issues, and designing ... vision and expectations. About the role We're seeking a Senior or Principal Site Reliability Engineer (SRE) ... our European engineering teams to scale our high-performance compute (HPC) and cloud infrastructure globally. As our initial US‑based…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …self‑healing systems, automated remediation, or event‑driven operations. Interest in scaling AI/HPC infrastructure and solving reliability challenges in GPU‑heavy ... the heart of that mission. As a Site Reliability Engineer focused on Operational Excellence, you will help ensure the stability, resilience, and performance of Crusoe's GPU cloud. This role is ideal for engineers who…
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …experimentation, full-scale model training, or inference. Our client operates high-performance GPU clusters powering some of the most advanced AI workloads ... workloads. Design, build, and optimise distributed inference systems to maximise GPU utilisation and minimise cold starts. Integrate, tune, and operate inference…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …setting the pace for responsible, transformative cloud infrastructure. About This Role: As a Senior Software Engineer on our storage team, you'll be joining our ... optimize our next-generation cloud storage products. We're looking for a hands-on engineer with deep expertise in building storage systems. You will be responsible…
- Epoch Biodesign (San Francisco, CA)
- …cloud infrastructure. About the Role Crusoe Cloud is seeking a Staff Solutions Engineer to work closely with our most strategic enterprise customers deploying AI/ML ... workloads on Crusoe's high‑performance GPU infrastructure. This is a hands‑on, customer‑facing role requiring ... with a primary focus on containerized MLOps over traditional HPC. Multi‑cloud deployment or migration experience (especially AWS ➝…
- Fluidstack (San Francisco, CA)
- …the future of intelligence, join us in building what's next. About the Role Senior / Staff SREs at Fluidstack sit at the core of our infrastructure, working ... and operations to ensure the reliability and performance of our global GPU cloud. They partner closely with teams including networking, platform engineering, and…
- Fluidstack (San Francisco, CA)
- About Fluidstack Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, ... to squeeze microseconds off packet latency for AI & HPC workloads. Deploy & optimize at scale. Roll out ... track record scaling low-latency, high-throughput networks for AI/ML or HPC clusters. Benefits Competitive total compensation package (cash +…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …transformative cloud infrastructure. About the Role: Crusoe Cloud is seeking a Sr. to Senior Staff level Solutions Engineer to work closely with our most ... strategic enterprise customers deploying AI/ML workloads on Crusoe's high-performance GPU infrastructure. This is a hands‑on, customer‑facing role requiring deep…
- Roblox Corporation (San Mateo, CA)
- Senior Hardware Engineer - GPU & AI Infrastructure Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with ... world's play. In this specialized role, you will be the technical lead for our GPU and AI accelerator ecosystem. You will be responsible for the full lifecycle of…
- Dyna Robotics (Redwood City, CA)
- …in California is looking for an experienced Machine Learning Infrastructure Engineer. This role involves designing scalable ML training platforms, optimizing ... high-performance computing systems, and ensuring robust job scheduling and reliability. Ideal candidates will have 7+ years in software with hands-on experience in ML model tuning and managing cloud environments. Join us to shape the future of AI-driven…
- DriveNets Ltd. (Redwood City, CA)
- Senior Solutions Engineer, AI/HPC Networking Product Redwood City Description Location: Bay Area - remote WFH-Remote role with travel to customers DriveNets ... InfiniBand, RoCEv2, lossless Ethernet technologies (PFC, ECN, etc.), accelerated computing, GPU, NIC, DPU, etc. Understanding of AI/HPC networking infrastructure…
- Deloitte (San Francisco, CA)
- …and data engineers. SFL Scientific, a Deloitte Business, is looking to add a Senior AI Engineer to their vibrant environment. SFL Scientific is part of ... role ends on 2/28/2026. Work You'll Do As a Senior AI Engineer/Solutions Architect, you'll work cross-functionally ... , Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system…