  • Lawrence Berkeley National Laboratory (Berkeley, CA)
    …Lab's (LBNL) Information Technology Division (IT) has an opening for a Senior HPC Cluster Systems Administrator to join their ScienceIT Team! In ... by building, integrating, and maintaining Linux-based resources, high-performance computing cluster systems, and Kubernetes clusters. This role provides…
    job goal (01/13/26)
  • The Voleon Group (Berkeley, CA)
    …multibillion‑dollar asset manager, and we have ambitious goals for the future. As a Senior Cluster Site Reliability Engineer (SRE), you will help scale our ... research compute cluster to meet our growing needs, and you will ... in SRE or DevOps roles, preferably working as a senior engineer or tech lead. Knowledge of HPC…
    job goal (01/12/26)
  • Ring Inc (San Francisco, CA)
    …networking, observability, security, disaster recovery, and cost management. Familiarity with HPC cluster management software such as Slurm. Familiarity with ... and retrieval workloads. Previous success managing engineering teams delivering production-grade, HPC-scale RAG systems. Deep understanding of infra domains:…
    job goal (01/12/26)
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    systems. Requirements: 5+ years' experience building large-scale, fault-tolerant distributed systems (ML inference, HPC, or similar). Proficiency in Python, ... multi-cluster environments. Contributions to open-source ML or inference systems projects. Proven track record of cost optimisation in high-performance compute…
    job goal (01/13/26)
  • Fluidstack (San Francisco, CA)
    …infrastructure. We treat our customers' outcomes as our own, taking pride in the systems we build and the trust we earn. If you're motivated by purpose, obsessed ... join us in building what's next. About the Role: Senior / Staff SREs at Fluidstack sit at the ... networking, platform engineering, and data center operations to build systems that scale with the demands of AI workloads.…
    job goal (01/12/26)