• Cartesia (San Francisco, CA)
    …including the world's foremost experts in AI. About the Role We're looking for a Cluster Infrastructure Engineer to help build and scale the compute backbone ... this role, you'll work at the intersection of distributed systems and infrastructure engineering, designing and operating the large-scale GPU clusters that train and…
    job goal (01/12/26)
  • The Voleon Group (Berkeley, CA)
    …asset manager, and we have ambitious goals for the future. As a Senior Cluster Site Reliability Engineer (SRE), you will help scale our research compute ... cluster to meet our growing needs, and you will ... at scale. You will support both on‑prem and cloud infrastructure, and work to provide the best experience to…
    job goal (01/12/26)
  • OpenAI (San Francisco, CA)
    …urgency of keeping mission-critical systems running Qualifications Experience as an infrastructure, systems, or distributed systems engineer in large-scale or ... frontier research. This role blends distributed systems engineering with hands-on infrastructure work on our largest datacenters. You will scale Kubernetes clusters…
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    …across blob storage down to hardware caching Much more! About the Role As an engineer within Fleet infrastructure, you will design, write, deploy, and operate ... This role will support the fleet infrastructure team at OpenAI. The fleet team focuses ... low maintenance platform by building push-button automation for Kubernetes cluster provisioning and upgrades Supporting research workflows with service…
    job goal (01/13/26)
  • OpenAI (San Francisco, CA)
    About the Role As an engineer within Fleet infrastructure, you will design, write, deploy, and operate infrastructure systems for model deployment and ... and operate components of our compute fleet including job scheduling, cluster management, snapshot delivery, and CI/CD systems. Interface with researchers and…
    job goal (01/13/26)
  • Menlo Ventures (San Francisco, CA)
    …and business leaders building beneficial AI systems. Anthropic is seeking talented Infrastructure Engineers to join our team and support the development, scaling, ... hundreds of thousands of machines), partnering with cloud service providers on cluster build out and features Consult with stakeholders to understand…
    job goal (01/13/26)
  • Crusoe Energy Systems LLC (San Francisco, CA)
    …cloud infrastructure. We are seeking a highly skilled Staff Infrastructure Security Engineer to architect, deploy, and operationalize the foundational ... Zero Trust Architecture: Architect a highly available, disaster-resilient, and scalable multi-cluster secrets management platform that serves as the foundation for…
    job goal (01/13/26)
  • Hedra, Inc (San Francisco, CA)
    …it's the next evolution of AI-driven content creation. Summary As a Senior/Staff Infrastructure Engineer, you will own the reliability, availability, and ... You will be responsible for designing, maintaining, and improving the production infrastructure that keeps Hedra online: Kubernetes for orchestration, AWS as the…
    job goal (01/13/26)
  • Dolby (San Francisco, CA)
    …work. Job Description Dolby Laboratories Inc. is seeking a Machine Learning Operations Engineer to join the Consumer Entertainment Group, to help bring the next ... best performance and efficient use of machine-learning resources. Collaborate with infrastructure teams physical compute, storage and network infrastructure…
    job goal (01/13/26)
  • Amazon (San Francisco, CA)
    …and node management to ensure smooth operation of GenAI infrastructure. Continuously improve and automate cluster/capacity/maintenance upgrades. Troubleshoot ... pipelines using AWS CodePipeline, GitHub Actions, or similar platforms. Familiarity with Infrastructure as Code (IaC) tools such as AWS CloudFormation, Terraform, or…
    job goal (01/13/26)
  • Voxel (San Francisco, CA)
    …is backed by industry-leading VCs. Voxel is looking for a Staff Machine-Learning Infrastructure Engineer to drive the next wave of our computer-vision platform ... our ML lifecycle - ground-truth data & labeling workflows, large-scale training infrastructure, and continuous model lifecycle management. If you excel at designing…
    job goal (01/13/26)
  • Aldea Inc (San Francisco, CA)
    …expressive, contextual, and intelligent human-machine interface. The Mission We are seeking an Infrastructure Engineer to bridge the gap between complex hybrid ... VPC) and Bare Metal clusters. You will treat physical infrastructure as mutable software, using tools like Cluster API, Metal3, or Tinkerbell to manage…
    job goal (01/12/26)
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …access. This is a rare opportunity to work at the intersection of hyperscale infrastructure and AI, shaping the operational backbone of one of the largest GPU ... clusters in private deployment. If you want to build and operate infrastructure for frontier AI workloads, automate systems at petascale, and be part of a founding…
    job goal (01/13/26)
  • Pantera Capital (San Francisco, CA)
    …with Kubernetes, Slurm, Python, C++, PyTorch, and primarily on AWS. As an AI Infrastructure Engineer, you will be partnering closely with our Inference and ... bottlenecks, and implement improvements across both training and inference infrastructure Build monitoring, alerting, and observability solutions tailored to ML…
    job goal (01/13/26)
  • San Francisco Compute Co. (San Francisco, CA)
    We're building the company which will de-risk the largest infrastructure build-out in history. When people finance GPU clusters, the datacenters housing them, and ... the infrastructure powering them, they need "offtake" - meaning someone has signed a contract to lease the cluster for a period of time before it's even built.…
    job goal (01/13/26)
  • The San Francisco Compute Company (San Francisco, CA)
    We're building the company which will de-risk the largest infrastructure build-out in history. When people finance GPU clusters, the datacenters housing them, and ... the infrastructure powering them, they need "offtake" - meaning someone has signed a contract to lease the cluster for a period of time before it's even built.…
    job goal (01/13/26)
  • CatchProbe Intelligence Technologies (San Francisco, CA)
    Senior Database Engineer (Elastic/Mongo/Hadoop) Workplace Type: Remote - Region: San Francisco, CA Job ... of Hadoop systems will be an added advantage. * Deploy Hadoop (Big Data) cluster, commissioning/decommissioning of nodes, track jobs, monitor services like ZooKeeper, HBase, Solr…
    job goal (01/13/26)