- Cartesia (San Francisco, CA)
- …including the world's foremost experts in AI. About the Role We're looking for a Cluster Infrastructure Engineer to help build and scale the compute backbone ... this role, you'll work at the intersection of distributed systems and infrastructure engineering, designing and operating the large-scale GPU clusters that train and… more
- Theklicker (Palo Alto, CA)
- An innovative internet technology startup is seeking a passionate Web Engineer to join their dynamic team. This role involves working with cutting-edge technologies ... like Rust and Kubernetes while operating some of the world's largest GPU supercomputing clusters. The company values initiative and excellence, encouraging engineers to take ownership of their work. You will be part of a small, motivated team where… more
- The Voleon Group (Berkeley, CA)
- …asset manager, and we have ambitious goals for the future. As a Senior Cluster Site Reliability Engineer (SRE), you will help scale our research compute ... cluster to meet our growing needs, and you will ... at scale. You will support both on‑prem and cloud infrastructure, and work to provide the best experience to… more
- NVIDIA Corporation (Santa Clara, CA)
- Senior AI-HPC EDA Cluster Engineer. Locations: US, CA, Santa Clara; US, TX, Austin; US, CA, Remote; US, WA, ... a lasting impact on the world. We are seeking a highly skilled and experienced AI-HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA… more
- Google Inc. (Sunnyvale, CA)
- …development. 5 years of experience building and developing large-scale infrastructure, distributed systems or networks, or experience with compute technologies, ... on and is growing every day. As a software engineer, you will work on a specific project critical ... ground-up product building opportunity to own the foundational Kubernetes cluster platform. This platform will provide a secure, resilient,… more
- OpenAI (San Francisco, CA)
- …urgency of keeping mission-critical systems running Qualifications Experience as an infrastructure, systems, or distributed systems engineer in large-scale or ... frontier research. This role blends distributed systems engineering with hands-on infrastructure work on our largest datacenters. You will scale Kubernetes clusters… more
- Genesis Therapeutics Inc. (Burlingame, CA)
- …necessary). Nice to haves Experienced with building, maintaining and debugging low-level cluster infrastructure running on multiple clouds using Kubernetes and ... learn from all kinds of molecular data, leveraging our cluster with 1000s of GPUs and 10,000s of CPUs. About the Role We're seeking experienced ML infrastructure engineers to join the team and lead engineering… more
- PlanetiQ (Boston, MA)
- PlanetiQ is looking for an infrastructure engineer to set up, manage, and maintain a compute environment for both AI/ML applications as well as traditional ... flows is a must. Core Function: You are an 'engineer's engineer' whose background is skewed heavily towards the ops and infrastructure side, i.e., with a heavier focus on the… more
- Menlo Ventures (Burlingame, CA)
- …necessary). Nice to haves Experienced with building, maintaining and debugging low-level cluster infrastructure running on multiple clouds using Kubernetes and ... learn from all kinds of molecular data, leveraging our cluster with 1000s of GPUs and 10,000s of CPUs. ... which is instrumental to our mission. As an ML Engineer at Genesis, you will lead rapid iteration on… more
- Accenture (Washington, DC)
- We Are: The Global Infrastructure Engineering AI & HPC team is at the center of enabling infrastructure reinvention for the next era of digital solutions powered ... on-prem, and hybrid environments to design, build, and operate accelerated infrastructure that powers high-performance workloads at scale. Our solutions enable some… more
- Aldea Inc (San Francisco, CA)
- …expressive, contextual, and intelligent human-machine interface. The Mission We are seeking an Infrastructure Engineer to bridge the gap between complex hybrid ... VPC) and Bare Metal clusters. You will treat physical infrastructure as mutable software, using tools like Cluster API, Metal3, or Tinkerbell to manage… more
- OpenAI (San Francisco, CA)
- …across blob storage down to hardware caching Much more! About the Role As an engineer within Fleet infrastructure, you will design, write, deploy, and operate ... This role will support the fleet infrastructure team at OpenAI. The fleet team focuses ... low maintenance platform by building push-button automation for Kubernetes cluster provisioning and upgrades Supporting research workflows with service… more
- Rivian (Palo Alto, CA)
- …future for all. We are seeking a highly skilled and experienced Sr. Infrastructure Engineer to further our DevOps initiatives and drive continuous integration, ... software delivery, and deployment. As a Sr. Infrastructure Engineer, you will collaborate with cross‑functional teams to design, implement, and manage our … more
- Boson AI (Palo Alto, CA)
- …as we continue to scale. Responsibilities Manage and optimize HPC cluster operations Deploy and maintain infrastructure‑as‑code solutions Support ML/research ... The Role We're looking for a Senior Site Reliability Engineer to help us run one of the most ... You'll be hands‑on with the full lifecycle of HPC infrastructure: planning, building, testing, deploying, and keeping everything running… more
- OpenAI (San Francisco, CA)
- About the Role As an engineer within Fleet infrastructure, you will design, write, deploy, and operate infrastructure systems for model deployment and ... and operate components of our compute fleet including job scheduling, cluster management, snapshot delivery, and CI/CD systems. Interface with researchers and… more
- Apple Inc. (Cupertino, CA)
- AIML - Sr Software Engineer, Machine Learning Platform Technologies Cupertino, California, United States Machine Learning and AI Are you an open-source contributor ... passionate about building the next generation of cloud‑native ML infrastructure? We're seeking a hands‑on technical leader with deep expertise in Kubernetes,… more
- Pathway Genomics Corporation (Palo Alto, CA)
- …in Palo Alto, California. The opportunity We are looking for a Senior ML Infrastructure / DevOps Engineer who loves Linux, distributed systems, and scaling GPU ... management). Automate infrastructure provisioning and configuration using infrastructure‑as‑code (Terraform, CloudFormation, cluster‑tooling) and configuration management.… more
- Menlo Ventures (San Francisco, CA)
- …and business leaders building beneficial AI systems. Anthropic is seeking talented Infrastructure Engineers to join our team and support the development, scaling, ... hundreds of thousands of machines), partnering with cloud service providers on cluster build out and features Consult with stakeholders to understand … more
- Guardant Health (Palo Alto, CA)
- Staff HPC Infrastructure Engineer. Locations: Palo Alto, CA. Time type: Full time. Posted on: Posted ... improving your skills, value to the company and improve the computational infrastructure. You are dedicated to engineering excellence yet pragmatic and flexible. You… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …cloud infrastructure. We are seeking a highly skilled Staff Infrastructure Security Engineer to architect, deploy, and operationalize the foundational ... Zero Trust Architecture: Architect a highly available, disaster-resilient, and scalable multi-cluster secrets management platform that serves as the foundation for… more