- Sygaldry Technologies (San Francisco, CA)
- …and cross-functional collaboration. Ideal candidates have five years of experience in high- performance systems and familiarity with GPU architectures. The role ... A technology company specializing in quantum AI solutions is seeking a Senior Systems Architect in San Francisco. You will design and analyze systems while… more
- Labelbox (San Francisco, CA)
- …provider is looking for a Senior C++ Full-Stack Engineer to work remotely on high- performance systems supporting AI data pipelines. Candidates should have at ... of experience in production C++ development, a strong background in systems programming, and excellent communication skills. The role involves optimizing existing… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …cloud service. This role involves building robust software solutions for high- performance computing, requiring deep knowledge of Slurm and Kubernetes. Candidates ... should have over 7 years of software engineering experience with strong programming skills, particularly in GoLang. Competitive compensation is offered ranging from $185,000 to $224,000, aligning with candidate qualifications. #J-18808-Ljbffr more
- Lambda Inc. (San Francisco, CA)
- …for ensuring Lambda delivers world-class support to the most demanding environments in AI . You'll combine deep HPC technical expertise with strong leadership, ... Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda's mission...customer-focused leader to build and guide our Super Intelligence HPC Support Engineering team. This team partners directly with… more
- Labelbox (San Francisco, CA)
- …Competitive, hourly (based on experience) Role Responsibilities Design, build, and optimize high- performance systems in C++ supporting AI data pipelines ... week Preferred Prior experience with data annotation, data quality, or evaluation systems Familiarity with AI /ML workflows, model training, or benchmarking… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …stacks (Prometheus, Grafana, Loki) and incident response frameworks. Familiarity with high‑ performance computing ( HPC ) or AI /ML training infrastructure ... in private deployment. If you want to build and operate infrastructure for frontier AI workloads, automate systems at petascale, and be part of a founding… more
- Amazon (San Francisco, CA)
- Sr. System Development Engineer, High- Performance Accelerator Servers for AI /ML Do you want to shape the future of Generative AI at AWS? Join the team ... design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing… more
- Pantera Capital (San Francisco, CA)
- …job management systems across heterogeneous compute environments Benchmark system performance , diagnose bottlenecks, and implement improvements across both ... ML frameworks like TensorFlow or distributed training libraries Background in HPC environments, parallel computing, and high‑ performance networking Knowledge of… more
- Menlo Ventures (San Francisco, CA)
- …and optimize high‑ performance computing ( HPC ) infrastructure. Integrate AI models into production systems , platforms, and front‑end applications, ... Deep understanding of operating system internals (Unix). Experience with high‑ performance computing ( HPC ) infrastructure such as Slurm and Kubernetes.… more
- Lavendo (San Francisco, CA)
- …team. What You'll Do Architect and optimize distributed training and inference systems for large-scale AI models Design and deliver customer-focused solutions ... a publicly traded company at the forefront of the AI revolution, offering an AI -centric cloud platform...that maximize performance and business value Lead the transition of ML… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …crucial for delivering next-generation orchestration capabilities to power GPU-accelerated and high- performance computing ( HPC ) at scale. Your expertise will be ... performance . You will shape the technical direction of systems that allow customers to run advanced workloads across...our managed Slurm offering, providing a seamless experience for AI /ML and HPC customers who rely on… more
- Ring Inc (San Francisco, CA)
- … AI infrastructure platforms: a petabyte-scale ingestion and inference system powering mission-critical government and enterprise deployments. We need an ... 10+ years in software engineering, 5+ years in management roles with large-scale AI /ML systems and infrastructure. Expert-level proficiency in Python and Golang,… more
- Datacrunch (San Francisco, CA)
- …Senior Employment type: Full-time, permanent Your responsibilities Ensure the reliability, scalability, and performance of HPC and cloud systems . Build and ... Coast. You'll work closely with our European engineering teams to scale our high- performance compute ( HPC ) and cloud infrastructure globally. As our initial… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …Building core components of our foundational storage products, purpose built for high performance AI and ML workloads Contributing to distributed file, block and ... that are critical to our infrastructure and our customers' AI / HPC workloads. What You'll Be Working On:...storage products, with a focus on filesystem based solutions System Design & Architecture Design and implement high- performance… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …building, and operating the global edge, backbone, and data center network for High- Performance Compute ( HPC ) Clusters with GPUs. The ideal individual will be ... powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability. Be...gain marketable network Engineering experiences in edge, backbone, and HPC ‑based data center networking at a massive scale. This… more
- OpenAI (San Francisco, CA)
- …artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through ... edge of speed and scale, combining the traditions of High- Performance Computing ( HPC ) with a modern cloud...systems for visualizing hardware components, monitoring training job performance on the platform, and ensuring the health of… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …systems , automated remediation, or event‑driven operations Interest in scaling AI / HPC infrastructure and solving reliability challenges in GPU‑heavy ... powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or sustainability. Be...Excellence, you will help ensure the stability, resilience, and performance of Crusoe's GPU cloud. This role is ideal… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …Engineer to work closely with our most strategic enterprise customers deploying AI /ML workloads on Crusoe's high- performance GPU infrastructure. This is a ... efficiency. Infrastructure‑Centric Thinking: Go beyond abstracted services-deploy and optimize AI /ML workloads directly on Crusoe infrastructure. Ensure performance… more
- Amadeus Search (San Francisco, CA)
- …is your opportunity to join a mission-driven startup building the foundation of sustainable AI . The team is creating a high- performance compiler and runtime that ... parallelization, and hardware optimization. Your work will directly shape the performance and accessibility of next-generation AI infrastructure. What You… more
- Prima Mente (San Francisco, CA)
- …focus - Foundation Models for Biology Architect, build, and scale our foundational AI infrastructure. You'll ensure our ML models are developed and deployed on ... highly performant, scalable, and reliable systems . Your expertise will enable rapid experimentation and seamless deployment of large-scale multi-omic models,… more