- Canonical (San Jose, CA)
- …and Virtualisation Infrastructure role at Canonical. Join a leading open‑source company building next‑generation private cloud infrastructure using KVM, ... and the Americas. Canonical is a global provider of open‑source software and operating systems, with Ubuntu ... with Ubuntu widely deployed in public cloud, data science, AI, engineering innovation, and IoT. We have 1,200+ colleagues…
- Meta (Menlo Park, CA)
- …Experience with large-scale network designs and operations. Experience working with open source networking projects. Experience with network routing ... that connects all our locations, our edge points-of-presence, and AI network. We are looking for a manager who ... evolving network infrastructure. Routing is a fundamental area of networking and an exciting field of large scale…
- Georgian Partners (Mountain View, CA)
- …tooling, and integrations using Python, Ruby, and Terraform, enabling teams to scale infrastructure and AI services efficiently. Design and enforce secure, ... Crossplane, Helm) and GitOps (Argo CD). In‑depth knowledge of Kubernetes networking, autoscaling, and workload orchestration for AI/ML inference workloads.…
- Aera Technology (Mountain View, CA)
- …tooling, and integrations using Python, Ruby, and Terraform, enabling teams to scale infrastructure and AI services efficiently. Design and enforce secure, ... Crossplane, Helm) and GitOps (Argo CD). In-depth knowledge of Kubernetes networking, autoscaling, and workload orchestration for AI/ML inference workloads.…
- Altera (San Jose, CA)
- …deploy, across the cloud to the edge, enabling limitless possibilities for AI. Our broad portfolio includes FPGAs, SoCs, CPLDs, IP, development tools, ... the deployment, maintenance, and lifecycle management of compute, storage, and networking systems across global engineering sites. Lab IT Governance: Define and…
- NVIDIA Corporation (Santa Clara, CA)
- …subsystems, and represent the team in internal reviews and external forums (open source, conferences, and customer-facing technical deep dives). What we need ... Principal Software Engineer - Large-Scale LLM Memory and Storage Systems page is ... to stand out from the crowd: Prior contributions to open-source LLM serving or systems projects focused…
- NVIDIA (Santa Clara, CA)
- …storage subsystems, and represent the team in internal reviews and external forums (open source, conferences, and customer-facing technical deep dives). What we ... customer teams. Ways to stand out from the crowd: Prior contributions to open‑source LLM serving or systems projects focused on KV‑cache optimization,…
- GEICO (Palo Alto, CA)
- …Infrastructure. Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.). Architect and ... Great Company, Great Culture, Great Rewards and Great Careers. GEICO AI ML Infrastructure team is seeking an exceptional Senior ... Go, Rust, or Java preferred. Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral,…
- NVIDIA Corporation (Santa Clara, CA)
- …stand out from the crowd: Experience as a maintainer or significant contributor to large open source software projects. In-depth knowledge of state of the art on ... doing: Lead efforts to transform a driver with the scale of some operating systems into a design with ... by law.…
- Crusoe Energy Systems LLC (San Francisco, CA)
- …record of contributions to the open source community (e.g., Open vSwitch/OVS, Open Virtual Networking/OVN, Multus, Cilium). Bonus Points: Advanced ... powers a world where people can create ambitiously with AI - without sacrificing scale, speed, or ... kernel. Experience with Open vSwitch, OpenFlow, and Open Virtual Networking. Knowledge of professional software…
- TrustIn (San Francisco, CA)
- Staff Distributed Systems Engineer - AI Infrastructure (Open Source). We're an early‑stage, well‑funded AI infrastructure company operating in stealth. ... with large‑scale systems and production reliability. Comfortable working in open‑source environments. Nice to have: micro‑VMs, container isolation,…
- Canonical (San Francisco, CA)
- …source. As the company that publishes Ubuntu, one of the most important open‑source projects and the platform for AI, IoT, and the cloud, we are changing ... Engineer - Canonical Canonical is a leading provider of open source software and operating systems to ... breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the…
- E2b (San Francisco, CA)
- …workloads. Contributions to Firecracker, Cloud Hypervisor, or similar open source projects. Experience with observability at scale (distributed tracing, ... Skills: Go, Building and managing large clusters, Linux, Networking, Kubernetes, Virtualization. Who we are: E2B is a ... when engineers collaborate face-to-face on hard problems. Excited about open source - Comfortable with our code…
- HumanSignal (San Francisco, CA)
- …and industry backing. Our product is already beloved by a thriving open source and enterprise community. You'll work alongside experienced, mission-driven ... the Solutions Architect (Post Sales) role at HumanSignal ... models are grounded in real-world signal, not noise. Our open-source product, Label Studio, has become…
- Hyperbolic (San Francisco, CA)
- …service that promise affordability and accessibility for all. As pioneers at the intersection of AI and open-source technology, we believe in an open ... to democratize AI by breaking down the barriers to computing power with our Open-Access AI Cloud. By making better use of idle computing resources across the…
- Canonical (San Francisco, CA)
- …source. As the company that publishes Ubuntu, one of the most important open‑source projects and the platform for AI, IoT, and the cloud, we are changing ... Manager - Container and Virtualisation Infrastructure Canonical is a leading provider of open source software and operating systems to the global enterprise and…
- Crusoe (San Francisco, CA)
- …Experience with hardware fleet management across multiple datacenters. Contributions to open source Kubernetes or related ecosystem projects. Experience ... recovery strategies at scale. Familiarity with GPUs, HPC clusters, or large-scale AI/ML workloads. Benefits: Industry competitive pay. Restricted Stock Units in…
- Together AI (San Francisco, CA)
- …and build scalable machine learning systems that power our accelerated AI initiatives. This role involves developing large-scale, fault-tolerant distributed ... efficient. Join us in shaping the future at Together AI! Responsibilities: Design and build large-scale, distributed ... hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance…
- Together AI (San Francisco, CA)
- About the Role: Together AI is building the next-generation AI compute platform, and networking is at the center of that mission. As a Network Architect, you ... is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive ... hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance…
- Together AI (San Francisco, CA)
- …including multi-threading, memory management, networking, storage, performance, and scale. Preferred: Knowledge of existing AI inference systems such ... at scale. Develop and optimize runtime inference services for large-scale AI applications. Collaborate with researchers, engineers, product managers, and…