- NVIDIA (Santa Clara, CA)
- … research to the world's fastest supercomputers. We are seeing a highly motivated Senior Solutions Architect to join the Cluster Design and Architecture team ... with a focus on networking technologies. As AI workloads scale to...doing: + Partner with internal engineering efforts in GPU cluster building and networking and convey architecture… more
- NVIDIA (Santa Clara, CA)
- …graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that ... join us today! As a member of the GPU AI /HPC Infrastructure team, you will provide leadership in the...us with the strategic challenges we encounter including: compute, networking , and storage design for large scale, high performance… more
- NVIDIA (Santa Clara, CA)
- …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... the management of large-scale HPC systems including the deployment of compute, networking , and storage. + Develop and improve our ecosystem around GPU-accelerated… more
- NVIDIA (Santa Clara, CA)
- …DevOps tools to automate software updates, perform maintenance tasks, and monitor cluster availability, ensuring seamless operations. + Take ownership of daily ... cluster failures and issues, troubleshooting them promptly to maintain...and alerting infrastructure. + Proficiency in designing large scale networking technologies and the associated challenges. Your base salary… more
- NVIDIA (Santa Clara, CA)
- …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute...of large-scale HPC systems including the deployment of compute, networking , and storage. + Develop and improve our ecosystem… more
- NVIDIA (Santa Clara, CA)
- …Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ground-breaking AI & ... methodologies in HPC networking deployments. + Share insights on improving networking strategies for substantial AI and deep learning infrastructure. What we… more
- NVIDIA (Santa Clara, CA)
- …security in the compute domain + Networking Patterns Mastery: Understand and apply networking patterns at a chassis, rack, cluster and data center level in ... We are now looking for a Senior Solution Network Architect, Enterprise Products! Join the... Network Architect, where your passion and expertise in networking , compute hardware, storage, and cloud-native software will be… more
- NVIDIA (Santa Clara, CA)
- …problems like memory or networking + Create benchmarking and simulation technologies for AI system or GPU cluster + Partner with HW architects to propose new ... for AI researchers and SW/HW teams running AI workload in GPU cluster . As a...cluster job scheduling (Slurm or Kubernetes), storage and networking + Experience with NVIDIA GPUs, CUDA Programming and… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You will be responsible for the entire lifecycle of our AI… more
- NVIDIA (Santa Clara, CA)
- …Do you want to be part of a team that brings new Artificial Intelligence ( AI ) hardware and software technologies to production in customer data centers? As part of ... What you will be doing: + Working with NVIDIA AI Native and Consumer Internet customers on large data...Internet customers on large data center GPU server and networking system deployments as Solution Architect Engineer. Guide customer… more
- NVIDIA (Santa Clara, CA)
- …infrastructure engineer who excels in solving complex orchestration problems in distributed AI /ML systems. What you'll be doing: + Architect, develop, and deploy ... and artifact delivery. + Optimize job scheduling, storage access, and networking across hybrid and multi-cloud Kubernetes environments (eg, OCI, Azure, on-prem).… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and networking , familiarity with software testing and deployment, familiarity with...deploy leading infrastructure solutions for a broad range of AI -based applications that affect core data science. For two… more
- NVIDIA (Santa Clara, CA)
- … Manager/Base Command Manager clusters is a definite plus. + Proficiency with cluster networking including InfiniBand and Spectrum-X NVIDIA is widely considered ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...a few to several thousands of nodes, and streamlines cluster provisioning, workload management, and infrastructure monitoring. It provides… more
- NVIDIA (Santa Clara, CA)
- …hardware (such as GPUs, ETH/IB networking components, storage, etc.) within extensive AI and HPC cluster settings. + Practical knowledge of NVIDIA systems ... performance testing, AI benchmarking, and more. + Practical involvement in cluster administration and coordination (SLURM, K8s, etc.). We have some of the most… more
- NVIDIA (Santa Clara, CA)
- …of Ethernet, InfiniBand, RoCE, NVLink interconnects, and large-scale cluster networking . + Understanding of SuperPod architecture, AI datacenter scaling, and ... be a subject matter expert-promoting and teaching large-scale customers about NVIDIA's purpose-built AI networking solutions for the AI Factory. This… more
- NVIDIA (Santa Clara, CA)
- We are seeking a highly skilled Senior Network Automation Architect to design, implement, and oversee end-to-end automation frameworks for provisioning Baremetal and ... across hybrid and multi-cloud environments. This role blends deep networking expertise with infrastructure-as-code principles, enabling rapid, reliable, and secure… more
- Insight Global (San Jose, CA)
- …scalability, performance efficiency, and operational automation across compute, storage, and networking layers. . Develop and integrate AI -driven workflows, ... Deploy and manage enterprise-grade Kubernetes clusters (EKS/AKS), implementing advanced networking , multi-tenancy, autoscaling policies, and cluster lifecycle… more
- NVIDIA (Santa Clara, CA)
- …and telemetry frameworks. + Familiarity with GPU computing (CUDA), large-scale AI /HPC workloads, NVLink, Grace, and cluster -level deployment/management. + ... NVIDIA is seeking a Senior Manager to lead our System Software SWAT...validated fix across firmware, Linux kernel / device drivers, networking , and virtualization. You will build and mentor the… more
- NVIDIA (Santa Clara, CA)
- …every new AI -powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs). ... and PCIe devices + Experience working with hardware clusters, distributed system, networking , GPU interconnects (PCie, NVlink), node and cluster interconnect… more
- pony.ai (Fremont, CA)
- Founded in 2016 in Silicon Valley, Pony. ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies ... world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony. ai is an industry leader in the commercialization of autonomous driving… more