- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, RTDA or LSF… more
- NVIDIA (Santa Clara, CA)
- …solutions on any of the leading Cloud environment [AWS, Azure or GCP] + Experience with AI / HPC cluster job schedulers such as SLURM, LSF + In depth ... InfiniBand with IBOIP and RDMA + Background with Software Defined Networking and AI / HPC cluster networking + Familiarity with deep learning frameworks like… more
- NVIDIA (Santa Clara, CA)
- …, HW, and SW engineering and research teams to define a vision and roadmap for AI / HPC cluster observability. + Architect and lead teams to develop, test, and ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior HPC Engineer to join its...the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is ... customers, partners and internal teams to analyze, define, and implement large-scale AI / HPC projects. These efforts include a combination of networking, system… more
- NVIDIA (Santa Clara, CA)
- …a variety of HPC or EDA workloads. + Solid understanding of cluster configuration managements tools such as Ansible. + Proficiency in Perl for maintaining legacy ... NVIDIA is the leader in AI , machine learning and datacenter acceleration. NVIDIA is...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more
- NVIDIA (Santa Clara, CA)
- …Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ... You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting...cluster . + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex technical challenges… more
- Amazon (Cupertino, CA)
- …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... solutions that for Machine Learning (ML) and High-Performance Computing ( HPC ) workloads on AWS. We are seeking an experienced...and TPUs. This role is on the forefront of AI /ML, we spend a good deal of the day… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... works on multimodal foundation models, large-scale robot learning, embodied AI , and physics simulation. Our past projects include Eureka… more
- NVIDIA (Santa Clara, CA)
- …Python, Rust, Angular, React. Ways to stand out from the crowd: + Experience in HPC and/or AI training. + Knowledge of LLMs and agentic workflows. + Have ... We are now looking for a Senior Software Architect. Do you love to provide...in our journey of building software for most performant AI servers. What you'll be doing: + Research, design… more
- NVIDIA (Santa Clara, CA)
- …telemetries, scale out cluster , test plan development, track record in developing AI tools and NLP, DevOps, CI/CD experience to join our platform SWQA team. What ... We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional...OEM business. NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains… more
- NVIDIA (Santa Clara, CA)
- …directly impact NVIDIA's ability to deliver robust, secure, and high-performing solutions for AI , HPC , and cloud-scale systems. You will: + Define End-to-End ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...world! We are seeking a highly skilled and hard-working Senior Test Architect to join our multifaceted Enterprise Software… more