We interpreted Mountain View, CA as Mountain View, CA. Other options include: Mountain View (Contra Costa County), CA
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
- NVIDIA (Santa Clara, CA)
- …team to deliver innovative advances in high- performance computing AI systems . + Responsible for leading our HPC projects' planning, implementation, and ... and validating hardware and software for the Customer AI High- Performance Computing ( HPC ) systems . + Leads, handles, mentors, and builds a very… more
- NVIDIA (Santa Clara, CA)
- …designing and operating large scale storage infrastructure. + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Experience ... join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...solutions to enable runs of demanding deep learning, high performance computing, and computationally intensive workloads. We seek an… more
- NVIDIA (Santa Clara, CA)
- …designing and operating large scale compute infrastructure. + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Working ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek an...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more
- Meta (Menlo Park, CA)
- …requirements of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across ... fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, test and… more
- NVIDIA (Santa Clara, CA)
- …doing: + Work with NVIDIA Product Teams to understand new product requirements including HPC and AI /ML Products. + Finding Optimum Solutions to deploy these ... hosts a heterogeneous mix of machines and devices with various operating systems (Windows/Linux/Android), a multitude of hardware platforms both NVIDIA GPUs and… more
- NVIDIA (Santa Clara, CA)
- …team at NVIDIA. We deliver libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the ... stakes are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this… more
- NVIDIA (Santa Clara, CA)
- …is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the ... We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . We are looking for a Distinguished Software Architect to help co-design our… more
- NVIDIA (Santa Clara, CA)
- …long term maintenance strategy. What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and ... graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a "learning… more
- NVIDIA (Santa Clara, CA)
- …industries + Technical Expertise: Deep understanding of cloud infrastructure, distributed systems , large-scale ML/ HPC workloads, Kubernetes, Slurm, and AWS ... Joining NVIDIA's AI Efficiency Team means contributing to the infrastructure...selecting data storage solutions (HDFS, object storage, distributed file systems , such as Lustre) based on cost, performance… more
- Amazon (Cupertino, CA)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- Amazon (Cupertino, CA)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- Amazon (Cupertino, CA)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- NVIDIA (Santa Clara, CA)
- …experience in performance optimization and benchmarking on large-scale distributed systems + Hands-on experience with NVIDIA GPUs, HPC storage, networking, ... NVIDIA is an industry leader with groundbreaking developments in High- Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention,… more
- Meta (Menlo Park, CA)
- … AI product introductions and AI ops initiatives supporting Meta's growing AI / HPC infrastructure to enable AI product development for our Family of ... & technologies. The ideal candidate will have experience in AI / HPC product development and operations, strong program...Define and track key metrics and key quality and performance indicators and drive cross functional execution of program… more
- NVIDIA (Santa Clara, CA)
- …GH200 superchip provides performance and productivity required for strong scaling for HPC and generative AI workload.Scale out is inherent to design of this ... the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company...issue closure. + Identify new technologies, features to improve performance , functionality, uptime of GPU systems to… more
- NVIDIA (Santa Clara, CA)
- …in their fields (industry and academia) to perform in-depth analysis and optimization of complex AI and HPC algorithms to ensure the best possible AI ... Artificial Intelligence Would you enjoy researching parallel algorithms to accelerate AI workloads on advanced computer architectures? Is it rewarding to… more
- Meta (Menlo Park, CA)
- …in high- performance computation. **Required Skills:** Engineering Manager, PyTorch - AI Acceleration Responsibilities: 1. Grow a team of domain experts within ... **Summary:** AI Acceleration is an org within PyTorch. It's...should have strong technical skills - GPU / ML Systems knowledge is preferred, though not required. We work… more
- NVIDIA (Santa Clara, CA)
- …for a Senior Performance Engineer focused on Deep Learning (DL) & High Performance Computing ( HPC ) to join our team. Our team is responsible for generating ... What you'll be doing: + Plan and execute GPU performance benchmarking across a wide range of HPC...quality, and want to be at the forefront of AI & HPC , we would love for… more