- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI / HPC Networks at scale 2. Provide continual… more
- NVIDIA (Santa Clara, CA)
- …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... runtime designs, and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX,… more
- Meta (Menlo Park, CA)
- …5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques **Minimum ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead… more
- NVIDIA (Santa Clara, CA)
- …challenges and provide outstanding HPC solutions. + Collaborate closely with hardware engineering , CUDA engineering , and AI research groups to apply the ... healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as...integrating genomic solutions into mainstream healthcare. As a healthcare HPC engineer, you will join a dynamic development team… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, ... and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly… more
- NVIDIA (Santa Clara, CA)
- …guide us to be the best we can be. We are seeking a highly motivated system network architect to join our team of experts and take part in shaping the future of high ... networking solutions + Work on multi-functional teams to provide Ethernet network expertise to server infrastructure builds, accelerated computing workloads and GPU… more
- NVIDIA (Santa Clara, CA)
- …Familiarity with datacenter automation, advanced network protocols, and supporting large HPC or AI clusters in production environments. + Understanding of ... , or related field, or equivalent experience. + 8+ years of proven experience in AI / HPC Infrastructure. + Familiarity with AI / HPC job schedulers and… more
- NVIDIA (Santa Clara, CA)
- …networking problems for scalable AI clusters. This is a hands-on network engineering position focused on the architecture, design, development and deployment ... We are seeking a highly skilled Principal Network Engineer to join our dynamic team to...and deployment of global-scale DCs inter-connects and fabric for HPC , AI , and GPU computing clusters. +… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Factories are built to accelerate AI and HPC workloads. At their core the Digital Twin (physics-based model used to design, validate, and operate ... be shaping the digital and physical foundation of NVIDIA's AI Factories, engineering virtual replicas that not...to stand out from the crowd + Background in AI / HPC data center cooling, including immersion and… more
- Meta (Menlo Park, CA)
- …many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network and storage. The team invests significantly ... develop and help productionize high performance software & hardware technologies for AI at datacenter scale. We achieve this via concurrent design and optimization… more
- NVIDIA (Santa Clara, CA)
- …experience serving enterprise customers. + Technical Expertise: Deep understanding of AI /ML infrastructure, high-performance computing ( HPC ) and networking ; ... centers are powering the most sophisticated, groundbreaking research and AI products for the company. We are looking for...+ BS or MS degree in Computer Science, Computer Engineering , or similar field (or equivalent experience) and 12+… more
- Amazon (Cupertino, CA)
- …for the entire AI industry. You'll join a diverse AWS Hardware Engineering team of software, hardware, and network engineers, supply chain specialists, ... design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing… more
- Amazon (Cupertino, CA)
- …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
- Amazon (Cupertino, CA)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. Utility Computing (UC) AWS Utility Computing (UC) ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
- NVIDIA (Santa Clara, CA)
- … Architecture team is seeking experienced candidates in the extensive domain of network architecture & engineering . This is a hands-on architecture position ... Lead the architecture, design, and deployment of global-scale backbone and fabric for HPC , AI , and GPU computing clusters. + Develop high-performance data center… more
- Cisco (Milpitas, CA)
- …an agile team engaged in the design, development and execution of tests to qualify network performance for AI /ML capability. You will be a part of our solutions ... per week **Meet the Team** The Cisco Distributed System Engineering (DSE) group is at the forefront of developing...the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application.… more
- NVIDIA (Santa Clara, CA)
- …your resume, you're expressing interest in one of our 202 6 Systems Software Engineering Internships. We'll review resumes on an ongoing basis, and a recruiter may ... computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting… more
- NVIDIA (Santa Clara, CA)
- …+ Experience working with engineering or academic research community supporting HPC or AI + Practical experience with high performance networking: ... runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner...to get an end to end understanding of the AI networking stack. Are you ready for to contribute… more
- NVIDIA (Santa Clara, CA)
- At NVIDIA, we are pioneers in making the impossible achievable, particularly within AI , ML, and HPC . Joining our team as a Storage & Networking Product Engineer ... high-performance networking architectures for storage environments, ensuring low-latency data paths for AI /ML and HPC workloads. + Configure and tune RDMA,… more
- NVIDIA (Santa Clara, CA)
- …Data Platform Architect. We serve and collaborate directly with NVIDIA's rapidly growing AI , HW, and SW engineering and research teams across the company. ... for distributed data platform and observability systems for large-scale AI and HPC clusters and workloads and...the world. What You'll Be Doing: + Collaborate with AI , HW, and SW engineering and research… more