- Cisco (Research Triangle Park, NC)
- …and communicate advanced technical concepts. A talented and passionate engineer comfortable working in high-pressure, large-scale enterprise environments. What You ... world. You will be a technical leader in the Infrastructure Services organization, building and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in… more
- Meta (Bellevue, WA)
- …networking, communication libraries, and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever-increasing use cases of AI. This results in a dramatic… more
- Meta (Menlo Park, CA)
- …and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, test and ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever-increasing use cases of AI. This results in a dramatic… more
- NVIDIA (Santa Clara, CA)
- …and intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and ... years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC ...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- NVIDIA (Santa Clara, CA)
- …and intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and ... of distributed storage services. + Design, implement an on-prem AI / HPC infrastructure supplemented with cloud...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- Meta (Menlo Park, CA)
- …host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever-increasing use cases of AI. This results in a dramatic… more
- Meta (Columbus, OH)
- …and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active member ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever-increasing use cases of AI. This results in a dramatic… more
- Samsung SDS America (Ridgefield Park, NJ)
- …Data Center Storage Engineer with exposure to High Performance Computing (HPC) and GPU Infrastructure. The ideal candidate will design, implement, and ... our expertise in Managed Cloud Services, Cloud Security, and AI innovation. We're proud to play a pivotal role...manage cutting-edge storage and backup solutions, HPC infrastructure, GPU clusters, etc. This role… more
- Meta (Menlo Park, CA)
- …in multiple locations. **Required Skills:** Software Engineer, Systems ML - HPC Specialist Responsibilities: 1. Apply relevant AI and machine learning ... **Summary:** Meta is seeking an AI Software Engineer to join our...The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will… more
- Novo Nordisk (Lexington, MA)
- …lives for a living. Are you ready to make a difference? The Position The HPC Engineer III will make significant contributions to the research of life changing ... medical devices at Novo Nordisk. The HPC (Operation) Engineer plays an instrumental role...maintaining, and supporting the data-, middleware- and IT infrastructure used by the R&ED units across Novo Nordisk… more
- General Dynamics Information Technology (Annapolis Junction, MD)
- …**Experience:** 10+ years of related experience **US Citizenship Required:** Yes **Job Description:** HPC Systems Engineer GDIT is seeking a TS cleared HPC ... the subject matter expert (SME) for a range of HPC solutions and apply your systems administrative expertise to...+ Install, configure, test, and maintain system management tools, infrastructure, and server applications + Participate in the design… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most...systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with ethernet switches, NICs ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...diverse team today! As a member of the Hardware Infrastructure Farm team, you will provide leadership in the… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence ... (AI) hardware and software technologies to production in customer...GPU server and networking system deployments as Solution Architect Engineer. Guide customer discussions on network design, compute/storage and… more
- NVIDIA (Santa Clara, CA)
- …see how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for AI Infrastructure, you will join a dedicated team that is ... systems for performance evaluations. + Conduct performance benchmarking of AI infrastructure with industry-standard models and frameworks...of experience. + Proficiency in Python and C++ for AI and HPC applications. + Experience using… more
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and ... You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting...GPUs cluster. + Deep understanding of GPU computing and AI infrastructure. + Passion for solving complex… more
- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are currently hiring an AI/ML Infrastructure Software Engineer at NVIDIA to join our Hardware ... Infrastructure team. As an Engineer, you will play a crucial role in boosting...related field, with 5+ years of proven experience in AI/ML and HPC workloads and infrastructure… more
- NVIDIA (Santa Clara, CA)
- …you'll work alongside world-class engineers solving some of the hardest challenges in AI infrastructure. You'll have the opportunity to contribute directly to ... We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA,...Production Deployments: Assist in debugging and performance tuning large-scale AI workloads in cloud and HPC environments,… more
- Meta (Menlo Park, CA)
- …approach to hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer, NPI AI Lead Responsibilities: 1. Lead the bring-up, validation, ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP)...Qualifications:** Preferred Qualifications: 14. 5+ years of experience supporting AI / HPC system architecture at rack level and… more
- Meta (Austin, TX)
- …Meta Silicon hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer, NPI AI Responsibilities: 1. Lead the bring-up, validation, and ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP)...**Preferred Qualifications:** 16. Proficiency in High-Performance Computing (HPC) or AI system architecture at rack… more