- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are ... be doing: + Primary responsibilities will include deploying, managing, and validating AI/HPC infrastructure in Linux-based environments for new and existing…
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI/HPC advanced job ... are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation ... of distributed storage services. + Design, implement an on-prem AI/HPC infrastructure supplemented with cloud computing to ... are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of ... Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information + Collaborate…
- NVIDIA (Santa Clara, CA)
- …Software Engineer to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to ... to provide better tools to build and manage this infrastructure. Ideal candidate is strong in software development, designing ... and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as…
- NVIDIA (Santa Clara, CA)
- …the choice, join our diverse team today! As a member of the Hardware Infrastructure Farm team, you will provide leadership in the design and implementation of ground ... efficiency, and performance and drive foundational improvements and automation to improve engineer's productivity. As a Site Reliability Engineer, you are…
- Amazon (Santa Clara, CA)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... technologies in a multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC services. - Experience…
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence ... center GPU server and networking system deployments as Solution Architect Engineer. Guide customer discussions on network design, compute/storage and support bring…
- NVIDIA (Santa Clara, CA)
- …design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering infrastructure teams on the ... highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to craft global and dynamic HPC clusters used by…
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex ... peripheral device development (PCIe or NVMe) and building compute infrastructure to support High Memory and High performance computing…
- NVIDIA (Santa Clara, CA)
- …infrastructure and tools to enable NVIDIA's AV program. We are seeking a motivated Senior Engineer to join our team in building and scaling our cloud-native ... which powers 100s of micro-services and large scale HPC clusters (15k+ GPUs). You'll play a critical role in driving infrastructure innovation across our organization. Ideal candidates will have…
- NVIDIA (Santa Clara, CA)
- NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in ... 10+ years of full-time industry experience in large-scale MLOps and AI infrastructure; + Proven experience designing and optimizing distributed training systems with…
- LinkedIn (Mountain View, CA)
- …be hybrid in LinkedIn's Sunnyvale, CA campus. About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU ... infrastructure for machine learning (ML) and AI workloads. In ... of experience designing and managing large-scale, distributed systems or HPC environments, with at least 3+ years focused on…
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... is open to on-site and hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services…
- NVIDIA (Santa Clara, CA)
- …amplify human imagination and intelligence. Join us today! As a member of the GPU/HPC Infrastructure team, you will provide leadership in the design and ... to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads…
- NVIDIA (Santa Clara, CA)
- …Make the choice, join our diverse team today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ... automation to improve researchers' productivity. As a Site Reliability Engineer, you are responsible for the big picture of ... You will also be maintaining and building deep learning AI-HPC GPU clusters at scale and supporting our researchers…
- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are looking for an outstanding engineer for a Senior Performance Engineer role for at scale AI system ... workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems specialists to architect, develop…
- NVIDIA (Santa Clara, CA)
- …storage systems, and ensuring low-latency data access for high-performance computing (HPC) and AI/ML workloads. Production Engineers at NVIDIA ensure that our ... automation frameworks, capacity management, and launch reviews. + Maintain storage infrastructure once live by monitoring availability, latency, and system health,…
- NVIDIA (Santa Clara, CA)
- …ecosystem to power AI at scale! We are seeking a highly technical and creative Senior Technical Marketing Engineer to join our team to showcase the innovations ... Marketing. + 7+ years of experience in deep learning engineering, HPC systems, AI infrastructure, or technical evangelism roles. + Strong grasp of distributed…