• Senior GPU and HPC

    NVIDIA (Santa Clara, CA)
    …cluster and network telemetry. + Work on software that manages NVLINK topography across GPU clusters. + Build automated test infrastructure that we use to ... NVIDIA is hiring engineers to scale up its AI Infrastructure . We expect you to have a strong programming background, knowledge of datacenter hardware, operations,… more
    NVIDIA (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Principal Engineer - HPC , AI…

    Cisco (San Jose, CA)
    Principal Engineer - HPC , AI Infrastructure...of enterprise-grade AI infrastructure . As a principal engineer within our GPU and CUDA Runtime ... Technology InterestAI or Artificial Intelligence, Internet & Mass Scale Infrastructure + Job Id1445895 **This position requires a hybrid...+ PhD is a plus, especially with research in GPU systems, compilers, or HPC . **Message to… more
    Cisco (07/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Cluster Engineer - EDA

    NVIDIA (Santa Clara, CA)
    …the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA and ... Join our engineering team and collaborate with researchers and infrastructure teams to ensure our GPU clusters...ahead of new technologies and effective approaches in the HPC infrastructure fields. Ways to stand out… more
    NVIDIA (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI- HPC Cluster Engineer

    NVIDIA (Santa Clara, CA)
    …+ Minimum of 6 years of experience crafting and operating large scale compute infrastructure . + Experience with AI/ HPC job schedulers and orchestrators, such as ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...ahead of new technologies and effective approaches in the HPC and AI/ML infrastructure fields. Ways to… more
    NVIDIA (07/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance Engineer

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... in Artificial Intelligence, High Performance Computing and Visualization. The GPU , our invention, serves as the visual cortex of...scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of… more
    NVIDIA (08/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance Engineer - AI…

    NVIDIA (Santa Clara, CA)
    …upon which every new AI-powered application is built. We are seeking a Sr. HPC Performance engineer to join our team of scientists and engineers passionate ... using low level acceleration and scaling strategies such as GPU porting, data structure innovations, distributed learning technologies +...in digital biology and beyond + Collaborate with multiple HPC , AI infrastructure , and research teams +… more
    NVIDIA (10/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - HPC

    NVIDIA (Santa Clara, CA)
    …a Senior Software Engineer to join our mission to continue improving our HPC infrastructure . Our team builds and operates sophisticated infrastructure to ... reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC...to provide better tools to build and manage this infrastructure . Ideal candidate is strong in software development, designing… more
    NVIDIA (08/27/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Santa Clara, CA)
    …technologies in a multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC services. - Experience ... following programming languages: C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks....the cloud computing delivery model as it relates to HPC . - Knowledge of the underlying infrastructure more
    Amazon (09/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI and ML Storage Infra Software…

    NVIDIA (Santa Clara, CA)
    …make a lasting impact on the world. We are currently hiring an AI/ML Storage Infrastructure Software Engineer at NVIDIA to join our Capability Systems team. As ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...with 6+ years of shown experience in AI/ML and HPC workloads and infrastructure . + Hands-on experience… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Principal AI and ML Infra Software Engineer

    NVIDIA (Santa Clara, CA)
    …Infra Software Engineer , GPU Clusters at NVIDIA to join our Hardware Infrastructure team. As an Engineer , you will have a pivotal role in enhancing ... deficiencies, facilitating groundbreaking AI and ML research on GPU Clusters. Together, we can craft potent, effective, and...Hands-on experience in using or operating High Performance Computing ( HPC ) grade infrastructure as well as in-depth… more
    NVIDIA (08/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior ML Storage Engineer - GPU

    NVIDIA (Santa Clara, CA)
    …NVIDIA. Join our engineering team and collaborate with researchers, AI engineers, and Infrastructure teams to ensure our GPU clusters perform efficiently, scale ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...are seeking a highly skilled and experienced Sire Reliability Engineer to design, deploy, and manage high speed storage… more
    NVIDIA (07/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior ML Platform Engineer , AI…

    NVIDIA (Santa Clara, CA)
    …imagination and intelligence. Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...years of experience designing and operating large scale compute infrastructure + Experience with AI/ HPC advanced job… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer , AI…

    NVIDIA (Santa Clara, CA)
    …take our products to market, we need a dedicated and motivated System Software Engineer who is passionate about AI Infrastructure . You will collaborate with ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...What we need to see: + Passionate about AI infrastructure and performance optimization. + 3+ years in software… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer , Foundation Model…

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... as C++ for efficient system development. + Strong experience with large-scale GPU clusters, HPC environments, and job scheduling/orchestration tools (eg, SLURM,… more
    NVIDIA (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - AI…

    Meta (Menlo Park, CA)
    …software stack around NCCL (NVIDIA Collective Communications Library), which enables multi- GPU and multi-node data communication through HPC -style collectives. ... of the following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure , high performance computing,… more
    Meta (08/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer , NCCL…

    NVIDIA (Santa Clara, CA)
    …out from the crowd: + Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp ... in Artificial Intelligence, High Performance Computing and Visualization. The GPU , our invention, serves as the visual cortex of...runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner… more
    NVIDIA (10/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer - Storage

    NVIDIA (Santa Clara, CA)
    …of ground breaking projects. What You'll Be Doing: + Design, implement an on-prem HPC infrastructure supplemented with cloud computing to support the growing IT ... our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play...HPC and AI solution technologies from CPU's and GPU 's to high speed interconnects and supporting software +… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    …software and systems engineers to help us develop and operate our enterprise GPU infrastructure management systems across Clouds. In this role, you will ... closely with the broader NVIDIA team to operate, design and build infrastructure management systems, Kubernetes operators, and end-to-end HPC integration… more
    NVIDIA (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Principal Network Engineer - DC and AI…

    NVIDIA (Santa Clara, CA)
    …the architecture, design, and deployment of global-scale DCs inter-connects and fabric for HPC , AI, and GPU computing clusters. + Develop high-performance data ... We are seeking a highly skilled Principal Network Engineer to join our dynamic team to build...latency and high reliability. + Partner with system, OS, GPU , and HPC teams to deliver scalable,… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Performance Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is in search of a highly skilled Senior Storage Performance Engineer to join our ambitious team in Santa Clara, CA. This role is essential as we continue to ... push the boundaries of AI and HPC technologies. You will have the chance to create,...and analyze complex benchmarks to optimize performance across NVIDIA's infrastructure stack. Your efforts will directly impact the efficiency… more
    NVIDIA (09/25/25)
    - Save Job - Related Jobs - Block Source