• Senior HPC Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior HPC Engineer to join its Infrastructure Specialists team. Academic, commercial and government groups around the world are ... be doing: + Primary responsibilities will include deploying, managing, and validating AI/ HPC infrastructure in Linux-based environments for new and existing… more
    NVIDIA (06/12/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance…

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of...Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information + Collaborate… more
    NVIDIA (05/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    … Software Engineer to join our mission to continue improving our HPC infrastructure . Our team builds and operates sophisticated infrastructure to ... to provide better tools to build and manage this infrastructure . Ideal candidate is strong in software development, designing...and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as… more
    NVIDIA (05/28/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …the choice, join our diverse team today! As a member of the Hardware Infrastructure Farm team, you will provide leadership in the design and implementation of ground ... efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are… more
    NVIDIA (07/03/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Santa Clara, CA)
    …large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... technologies in a multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC services. - Experience… more
    Amazon (06/12/25)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer , Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...peripheral device development (PCIe or NVMe) and building compute infrastructure to support High Memory and High performance computing… more
    Amazon (04/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation ... years of experience designing and operating large scale compute infrastructure + Experience with AI/ HPC advanced job...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (07/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer - Autonomous…

    NVIDIA (Santa Clara, CA)
    infrastructure and tools to enable NVIDIA's AV program. We are seeking a motivated Senior Engineer to join our team in building and scaling our cloud-native ... which powers 100s of micro-services and large scale HPC clusters (15k+ GPUs). You'll play a critical role...(15k+ GPUs). You'll play a critical role in driving infrastructure innovation across our organization. Ideal candidates will have… more
    NVIDIA (05/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in ... 10+ years of full-time industry experience in large-scale MLOps and AI infrastructure ; + Proven experience designing and optimizing distributed training systems with… more
    NVIDIA (06/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior High Performance Computing…

    SLAC National Accelerator Laboratory (Menlo Park, CA)
    Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... is open to on-site and hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services… more
    SLAC National Accelerator Laboratory (07/26/25)
    - Save Job - Related Jobs - Block Source
  • Sr Staff Engineer , ML…

    LinkedIn (Mountain View, CA)
    …be hybrid in LinkedIn's Sunnyvale, CA campus. About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU ... infrastructure for machine learning (ML) and AI workloads. In...of experience designing and managing large-scale, distributed systems or HPC environments, with at least 3+ years focused on… more
    LinkedIn (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Developer Advocate Engineer

    NVIDIA (Santa Clara, CA)
    …a variety of programming models, frameworks, and tools. We are looking for a Senior Developer Advocate Engineer to own the technical engagements for a rapidly ... High Performance Computing ( HPC ) and Artificial Intelligence (AI) are key markets...and Bootcamps. + Assess each hackathon's computing and software infrastructure . + Write comprehensive internal feedback reports and find… more
    NVIDIA (07/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …out from the crowd: + Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp ... runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner...applications. We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with… more
    NVIDIA (07/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving...software components that are critical building blocks for EC2 infrastructure . Every instance in EC2 is running some type… more
    Amazon (07/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Observability Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA's AI Infrastructure organization is seeking a Senior AI Observability Engineer to help architect and implement distributed observability systems for ... AI and HPC clusters. We serve and collaborate directly with NVIDIA's...Practical experience in machine learning, deep learning, open-source software, infrastructure technologies, and GPU technology. + Prior experience in… more
    NVIDIA (07/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , AI…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking ... a Senior Software Engineer to lead the development...and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation of AI training and… more
    NVIDIA (07/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Performance Engineer

    NVIDIA (Santa Clara, CA)
    …how you can make a lasting impact on the world. We are looking for an outstanding engineer for a Senior Performance Engineer role for at scale AI system ... workflows and develop new, leading differentiated solutions. You will interact with HPC , OS, CPU and GPU compute, and systems specialist to architect, develop… more
    NVIDIA (04/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Networking Application…

    NVIDIA (Santa Clara, CA)
    …and independent individuals to join our team! We are searching for a senior networking application engineer with domain expertise in Infiniband and/or NVLINK ... deploy cutting-edge NVIDIA networking platforms to run AI and HPC workloads + Address sophisticated and highly visible customer...perl, python, and shell scripts) + Knowledge in Cloud infrastructure and AI workflows + Familiarity AI workloads +… more
    NVIDIA (06/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Production Engineer

    NVIDIA (Santa Clara, CA)
    …storage systems, and ensuring low-latency data access for high-performance computing ( HPC ) and AI/ML workloads. Production Engineers at NVIDIA ensure that our ... automation frameworks, capacity management, and launch reviews. + Maintain storage infrastructure once live by monitoring availability, latency, and system health,… more
    NVIDIA (05/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Marketing Engineer

    NVIDIA (Santa Clara, CA)
    …ecosystem to power AI at scale! We are seeking a highly technical and creative Senior Technical Marketing Engineer to join our team to showcase the innovations ... Marketing. + 7+ years of experience in deep learning engineering, HPC systems, AI infrastructure , or technical evangelism roles. + Strong grasp of distributed… more
    NVIDIA (07/17/25)
    - Save Job - Related Jobs - Block Source