• DGX Cloud Automation

    NVIDIA (Santa Clara, CA)
    …for a passionate member to join our DGX Cloud Engineering Team as a Cloud Software Engineer . In this role, you will play a significant part in helping to ... craft and guide the future of AI & GPUs in the Cloud . NVIDIA DGX Cloud is a cloud platform tailored for AI tasks, enabling organizations to transition AI… more
    NVIDIA (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX Cloud Software…

    NVIDIA (Santa Clara, CA)
    …building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity to support our customers' journeys ... compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem. What you'll be...automate it where the ROI of building and maintaining automation is worth it. + Practice sustainable blameless incident… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Bare Metal…

    NVIDIA (Santa Clara, CA)
    …GPU deep learning. What you will be doing: + You will be part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to ... competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience and deep understanding of managing… more
    NVIDIA (09/29/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will ... cloud enabling technologies like Kubernetes and OpenStack. DGX Cloud SRE at NVIDIA ensures that...system health. + Scale systems sustainably through mechanisms like automation , and evolve systems by pushing for changes that… more
    NVIDIA (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Infrastructure Security Engineer

    NVIDIA (Santa Clara, CA)
    …workload isolation, Zero Trust). + Ability to partner effectively across central security, and DGX Cloud teams. Ways To Stand Out From The Crowd: + Expertise ... NVIDIA is looking for a Sr Infrastructure Security Engineer who will design and implement security best...design and implement security best practices for on-premise and cloud access, keeping in mind boundaries that securely enable… more
    NVIDIA (10/31/25)
    - Save Job - Related Jobs - Block Source
  • Principal Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    The NVIDIA DGX Cloud organization is looking for software engineering talent to build NVIDIA's accelerated compute infrastructure. This includes software to ... in responses to real-time operational events. + Build network and systems automation software for managing a multi-tenant cloud infrastructure. + Participate… more
    NVIDIA (10/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX AI Cloud Performance…

    NVIDIA (Santa Clara, CA)
    Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing ... to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for...and build consensus + Passion for "it just works" automation , eliminating repetitive tasks, and enabling team members Ways… more
    NVIDIA (09/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Network Security Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Sr Network Security Engineer who will design and implement security best practices for on-premise and cloud access, keeping in mind ... the implementation and management of security solutions that protect our cloud and on-prem network infrastructure, support advanced workload scalability, and align… more
    NVIDIA (11/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Production Engineer

    NVIDIA (Santa Clara, CA)
    …continuous delivery and deployment, as well as open-source cloud -enabling technologies like Kubernetes, containers, and virtualization. Their responsibilities ... Storage Production Engineers at NVIDIA ensure that our internal and external-facing GPU cloud services meet reliability and uptime goals as promised to the users… more
    NVIDIA (11/12/25)
    - Save Job - Related Jobs - Block Source
  • Senior Network Engineer - DGX

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Network Engineer to develop a cloud network infrastructure. The goal is to craft a reliable, scalable and efficient network to ... To achieve this goal, we are looking for an engineer who has a deep understanding of L3 underlay...+ Lead the overall architecture and design of our cloud network infrastructure including intra-DC, inter-DC, Corp IT, and… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior DevOps Service Reliability Operations…

    NVIDIA (Santa Clara, CA)
    …Service Reliability Operations Center, to provide extraordinary levels of support for our Cloud products and services. As a key member of the CIS Team (Compute ... large-scale production systems. 3+ years of experience in high-availability Internet, Cloud , or Data Center environments (Systems Administration, SRE, or NOC). +… more
    NVIDIA (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Network Operations Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Network Operations Engineer to support and maintain our cloud and datacenter network infrastructures. This network serves the ... and Artificial Intelligence. In this role, the Senior Network Operations Engineer will remediate critical alerts within defined SLAs, triage production impacting… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    We are seeking a AI Infrastructure Engineer to integrate third-party infrastructure partners into NVIDIA's operational excellence programs. This cross-functional ... should possess experience in delivering production infrastructure across various cloud providers, including hands-on experience in building and managing this… more
    NVIDIA (10/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …database, capacity management, continuous delivery and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures ... that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and at the same time enabling developers to make… more
    NVIDIA (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer , AI…

    NVIDIA (Santa Clara, CA)
    We are looking for a Senior AI Infrastructure Engineer (AI Tooling) to design and build the backend systems and infrastructure powering our internal AI tools and ... with Incident Commanders, incident response, and SRE teams to integrate AI-driven automation and analytics into operational workflows + Design, develop, and maintain… more
    NVIDIA (10/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Data Ingestion…

    NVIDIA (Santa Clara, CA)
    NVIDIA is seeking a Senior Software Engineer to build a worldwide network of fast, efficient, and reliable data transfer systems. The goal is to enable NVIDIA AI ... spanning the areas of orchestration, service modeling, API development, monitoring, and automation + Build highly reliable distributed systems that our customers can… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior GPU and HPC Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    …that automates GPU asset provisioning, configuration, and lifecycle management across cloud providers. You'll contribute to this platform to build end-to-end ... automation of datacenter operations, break/fix, and lifecycle management for...in architecting and managing large-scale distributed systems, independent of cloud providers. Deep knowledge of datacenter operations and GPU… more
    NVIDIA (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , AI…

    NVIDIA (Santa Clara, CA)
    NVIDIA DGX Cloud is a managed, multi- cloud...is looking for a passionate member to join our DGX Engineering Team as a Senior Software Engineer . ... services and virtualization frameworks that come together to form our NVIDIA DGX Cloud Reference Architecture. These services have requirements for high… more
    NVIDIA (09/19/25)
    - Save Job - Related Jobs - Block Source
  • Forward Deployed Engineer , AI Accelerator

    NVIDIA (Santa Clara, CA)
    …virtual collaboration + Infrastructure Management: Deploy and manage AI workloads across DGX Cloud , customer data centers, and CSP environments using Kubernetes, ... NVIDIA is seeking a Forward Deployed Engineer to join our AI Accelerator team, working...DGX systems, CUDA, NeMo, Triton, or NIM + Cloud platforms hands-on experience with AWS, Azure, or GCP… more
    NVIDIA (09/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Infrastructure Software Engineer

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Infrastructure Software Engineer for Deep Learning Libraries! NVIDIA's Deep Learning Libraries Group is seeking excellent software ... of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models. Join...testing and analysis of our codebases + Building scalable automation for build, test, integration, and release processes for… more
    NVIDIA (09/02/25)
    - Save Job - Related Jobs - Block Source