- NVIDIA (Santa Clara, CA)
- …for a passionate member to join our DGX Cloud Engineering Team as a Cloud Software Engineer . In this role, you will play a significant part in helping to ... craft and guide the future of AI & GPUs in the Cloud . NVIDIA DGX Cloud is a cloud platform tailored for AI tasks, enabling organizations to transition AI… more
- NVIDIA (Santa Clara, CA)
- …building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity to support our customers' journeys ... compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem. What you'll be...automate it where the ROI of building and maintaining automation is worth it. + Practice sustainable blameless incident… more
- NVIDIA (Santa Clara, CA)
- …GPU deep learning. What you will be doing: + You will be part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to ... competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience and deep understanding of managing… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an outstanding, passionate, and talented Senior AI Infrastructure Engineer to join our DGX Cloud group. This engineering role will ... cloud enabling technologies like Kubernetes and OpenStack. DGX Cloud SRE at NVIDIA ensures that...system health. + Scale systems sustainably through mechanisms like automation , and evolve systems by pushing for changes that… more
- NVIDIA (Santa Clara, CA)
- …workload isolation, Zero Trust). + Ability to partner effectively across central security, and DGX Cloud teams. Ways To Stand Out From The Crowd: + Expertise ... NVIDIA is looking for a Sr Infrastructure Security Engineer who will design and implement security best...design and implement security best practices for on-premise and cloud access, keeping in mind boundaries that securely enable… more
- NVIDIA (Santa Clara, CA)
- The NVIDIA DGX Cloud organization is looking for software engineering talent to build NVIDIA's accelerated compute infrastructure. This includes software to ... in responses to real-time operational events. + Build network and systems automation software for managing a multi-tenant cloud infrastructure. + Participate… more
- NVIDIA (Santa Clara, CA)
- Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing ... to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for...and build consensus + Passion for "it just works" automation , eliminating repetitive tasks, and enabling team members Ways… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Sr Network Security Engineer who will design and implement security best practices for on-premise and cloud access, keeping in mind ... the implementation and management of security solutions that protect our cloud and on-prem network infrastructure, support advanced workload scalability, and align… more
- NVIDIA (Santa Clara, CA)
- …continuous delivery and deployment, as well as open-source cloud -enabling technologies like Kubernetes, containers, and virtualization. Their responsibilities ... Storage Production Engineers at NVIDIA ensure that our internal and external-facing GPU cloud services meet reliability and uptime goals as promised to the users… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior Network Engineer to develop a cloud network infrastructure. The goal is to craft a reliable, scalable and efficient network to ... To achieve this goal, we are looking for an engineer who has a deep understanding of L3 underlay...+ Lead the overall architecture and design of our cloud network infrastructure including intra-DC, inter-DC, Corp IT, and… more
- NVIDIA (Santa Clara, CA)
- …Service Reliability Operations Center, to provide extraordinary levels of support for our Cloud products and services. As a key member of the CIS Team (Compute ... large-scale production systems. 3+ years of experience in high-availability Internet, Cloud , or Data Center environments (Systems Administration, SRE, or NOC). +… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior Network Operations Engineer to support and maintain our cloud and datacenter network infrastructures. This network serves the ... and Artificial Intelligence. In this role, the Senior Network Operations Engineer will remediate critical alerts within defined SLAs, triage production impacting… more
- NVIDIA (Santa Clara, CA)
- We are seeking a AI Infrastructure Engineer to integrate third-party infrastructure partners into NVIDIA's operational excellence programs. This cross-functional ... should possess experience in delivering production infrastructure across various cloud providers, including hands-on experience in building and managing this… more
- NVIDIA (Santa Clara, CA)
- …database, capacity management, continuous delivery and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures ... that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and at the same time enabling developers to make… more
- NVIDIA (Santa Clara, CA)
- We are looking for a Senior AI Infrastructure Engineer (AI Tooling) to design and build the backend systems and infrastructure powering our internal AI tools and ... with Incident Commanders, incident response, and SRE teams to integrate AI-driven automation and analytics into operational workflows + Design, develop, and maintain… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to build a worldwide network of fast, efficient, and reliable data transfer systems. The goal is to enable NVIDIA AI ... spanning the areas of orchestration, service modeling, API development, monitoring, and automation + Build highly reliable distributed systems that our customers can… more
- NVIDIA (Santa Clara, CA)
- …that automates GPU asset provisioning, configuration, and lifecycle management across cloud providers. You'll contribute to this platform to build end-to-end ... automation of datacenter operations, break/fix, and lifecycle management for...in architecting and managing large-scale distributed systems, independent of cloud providers. Deep knowledge of datacenter operations and GPU… more
- NVIDIA (Santa Clara, CA)
- NVIDIA DGX Cloud is a managed, multi- cloud...is looking for a passionate member to join our DGX Engineering Team as a Senior Software Engineer . ... services and virtualization frameworks that come together to form our NVIDIA DGX Cloud Reference Architecture. These services have requirements for high… more
- NVIDIA (Santa Clara, CA)
- …virtual collaboration + Infrastructure Management: Deploy and manage AI workloads across DGX Cloud , customer data centers, and CSP environments using Kubernetes, ... NVIDIA is seeking a Forward Deployed Engineer to join our AI Accelerator team, working...DGX systems, CUDA, NeMo, Triton, or NIM + Cloud platforms hands-on experience with AWS, Azure, or GCP… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Infrastructure Software Engineer for Deep Learning Libraries! NVIDIA's Deep Learning Libraries Group is seeking excellent software ... of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models. Join...testing and analysis of our codebases + Building scalable automation for build, test, integration, and release processes for… more