- NVIDIA (Santa Clara, CA)
- … analysis, optimization, and modeling to define the architecture and design of NVIDIA's DGX Cloud clusters. The ideal candidate will have a deep understanding of ... the methodology to conduct end to end performance analysis of critical AI applications running on large...will work closely with the multi-functional teams to define DGX Cloud cluster architecture for different CSPs,… more
- NVIDIA (Santa Clara, CA)
- Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing ... Engineers to design and develop tools for AI application performance analysis. Your work will enable AI researchers to...to work efficiently with a wide variety of DGXC cloud AI systems as they seek out opportunities for… more
- NVIDIA (Santa Clara, CA)
- …AI innovation powering breakthroughs in research, autonomous vehicles, robotics, and more. The DGX Cloud team builds and operates the AI infrastructure that ... Manager for Technical Program Management team to lead a high-impact team within our DGX Cloud Infrastructure organization. You will play a critical role in… more
- NVIDIA (Santa Clara, CA)
- We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA's high- performance GPU ... to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you'll work on ground breaking technology that powers the… more
- NVIDIA (Santa Clara, CA)
- …building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity to support our customers' journeys ... bare-metal, accelerated compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem. What you'll be doing: + Design,… more
- NVIDIA (Santa Clara, CA)
- …engineering practices to ensure high efficiency and availability of AI systems. As a senior DGX Cloud AI Infrastructure software engineer at NVIDIA, you ... Joining NVIDIA's DGX Cloud AI Efficiency Team means...leads the way in groundbreaking developments in Artificial Intelligence, High- Performance Computing, and Visualization. The GPU, our invention, serves… more
- NVIDIA (Santa Clara, CA)
- …GPU deep learning. What you will be doing: + You will be part of an DGX Cloud team responsible for production systems that enable large scalable GPU clusters to ... to ensure production AI clusters run reliability and consistently with maximum performance . Evaluating system failures and improving services based on a well-defined… more
- NVIDIA (Santa Clara, CA)
- …AI Infrastructure Engineers at NVIDIA ensure that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and ... through careful preparation and planning while keeping an eye on capacity, latency and performance . What you'll be doing: + Lead a team of software and AI engineers… more
- NVIDIA (Santa Clara, CA)
- …systems, tooling, and data infrastructure that enable operation of our GPU cloud services. We are enabling engineering teams to innovate while proactively ... services that surface security signals and automate enforcement across multi-tenant cloud environments. + Develop and operate risk management workflows that… more
- NVIDIA (Santa Clara, CA)
- …that automates GPU asset provisioning, configuration, and lifecycle management across cloud providers. You'll contribute to this platform to build end-to-end ... with cluster management systems (Kubernetes, SLURM) + Understanding of performance , security and reliability in complex distributed systems. Familiarity with… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a passionate member to join our DGX Cloud Engineering Team as a Cloud Software Engineer. In this role, you will play a significant part ... guide the future of AI & GPUs in the Cloud . NVIDIA DGX Cloud is...These services have requirements for high security & maximum performance to support extensive AI workloads. + Design, build,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring ... Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a significant role in...kernel & platform driver teams distributed globally. + Improve performance , fix bugs across user and kernel stack, and… more
- NVIDIA (Santa Clara, CA)
- …We're looking for a Senior Full-Stack Software Engineer to join our DGX Cloud AI Infrastructure team and help deliver the next-generation user experience ... AI innovation powering breakthroughs in research, autonomous vehicles, robotics, and more. The DGX Cloud team builds and operates the AI infrastructure that… more
- NVIDIA (Santa Clara, CA)
- …enterprise customers. + Technical Expertise: Deep understanding of AI/ML infrastructure, high- performance computing (HPC) and networking ; cloud technologies ... products for the company. We are looking for a Senior Product Manager that will drive the roadmap and...and solutions for NVIDIA's Enterprise Infrastructure products using NVIDIA DGX and NVIDIA Networking. + Create GTM Collaterals and… more
- NVIDIA (Santa Clara, CA)
- …expanding ecosystem of data center platform & node designs. From single node HGX/ DGX systems all the way up to large multi-node NVLink domain rack architectures. ... These designs have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. Each bringing together the full power of NVIDIA GPUs, NVIDIA… more
- NVIDIA (Santa Clara, CA)
- …company and establish teams with the most thoughtful people in the world. NVIDIA DGX , HGX, and MGX servers deliver the world's leading solutions for enterprise AI ... development of scalable full-stack applications using modern frameworks and cloud -native technologies. + Establish technical direction and standard methodologies for… more
- NVIDIA (Santa Clara, CA)
- …crafting NVIDIA's GPUs and SoCs into groundbreaking platforms for autonomous machines, Cloud and Data Centers, Deep learning, High- Performance Computing, Gaming, ... improve the silicon validation process, which will help meet upcoming performance , adaptability, and safety industry standards. + Ensure interoperability with… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA's core generative AI technologies. This includes NVIDIA GPU architectures, DGX systems, high- performance networking (InfiniBand), CUDA-X libraries, NeMo ... capabilities and strong value proposition. + Understanding of large-scale system performance optimization, container orchestration (eg, Kubernetes), and Cloud … more
- NVIDIA (Santa Clara, CA)
- …lasting impact on the world! We are seeking a highly skilled and hard-working Senior Test Architect to join our multifaceted Enterprise Software QA team. This role ... ability to deliver robust, secure, and high-performing solutions for AI, HPC, and cloud -scale systems. You will: + Define End-to-End Test Strategy: Own and drive the… more
- NVIDIA (Santa Clara, CA)
- NVIDIA data center platforms/solutions, such as DGX , MGX, HGX and PCIe, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. ... be the cross-section between execution and strategy, leading a team of Senior TPMs driving impactful programs and delivering measurable results across many functions… more