AI HPC System Performance Jobs in Cupertino, CA

114 jobs (page 1)

Staff Product Manager, HBM

Micron Technology, Inc. (San Jose, CA)

…position in the Artificial Intelligence (AI), Machine Learning (ML) and High Performance Computing (HPC) business segments. You will be working on innovative ... you will be charged with defining and accomplishing the strategy for a High Performance Memory product portfolio that will further fortify Micron's leadership…

DirectEmployers Association (11/13/25)
Principal Product Manager, HBM

Micron Technology, Inc. (San Jose, CA)

…in growing the Artificial Intelligence (AI), Machine Learning (ML) and High-Performance Computing (HPC) business segments. You will be working on innovative ... of Work (SOWs), business term sheets, and other customer-facing documents for high-performance memory products. + Represent the Product Management team in Product…

DirectEmployers Association (11/13/25)
AI / HPC System…

Meta (Menlo Park, CA)

…fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system, including …

Meta (11/06/25)
Senior HPC and AI Networking…

NVIDIA (Santa Clara, CA)

…fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools…

NVIDIA (09/03/25)
Senior AI and ML HPC Cluster…

NVIDIA (Santa Clara, CA)

…with AI / HPC workflows that use MPI + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for continual learning ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning…

NVIDIA (10/19/25)
Senior AI - HPC Cluster Engineer…

NVIDIA (Santa Clara, CA)

…analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high-performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning…

NVIDIA (10/30/25)
Senior Engineer - AI and HPC…

NVIDIA (Santa Clara, CA)

…, time-series databases, and large-scale monitoring systems. + Familiarity with AI/ML pipelines, GPU-based workloads, and HPC environments. + Experience ... teams to optimize observability for model training, inference workloads, and HPC performance. + Leverage machine learning and statistical techniques…

NVIDIA (10/22/25)
Senior Software Architect, AI…

NVIDIA (Santa Clara, CA)

…architecture group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, ... and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new…

NVIDIA (10/30/25)
AI / HPC Network Engineering Manager

Meta (Menlo Park, CA)

…These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage engineers…

Meta (10/16/25)
Senior Solution Architect, HPC…

NVIDIA (Santa Clara, CA)

…Be Doing: + Primary responsibilities will include building and enabling robust AI / HPC infrastructure for customers + Support operational and reliability aspects ... of large-scale AI clusters, focusing on performance at scale,...in working with customers + Expertise with parallel file systems (eg Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects…

NVIDIA (09/17/25)
Sr. Worldwide Specialist Solutions Architect,…

Amazon (Santa Clara, CA)

…computing and its potential to overcome some of the biggest challenges in High Performance Computing (HPC)? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems, underlying…

Amazon (09/11/25)
Senior HPC Cluster Engineer - EDA

NVIDIA (Santa Clara, CA)

…analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks, and ... deploy, and operate GPU Compute Clusters for EDA and high-performance computing workloads used across multiple teams and projects.... systems such as Lustre and GPFS for AI / HPC workload. + Familiarity with metrics collection…

NVIDIA (09/17/25)
Senior Product Architect, HPC…

NVIDIA (Santa Clara, CA)

…management, and fabric scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + ... engine of modern Artificial Intelligence, Advanced Networking, and High Performance Computing (HPC) - the biggest technology...Published work, patents, or advanced certifications in networking or HPC systems. NVIDIA is widely considered to…

NVIDIA (10/02/25)
Senior System Software Engineer - Genomics…

NVIDIA (Santa Clara, CA)

…concurrency control, memory management and scalability. + Strong understanding of computer system architecture and operating systems. Ways To Stand Out from ... healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as...integrating genomic solutions into mainstream healthcare. As a healthcare HPC engineer, you will join a dynamic development team…

NVIDIA (11/11/25)
Senior Software Architect - Deep Learning…

NVIDIA (Santa Clara, CA)

…vision? What you will be doing: + Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems. + Design and ... implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative...NVSHMEM, OpenSHMEM, UCX, UCC). + Deep understanding of operating systems, computer and system architecture. + Solid…

NVIDIA (11/04/25)
Senior Solutions Architect, HPC…

NVIDIA (Santa Clara, CA)

…Machine Learning ecosystems. You'll be called on to help architect and scale high-performance, distributed AI infrastructure on-prem or in the cloud built with ... validation and debugging of large-scale GPU clusters focused on performance. As part of the Solution Architecture organization, we...metal level, all the way up to the operating system, software stack, and application level. + Share knowledge…

NVIDIA (10/01/25)
Sr. Software Development Engineer, HPC/ML…

Amazon (Cupertino, CA)

Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most...building networking solutions for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS. We…

Amazon (11/10/25)
Senior HPC Architect

NVIDIA (Santa Clara, CA)

…improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment and bringup of…

NVIDIA (10/08/25)
Senior GPU and HPC Infrastructure Engineer…

NVIDIA (Santa Clara, CA)

…of performance, security and reliability in complex distributed systems. Familiarity with system level architecture, data synchronization, fault ... , and excellent communication and planning abilities. Experience working with High Performance Computing (HPC), GPUs, and high-performance networking (RDMA,…

NVIDIA (10/09/25)
System Software Architect, HPC…

NVIDIA (Santa Clara, CA)

…that guide us to be the best we can be. We are seeking a highly motivated system network architect to join our team of experts and take part in shaping the future of ... high performance DGX SuperPOD. The ideal candidate is self motivated,...server infrastructure builds, accelerated computing workloads and GPU enabled AI applications + Crafting and evaluating DevOps automation scripts…

NVIDIA (10/23/25)