• AI / HPC Systems

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
    Meta (11/18/25)
    - Save Job - Related Jobs - Block Source
  • HPC / AI Platform Engineering

    Lilly (Indianapolis, IN)
    …infrastructure! The Cloud and Connectivity organization is seeking experts and leaders in AI and High- Performance Computing ( HPC ), and Nvidia DGX server ... of advanced Linux platforms supporting AI and HPC workloads, managing Nvidia DGX systems using...infrastructure. + Hands-on experience in using or operating High Performance Computing ( HPC ) grade infrastructure as well… more
    Lilly (11/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI and ML HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …with AI / HPC workflows that use MPI + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for continual learning ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more
    NVIDIA (10/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Cluster Engineer…

    NVIDIA (Santa Clara, CA)
    …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high- performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning… more
    NVIDIA (10/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Engineer - AI and HPC

    NVIDIA (Santa Clara, CA)
    …, time-series databases, and large-scale monitoring systems . + Familiarity with AI /ML pipelines, GPU-based workloads , and HPC environments. + Experience ... teams to optimize observability for model training, inference workloads, and HPC performance . + Leverage machine learning and statistical techniques… more
    NVIDIA (10/22/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC System Performance

    Meta (New York, NY)
    …and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
    Meta (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC and AI Networking…

    NVIDIA (Santa Clara, CA)
    …fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools… more
    NVIDIA (12/03/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Network Engineering Manager

    Meta (Menlo Park, CA)
    …These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage engineers… more
    Meta (10/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Architect, AI

    NVIDIA (Santa Clara, CA)
    …group at NVIDIA has openings for software architects in the field of AI and high- performance networking and system software. We research, develop, and ... and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new… more
    NVIDIA (10/30/25)
    - Save Job - Related Jobs - Block Source
  • HPC Sr. Scientific Software Engineer (IT@JH…

    Johns Hopkins University (Baltimore, MD)
    …Deployment and Design** + Develop and refine deployment strategies for scientific software on HPC and AI systems . + Design computational workflows, selecting ... Agents). _Performance Optimization_ + Analyze and optimize the performance of AI models and HPC...Ensure compliance with security and regulatory standards for all HPC and AI systems . _In… more
    Johns Hopkins University (11/21/25)
    - Save Job - Related Jobs - Block Source
  • HPC Scientific Software Engineer (IT@JH…

    Johns Hopkins University (Baltimore, MD)
    …Deployment and Design_ + Develop and refine deployment strategies for scientific software on HPC and AI systems . + Design computational workflows, selecting ... Agents). _Performance Optimization_ + Analyze and optimize the performance of AI models and HPC...Ensure compliance with security and regulatory standards for all HPC and AI systems . **Minimum… more
    Johns Hopkins University (12/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Solution Architect, HPC

    NVIDIA (Santa Clara, CA)
    …Be Doing: + Primary responsibilities will include building and enabling robust AI / HPC infrastructure for customers + Support operational and reliability aspects ... of large-scale AI clusters, focusing on performance at scale,...in working with customers + Expertise with parallel file systems (eg Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more
    NVIDIA (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Herndon, VA)
    …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. - Current, active...- Experience implementing AWS services - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
    Amazon (11/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Arlington, VA)
    …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
    Amazon (09/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Applications Engineer

    NVIDIA (Westford, MA)
    …profiling, benchmarking, monitoring, and optimizing scientific or AI /ML applications on multi-GPU systems . + Working knowledge of NVIDIA HPC SDK , CUDA-Q , ... applications on the HPC + quantum environment. + Profile and tune performance for GPU-accelerated and hybrid workloads using tools such as NVIDIA Nsight, nvprof,… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, Google Cloud, HPC

    Google (Boulder, CO)
    …lifecycles, building tools, architecting and developing software for scalable, distributed systems , including data platform, AI /ML, and infrastructure. + ... products, and different customer segments/use cases of the emerging AI Compute tech stack. **About the job** The Google...of our customers and helping shape the future of HPC . As the Senior Manager in High Performance more
    Google (12/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Cluster Engineer - EDA

    NVIDIA (Santa Clara, CA)
    …analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... deploy, and operate GPU Compute Clusters for EDA and high- performance computing workloads used across multiple teams and projects.... systems such as Lustre and GPFS for AI / HPC workload. + Familiarity with metrics collection… more
    NVIDIA (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Engineer

    Texas A&M University System (College Station, TX)
    …patching, and performance tuning.* Oversee networking, security, and infrastructure for HPC systems .* Lead the development of specialized HPC computing ... research and super computing needs. As a Senior High Performance Computing Engineer ( HPC ), you will provide...expertise and consultation for the design and deployment of HPC systems . Get in on the ground… more
    Texas A&M University System (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer, HPC Solutions

    Google (Kirkland, WA)
    …to accelerate customer success by enabling them to run their most demanding High Performance Computing ( HPC ) and Machine Learning (ML) workloads on Google Cloud ... of experience in Software Development. + Experience in High Performance Computing. **Preferred qualifications:** + PhD degree in Computational...future of scientific computing by leading the convergence of AI and HPC . The AI more
    Google (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Product Architect, HPC

    NVIDIA (Santa Clara, CA)
    …management, and fabric scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + ... engine of modern Artificial Intelligence, Advanced Networking, and High Performance Computing ( HPC ) - the biggest technology...Published work, patents, or advanced certifications in networking or HPC systems . NVIDIA is widely considered to… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source