• AI / HPC Systems

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
    Meta (04/20/25)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer, Sustaining

    Meta (Menlo Park, CA)
    …hardware and software components, co-design 15. Experience in developing or debugging AI / HPC systems , performance optimizations, including familiarity ... or supporting production hardware at scale 9. Experience in deploying and productionizing AI / HPC systems and/or related components at scale 10. Experience in… more
    Meta (05/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Cluster Engineer

    NVIDIA (Santa Clara, CA)
    …to work effectively with diverse teams and individuals. + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more
    NVIDIA (04/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI - HPC Storage Engineer

    NVIDIA (Santa Clara, CA)
    …designing and operating large scale storage infrastructure. + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Experience ... join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership...solutions to enable runs of demanding deep learning, high performance computing, and computationally intensive workloads. We seek an… more
    NVIDIA (05/07/25)
    - Save Job - Related Jobs - Block Source
  • Postdoctoral Appointee - HPC & AI

    Argonne National Laboratory (Lemont, IL)
    …on designing the communication infrastructure for next-generation High- Performance Computing ( HPC ) and Artificial Intelligence ( AI ) systems . This ... and optimize workload-specialized interconnects and network-aware communication strategies to enhance the performance of AI and HPC workloads. + Implement… more
    Argonne National Laboratory (06/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Observability Architect, AI

    NVIDIA (Santa Clara, CA)
    …looking for a technical leader to define a vision and roadmap for distributed observability systems for large-scale AI and HPC clusters and workloads and ... and visualization to spectacularly improve efficiency, performance , and productivity of AI and HPC workloads. You will lead technical teams to develop,… more
    NVIDIA (05/15/25)
    - Save Job - Related Jobs - Block Source
  • AI Infrastructure Engineer - HPC

    Cisco (Research Triangle Park, NC)
    …and technologies. Preferred Qualifications * Deep understanding of operating systems , computer networks, and high- performance applications. * Established ... Showcase the power of Cisco: our people, products, processes, systems , and data. Please join us and make this...and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in… more
    Cisco (05/24/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Network Engineer

    Meta (Menlo Park, CA)
    …requirements of RDMA workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across ... fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineer Responsibilities: 1. Design, develop, test and… more
    Meta (05/08/25)
    - Save Job - Related Jobs - Block Source
  • HPC SRE Systems Engineer

    Ford Motor Company (Dearborn, MI)
    We are seeking a highly skilled and motivated HPC SRE Systems Engineer to join our growing team. You will be responsible for designing, building, and maintaining ... + Design, implement, and maintain a robust and scalable HPC infrastructure to support containerized AI /ML workloads...Troubleshoot and resolve complex technical issues related to Linux systems , networking, storage, and HPC applications. +… more
    Ford Motor Company (05/28/25)
    - Save Job - Related Jobs - Block Source
  • HPC Systems Admin

    General Dynamics Information Technology (Fairfax, VA)
    …High Speed Networks, Parallel File systems . . Experience running and optimizing HPC performance benchmarks or MPI codes would be a plus. . Experience ... Able to Obtain:** None **Public Trust/Other Required:** NACI (T1) **Job Family:** Systems Engineering **Skills:** High- Performance Computing ( HPC ) Systems more
    General Dynamics Information Technology (06/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Engineer, HPC & GPU

    Samsung SDS America (Ridgefield Park, NJ)
    …highly skilled and experienced Data Center Storage Engineer with exposure to High Performance Computing ( HPC ) and GPU Infrastructure. The ideal candidate will ... for HPC and GPU-intensive workloads. + Evaluate and implement high- performance storage technologies, including NVMe, SSD, parallel file systems (eg,… more
    Samsung SDS America (03/22/25)
    - Save Job - Related Jobs - Block Source
  • Sr. HPC Architect - Hybrid

    Caris Life Sciences (Irving, TX)
    …A Senior HPC Architect is responsible for designing and optimizing high- performance computing ( HPC ) systems , leveraging their expertise in parallel ... analysis tools and techniques to identify and address performance bottlenecks. + Knowledge of HPC hardware...scientific software and other 3rd party software applications on HPC systems + Experience with HPC more
    Caris Life Sciences (03/25/25)
    - Save Job - Related Jobs - Block Source
  • IT Systems Architect I - Networking…

    Mayo Clinic (Rochester, MN)
    …of on-premise Linux solutions and cloud-based technologies, enabling cutting-edge Generative AI (GenAI) and High- Performance Computing ( HPC ) capabilities. ... initiatives, with a focus on supporting Gen AI /LLM and HPC workloads. This includes designing and implementing high- performance network architectures… more
    Mayo Clinic (06/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Worldwide Specialist Solutions Architect,…

    Amazon (Herndon, VA)
    …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
    Amazon (06/12/25)
    - Save Job - Related Jobs - Block Source
  • HPC Subject Matter Expert

    General Dynamics Information Technology (Fairfax, VA)
    …with commonly used HPC applications and services (ie, schedulers, high performance file systems , modules for installing applications, compilers, MPI, OpenMP, ... **Public Trust/Other Required:** None **Job Family:** Scientists **Skills:** High Performance Computing ( HPC ),Researching,Supercomputing **Experience:** 10 + years… more
    General Dynamics Information Technology (06/14/25)
    - Save Job - Related Jobs - Block Source
  • HPC Systems Engineer

    General Dynamics Information Technology (Huntsville, AL)
    …Required:** None **Job Family:** Systems Engineering **Skills:** Complex Systems ,High- Performance Computing ( HPC ) Systems ,Linux,Management Tools, ... of related experience **US Citizenship Required:** Yes **Job Description:** HPC Systems Engineer GDIT is seeking a...+ Participate in the design of information and operational systems + Monitor and test application performance more
    General Dynamics Information Technology (05/17/25)
    - Save Job - Related Jobs - Block Source
  • HPC SME

    General Dynamics Information Technology (Fairfax, VA)
    …with commonly used HPC applications and services (ie, schedulers, high performance file systems , modules for installing applications, compilers, MPI, OpenMP, ... **Public Trust/Other Required:** None **Job Family:** Technology Consulting **Skills:** Computing, HPC ,Information Technology (IT) Systems ,Meeting Organization **Experience:** 20… more
    General Dynamics Information Technology (06/10/25)
    - Save Job - Related Jobs - Block Source
  • Principal HPC Software Engineer

    GliaCell Technologies (MD)
    …develops, tests, deploys, documents, maintains, and enhances complex and diverse software for HPC (high performance computing) systems based upon documented ... requirements. + The HPC systems might include, but are not limited to, processing-intensive analytics, novel algorithm development, manipulation of extremely… more
    GliaCell Technologies (05/13/25)
    - Save Job - Related Jobs - Block Source
  • IT Cloud Architect - GenAI and HPC - Remote

    Mayo Clinic (Rochester, MN)
    …in on-premise Linux solutions and cloud-based technologies, enabling cutting-edge Generative AI (GenAI) and High- Performance Computing ( HPC ) capabilities. ... and research initiatives, with a focus on supporting Gen AI /LLM and HPC workloads. This role involves...The architect will design scalable and resilient solutions for high- performance GPU clusters and HPC environments, optimizing… more
    Mayo Clinic (06/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Architect - Deep Learning…

    NVIDIA (Santa Clara, CA)
    …vision? What you will be doing: + Investigate opportunities to improve communication performance by identifying bottlenecks in today's systems . + Design and ... implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative solutions in HW and SW for our next generation platforms as… more
    NVIDIA (05/05/25)
    - Save Job - Related Jobs - Block Source