• AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Lead ... 5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques. **Minimum… more
    Meta (04/25/24)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
    Meta (05/11/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Systems Engineer

    General Dynamics Information Technology (Fairfax, VA)
    …Description:** At GDIT, people are our differentiator. Our work depends on a Senior HPC Systems Engineer joining our team to support the National Oceanic and ... Obtain:** None **Job Family:** Systems Engineering **Skills:** High-Performance Computing ( HPC ) Systems,Linux System Administration,Systems Management **Certifications:** None - N/A… more
    General Dynamics Information Technology (03/13/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance Engineer

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
    NVIDIA (05/04/24)
    - Save Job - Related Jobs - Block Source
  • Senior High-Performance Computing ( HPC

    Microsoft Corporation (Redmond, WA)
    …working collaboratively with many industry partners. As a Senior High-Performance Computing ( HPC ) Software Engineer , you will be critical in designing and ... delivering the next generations of AI training, AI inferencing, virtual desktop, video...be challenged across a wide spectrum of hardware architectures, network types and processor types. You will help define… more
    Microsoft Corporation (05/22/24)
    - Save Job - Related Jobs - Block Source
  • Senior HPC DGX Systems Technical Support…

    NVIDIA (Westford, MA)
    We are seeking a motivated Senior HPC DGX Systems Technical Support Engineer passionate about AI , GPU, networking and datacenter technologies, to provide ... installations, maintenance, or operations for a broad scope of AI hardware and software products. As a primary point...+ InfiniBand, RDMA and GPU Technology + Clustering or HPC Data-Center technologies including Upper Layer Protocols (ie, MPI,… more
    NVIDIA (03/07/24)
    - Save Job - Related Jobs - Block Source
  • HPC Systems Administrator

    The MITRE Corporation (Mclean, VA)
    …Technology Division provides multiple corporate-wide services including High Performance Computing ( HPC ), Enterprise PC and Mobile Solutions, Network Services, ... organizations. Job Description: We are seeking an experienced Linux HPC Systems engineer to join our team!...intelligence ( AI ), and advanced computing to deliver AI and HPC services to MITRE organization… more
    The MITRE Corporation (04/03/24)
    - Save Job - Related Jobs - Block Source
  • HPC Operations Manager - Hardware…

    NVIDIA (Santa Clara, CA)
    …to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering ... intelligence to autonomous cars. We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to… more
    NVIDIA (03/13/24)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer , AI

    Renesas (Godair, MO)
    …Job Description Global Pay grade: Up to Global Pay Grade P5 Division: HPC AI & Cloud Engineering Division (291800000000) Location: Dusseldorf, Munich, Dresden ... Information Technology, or related field. + Experience working as a Site Reliability Engineer or in a similar role. + Programming skills in languages such as… more
    Renesas (05/24/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Systems Development Engineer (AWS…

    Amazon (Seattle, WA)
    …delivering and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
    Amazon (04/12/24)
    - Save Job - Related Jobs - Block Source
  • Storage Infrastructure Engineer

    The MITRE Corporation (Mclean, VA)
    …Technology Division provides multiple corporate-wide services including High Performance Computing ( HPC ), Enterprise PC and Mobile Solutions, Network Services, ... HPC group is looking for an experienced Infrastructure Engineer with a background in designing, implementing, and supporting...intelligence ( AI ), and advanced computing to deliver AI and HPC services to MITRE organization… more
    The MITRE Corporation (04/03/24)
    - Save Job - Related Jobs - Block Source
  • GPU Computing Capacity Optimization…

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... usage of all datacenter resources including compute , storage, network and power. You will help build methodologies, tools...Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Working knowledge of cluster… more
    NVIDIA (05/04/24)
    - Save Job - Related Jobs - Block Source
  • IT SR Systems Engineer - Remote

    Mayo Clinic (Rochester, MN)
    …The Research and Specialty Services Unit is looking for an IT Senior Systems Engineer with demonstrated HPC skills as well as solid Linux systems administration ... background to help support the growing Research HPC , GPU, and Generative AI environments. Much...Under general supervision and guidance, the Senior IT Systems Engineer is responsible for the maintenance and support of… more
    Mayo Clinic (05/14/24)
    - Save Job - Related Jobs - Block Source
  • Senior Research Software Engineer

    NYU Rory Meyers College of Nursing (New York, NY)
    …and research computing projects, in alignment with AI , Research Cloud, and HPC needs. The Senior Research Software Engineer works closely with NYU ... Position Summary The Senior Research Software Engineer provides software and systems engineering support to...Intelligence projects, an investment to research cloud for bursting HPC jobs, and a dedicated High-Speed Research Network more
    NYU Rory Meyers College of Nursing (03/28/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - Codec Avatar ML Compute…

    Meta (Pittsburgh, PA)
    …to enable groundbreaking research in relightable avatars, full-body avatars, and generative AI for codec avatars. **Required Skills:** Software Engineer - Codec ... with working on the frontiers of research.In this software engineer role on the Codec Avatar ML Compute team,...root cause analysis through multiple infrastructure layers (compute, storage, network ) for HPC clusters and act as… more
    Meta (03/24/24)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer

    Microsoft Corporation (Redmond, WA)
    …we build the software to expose this platform as an Azure service. As a Principal Software Engineer in the Azure HPC / AI team, you will play a critical role ... Azure High Performance Computing and Artificial Intelligence ( AI ) Platform ( HPC / AI ) group...(GPUs) and accelerators, as well as a state-of-the-art scale-out network infrastructure to enable these workloads. We collaborate with… more
    Microsoft Corporation (05/17/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - High Performance…

    Microsoft Corporation (Redmond, WA)
    …a Senior Software Engineer - High Performance Computing on the HPC (High Performance Computing)/ AI (Artificial Intelligence) team, you'll have the opportunity ... to work on cutting-edge technology that powers our cloud AI supercomputers (Azure HPC documentation | Microsoft Learn). You will be working directly with GPU… more
    Microsoft Corporation (03/09/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - Scaling…

    Meta (Menlo Park, CA)
    **Summary:** In this role, you will be a member of the Network . AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on… more
    Meta (04/12/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , GPU…

    NVIDIA (Santa Clara, CA)
    …wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and network ... for Deep Learning frameworks (eg NCCL for TensorFlow/Pytorch) and HPC programming interfaces (eg UCX for MPI/OpenSHMEM) on GPU...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (04/16/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    Microsoft Corporation (Redmond, WA)
    …platform working collaboratively with many industry partners. As a Senior Software Engineer , you will be critical in designing and delivering the next generations ... of AI training, AI inferencing, virtual desktop, video...be challenged across a wide spectrum of hardware architectures, network types and processor types. You will help define… more
    Microsoft Corporation (03/04/24)
    - Save Job - Related Jobs - Block Source