• AI / HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Lead ... 5. Work with cross-functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques. **Minimum…
    Meta (04/25/24)
  • Senior HPC Engineer, Infrastructure…

    NVIDIA (TX)
    NVIDIA is looking for a Senior HPC Engineer to join its Professional Services team. Academic, commercial and government groups around the world are using ... and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is looking for someone with the ability to…
    NVIDIA (06/19/24)
  • Senior HPC Performance Engineer

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are…
    NVIDIA (05/04/24)
  • HPC Systems Administrator

    The MITRE Corporation (McLean, VA)
    …Technology Division provides multiple corporate-wide services including High Performance Computing (HPC), Enterprise PC and Mobile Solutions, Network Services, ... organizations. Job Description: We are seeking an experienced Linux HPC Systems engineer to join our team! ... intelligence (AI), and advanced computing to deliver AI and HPC services to MITRE organization…
    The MITRE Corporation (04/03/24)
  • HPC Operations Manager - Hardware…

    NVIDIA (Santa Clara, CA)
    …to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering ... intelligence to autonomous cars. We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to…
    NVIDIA (06/12/24)
  • Sr. Systems Development Engineer (AWS…

    Amazon (Seattle, WA)
    …delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. You are intrigued by the continuous release ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build ... You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,…
    Amazon (04/12/24)
  • Storage Infrastructure Engineer

    The MITRE Corporation (McLean, VA)
    …Technology Division provides multiple corporate-wide services including High Performance Computing (HPC), Enterprise PC and Mobile Solutions, Network Services, ... HPC group is looking for an experienced Infrastructure Engineer with a background in designing, implementing, and supporting ... intelligence (AI), and advanced computing to deliver AI and HPC services to MITRE organization…
    The MITRE Corporation (04/03/24)
  • Research Systems Engineer

    University of Oregon (Eugene, OR)
    …and cluster software subsystems including InfiniBand networking, GPFS parallel file systems, and HPC queuing systems. The Research Systems Engineer serves as a ... staff supporting Link Oregon, Oregon's state-wide research and education network. Founded in 1876, the University of Oregon (UO) ... enable the near- and long-term security goals of the HPC systems the Research Systems Engineer will…
    University of Oregon (05/31/24)
  • Production Systems Engineer, Sustaining

    Meta (Menlo Park, CA)
    …hardware requirements and specifications (e.g., configuring hardware components, GPU, memory, network for AI / HPC workloads) **Public Compensation:** ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) ... Responsibilities: 1. Develop robust, industry-leading practices for supporting AI / HPC infrastructure at scale 2. Interface with…
    Meta (06/05/24)
  • GPU Computing Capacity Optimization…

    NVIDIA (Santa Clara, CA)
    …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... usage of all datacenter resources including compute, storage, network and power. You will help build methodologies, tools ... Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Working knowledge of cluster…
    NVIDIA (05/04/24)
  • Senior Software Engineer - High Performance…

    Microsoft Corporation (Redmond, WA)
    …a Senior Software Engineer - High Performance Computing on the HPC (High Performance Computing)/AI (Artificial Intelligence) team, you'll have the opportunity ... to work on cutting-edge technology that powers our cloud AI supercomputers (Azure HPC documentation | Microsoft Learn). You will be working directly with GPU…
    Microsoft Corporation (06/08/24)
  • Software Engineer - Codec Avatar ML Compute…

    Meta (Pittsburgh, PA)
    …to enable groundbreaking research in relightable avatars, full-body avatars, and generative AI for codec avatars. **Required Skills:** Software Engineer - Codec ... with working on the frontiers of research. In this software engineer role on the Codec Avatar ML Compute team, ... root cause analysis through multiple infrastructure layers (compute, storage, network) for HPC clusters and act as…
    Meta (06/23/24)
  • Senior Software Engineer

    Microsoft Corporation (Redmond, WA)
    …to join our team and help design, build and operate the next generation of HPC clusters to help power Microsoft's AI mission. If you love distributed systems ... Seeking opportunities to work on some of the largest AI infrastructure on the planet? Want to help empower ... details of how everything works across cutting-edge storage, network and InfiniBand, this could be the role for…
    Microsoft Corporation (06/20/24)
  • Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    **Summary:** In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on…
    Meta (04/12/24)
  • Senior Software Engineer, GPU…

    NVIDIA (Santa Clara, CA)
    …wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and network ... for Deep Learning frameworks (e.g. NCCL for TensorFlow/PyTorch) and HPC programming interfaces (e.g. UCX for MPI/OpenSHMEM) on GPU ... are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
    NVIDIA (04/16/24)
  • Senior Software Engineer

    Microsoft Corporation (Redmond, WA)
    …platform working collaboratively with many industry partners. As a Senior Software Engineer, you will be critical in designing and delivering the next generations ... of AI training, AI inferencing, virtual desktop, video ... be challenged across a wide spectrum of hardware architectures, network types and processor types. You will help define…
    Microsoft Corporation (06/03/24)
  • Senior Infrastructure Performance and Development…

    NVIDIA (Santa Clara, CA)
    Joining NVIDIA's AI Efficiency Team means contributing to the infrastructure that powers our leading-edge AI research. This team focuses on optimizing efficiency ... and resiliency of ML workloads, as well as developing scalable AI infrastructure tools and services. Our objective is to deliver a stable, scalable environment for…
    NVIDIA (04/16/24)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …Ways to stand out from the crowd: + Have built, deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system ... We are seeking a Sr System Software Engineer to help us build out our scientific ... computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical…
    NVIDIA (06/11/24)
  • Senior Math Libraries Engineer, Iterative…

    NVIDIA (Santa Clara, CA)
    We are looking for a software engineer for our Sparse Linear Algebra team which develops key technologies and libraries such as cuSOLVER, cuSPARSE, cuDSS, and AmgX, ... come and join our team! What you will be doing: + developing scalable HPC math library software for various numerical methods including but not limited to sparse…
    NVIDIA (06/04/24)
  • Pre-Sales Technical Engineer

    Lenovo (Morrisville, NC)
    Pre-Sales Technical Engineer **General Information** Req # WD00066517 Career area: Sales Support Country/Region: United States of America State: North Carolina City: ... fuel the advancement of 'New IT' technologies (client, edge, cloud, network, and intelligence) including server, storage, mobile, software, solutions, and services.…
    Lenovo (06/14/24)