• Network Engineer , HPC

    Meta (Menlo Park, CA)
    …and efficiency in our global network . **Required Skills:** Network Engineer , HPC Systems Network Strategy Responsibilities: 1. Design, ... meeting our demands; you will be responsible for conceiving, developing, and deploying software, hardware and network systems and tools that improve reliability… more
    Meta (08/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Performance Engineer

    NVIDIA (Santa Clara, CA)
    …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
    NVIDIA (08/04/25)
    - Save Job - Related Jobs - Block Source
  • Principal Engineer - HPC , AI…

    Cisco (San Jose, CA)
    Principal Engineer - HPC , AI Infrastructure Apply (https://jobs.cisco.com/jobs/Login?projectId=1445895) + Location:San Jose, California, US + Area of ... maintain device drivers and runtime components for GPU and network components of the systems . + Working...PhD is a plus, especially with research in GPU systems , compilers, or HPC . **Message to applicants… more
    Cisco (07/19/25)
    - Save Job - Related Jobs - Block Source
  • AI/ HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: … more
    Meta (09/19/25)
    - Save Job - Related Jobs - Block Source
  • AI/ HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/ HPC Systems Performance Engineer Responsibilities: 1. Active ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: … more
    Meta (08/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Product Architect, HPC

    NVIDIA (Santa Clara, CA)
    …high-performance environments. + Published work, patents, or advanced certifications in networking or HPC systems . NVIDIA is widely considered to be one of the ... engine of modern Artificial Intelligence, Advanced Networking, and High Performance Computing ( HPC ) - the biggest technology breakthroughs of our time. We're on a… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior GPU and HPC Infrastructure…

    NVIDIA (Santa Clara, CA)
    …familiarity with software testing and deployment, familiarity with distributed systems , and excellent communication and planning abilities. Experience working with ... High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) are strongly preferred. We also welcome out-of-the-box thinkers who… more
    NVIDIA (10/09/25)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    LTD Global (Berkeley, CA)
    …computing ( HPC ) and data analysis for the organization. Our center provides essential HPC and data systems to more than 10,000 researchers working in areas ... Position overview: We are seeking a Site Reliability Engineer to join our Operations Group. This role...part of a 24/7 operations team that ensures our systems are accessible, reliable, secure, and available to the… more
    LTD Global (09/23/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer , NCCL…

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
    NVIDIA (10/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage and Networking Product…

    NVIDIA (Santa Clara, CA)
    HPC /AI clusters at scale, with hands-on expertise with network topologies and large-scale switch/router deployments. + Familiarity with network ... making the impossible achievable, particularly within AI, ML, and HPC . Joining our team as a Storage & Networking...Joining our team as a Storage & Networking Product Engineer involves being part of a group that fosters… more
    NVIDIA (09/25/25)
    - Save Job - Related Jobs - Block Source
  • Research Data Center Facility Engineer

    Stanford University (Stanford, CA)
    …researchers from a variety of Stanford and SLAC organizations. The majority of the HPC systems are hosted in the Stanford Research Computing Facility (SRCF), ... Research Data Center Facility Engineer **Business Affairs: University IT (UIT), Stanford, California,...Stanford Research Computing. Research Computing offers High Performance Computing ( HPC ) hosting services, computational and data systems ,… more
    Stanford University (10/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , GPU…

    NVIDIA (Santa Clara, CA)
    …wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and network ... crew that develops and maintains software for complex heterogeneous computing systems that power disruptive products in High Performance Computing and Deep… more
    NVIDIA (09/11/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - AI…

    Meta (Menlo Park, CA)
    …Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on ... (eg Large-Scale GenAI/LLM training) from the trainer down to the inter-GPU and network communication layer. And we are seeking for engineers to work on the… more
    Meta (08/01/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - Datacenter networking

    Meta (Menlo Park, CA)
    …Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized ... control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure analysis.We are actively seeking Software… more
    Meta (09/10/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - Datacenter networking

    Meta (Menlo Park, CA)
    …Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized ... control systems , modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure analysis.We are actively seeking Software… more
    Meta (08/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Cloud Services Software Engineer

    NVIDIA (Santa Clara, CA)
    …vital resources and scale to champion innovation. We are seeking a distributed software engineer to join our team! As a Senior engineer , you'll be instrumental ... team of like-minded engineers. What You Will Be Doing: As a software engineer specializing in backend development, you'll work in a dedicated team to enhance… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems . Unlike other NPUs or neural network accelerators in the ... co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of...C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and… more
    quadric.io, Inc (08/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage Product Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA's Enterprise Product Engineering involves crafting, constructing, and maintaining vital systems efficiently and reliably.. As a Senior Storage Product ... Engineer , you will take ownership of NVIDIA's Product Team's...environments. We focus on delivering high-performance, highly available storage systems that scale while enabling developers to innovate rapidly… more
    NVIDIA (09/26/25)
    - Save Job - Related Jobs - Block Source
  • Hardware Engineer I (Co-op) United States

    Cisco (San Francisco, CA)
    Hardware Engineer I (Co-op) United States Apply (https://jobs.cisco.com/jobs/Login?projectId=1449082) + Location:San Jose, California, US + Alternate LocationSan ... data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will impact billions globally. Supply… more
    Cisco (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional OEM business. ... OS, FW and CUDA SW stack from design doc. + Installing and testing various systems OS, server firmware and SW stack. + Drive support for root cause analysis on… more
    NVIDIA (09/24/25)
    - Save Job - Related Jobs - Block Source