• AI / HPC Systems

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
    Meta (11/18/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC System Performance

    Meta (Menlo Park, CA)
    …and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more
    Meta (11/06/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Network Engineering Manager

    Meta (Menlo Park, CA)
    …These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage engineers… more
    Meta (10/16/25)
    - Save Job - Related Jobs - Block Source
  • Research Scientist, AI & Systems

    Meta (Menlo Park, CA)
    …on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance , new model architectures and ... the following areas: Accelerators/GPU architectures, High Performance Computing ( HPC ), Machine Learning Compilers, Training/Inference ML Systems , Model… more
    Meta (11/07/25)
    - Save Job - Related Jobs - Block Source
  • Data Science & AI Librarian, Stanford Law…

    Stanford University (Stanford, CA)
    …same on any machine (Docker) and, when a laptop isn't enough, using campus high- performance computing ( HPC ) or a small cloud server to process larger datasets ... Data Science & AI Librarian, Stanford Law School **School of Law,...for grants. + **Liaise with campus** data science institutes, HPC , and central library data services; use HPC more
    Stanford University (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Technical Program Manager, AI Network Infra

    Meta (Menlo Park, CA)
    AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible ... deliver on shared goals 10. The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network… more
    Meta (11/19/25)
    - Save Job - Related Jobs - Block Source
  • Remote Senior Performance Engineer

    Insight Global (Palo Alto, CA)
    …with GPU architecture and parallel computing. - Background in kernel optimization and HPC systems . - Proficiency in CUDA and familiarity with NVIDIA's ... Description Insight Global is looking to hire a Senior Performance Engineer for a client in the quantum computing...include: - Lead the design and build of specialized HPC environments. - Scale machine learning models on GPU… more
    Insight Global (11/10/25)
    - Save Job - Related Jobs - Block Source
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems . ... Candidates must demonstrate deep technical mastery of Quadric's product ecosystem including HPC Hardware (IP, Chips, Boards), SDK, and various algorithms (NN, DSP,… more
    quadric.io, Inc (08/26/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more
    Meta (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager - Meta…

    Meta (Menlo Park, CA)
    …levels 9. Experience in leading teams working on high performance computing ( HPC ) and AI /ML systems , including: 10. GPU/ASIC-based kernel development and ... systems for our fleet 4. Technical management 5. Experience in systems architecture, performance , workload-analysis and large scale distributed systems more
    Meta (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager, MTIA

    Meta (Menlo Park, CA)
    …10. Experience in leading teams working on high performance computing ( HPC ) and AI /ML systems , including: GPU/ASIC-based kernel development and ... ROCm), distributed systems for large scale training and serving, and systems architecture and performance 11. Accelerator (GPU/ASIC) kernel development and… more
    Meta (09/06/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - Host Networking

    Meta (Menlo Park, CA)
    …networks, powering our global data centers and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ... to join our teams and help build scalable distributed systems , develop innovative solutions to our challenges, and ship...firmware, and software for network devices, transport stacks, and AI workloads 2. Debug complex system-level issues and lead… more
    Meta (11/08/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer - Host Networking

    Meta (Menlo Park, CA)
    …networks, powering our global data centers and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ... to join our teams and help build scalable distributed systems , develop innovative solutions to our challenges, and ship...firmware, and software for network devices, transport stacks, and AI workloads 2. Debug complex system-level issues and lead… more
    Meta (11/08/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, Facebook Open Switching System…

    Meta (Menlo Park, CA)
    …networks, powering our global data centers and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ... to join our teams and help build scalable distributed systems , develop innovative solutions to our challenges, and ship...firmware, and software for network devices, transport stacks, and AI workloads 2. Debug complex system-level issues and lead… more
    Meta (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Research Analytics Scientist

    Stanford University (Stanford, CA)
    …groups' meetings and presentations to assist with identifying promising tools and systems and to discuss their computational challenges and requirements. * Engage ... on the use of a broad set of cyberinfrastructure systems , tools, and software. * Provide support for Stanford...debugging techniques. Working knowledge of at least one mainstream ML/ AI framework and how to execute efficiently in an… more
    Stanford University (10/16/25)
    - Save Job - Related Jobs - Block Source
  • Hardware Engineer I (Co-op) - United States

    Cisco (San Francisco, CA)
    …data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will impact billions globally. Supply ... teams while collaborating on ASIC Design and Verification for reliable, high- performance products. + Drive innovation in System/Board Design, leveraging excellent… more
    Cisco (11/14/25)
    - Save Job - Related Jobs - Block Source