• Lightmatter (Mountain View, CA)
    …processors at the speed of light in extreme-scale data centers for the most advanced AI and HPC workloads. Lightmatter raised $400 million in its Series D round, ... Lightmatter is leading the revolution in AI data center infrastructure, enabling the next giant leaps in human progress. The company invented the world's first… more
    Upward (07/10/25)
    - Save Job - Related Jobs - Block Source
  • AI / HPC Systems

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
    Meta (06/18/25)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer, Sustaining

    Meta (Menlo Park, CA)
    …hardware and software components, co-design 15. Experience in developing or debugging AI / HPC systems , performance optimizations, including familiarity ... or supporting production hardware at scale 9. Experience in deploying and productionizing AI / HPC systems and/or related components at scale 10. Experience in… more
    Meta (06/25/25)
    - Save Job - Related Jobs - Block Source
  • AI Infrastructure Engineer - HPC

    Cisco (San Jose, CA)
    AI Infrastructure Engineer - HPC Apply (https://jobs.cisco.com/jobs/Login?projectId=1443781) + Location:San Jose, California, US + Alternate LocationAnywhere is ... and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in...SaltStack, Puppet and/or Chef + Deep understanding of operating systems , computer networks, and high- performance applications. +… more
    Cisco (07/15/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager - AI

    Meta (Menlo Park, CA)
    …Qualifications: 7. Experience in leading teams working on high performance computing ( HPC ) and AI /ML systems , including: 8. Communication libraries (eg, ... of Meta AI infrastructure! **Required Skills:** Software Engineering Manager - AI Systems Co-Design Responsibilities: 1. Lead and support the communications… more
    Meta (07/02/25)
    - Save Job - Related Jobs - Block Source
  • High- performance AI compute…

    Cisco (San Jose, CA)
    High- performance AI compute engineer Apply (https://jobs.cisco.com/jobs/Login?projectId=1445895) + Location:San Jose, California, US + Area of InterestEngineer - ... infrastructure - we'd love to meet you. **Impact** As **High- performance AI compute engineer** , you will...and **asynchronous programming models** . + Deep understanding of ** HPC workloads** , performance bottlenecks, and **compute/memory… more
    Cisco (07/19/25)
    - Save Job - Related Jobs - Block Source
  • Hardware Systems Engineer, AI

    Meta (Menlo Park, CA)
    …7+ years of experience with one subset of the following AI systems : Accelerator (GPU/ASIC), Performance characterization/optimization/tracing/debugging (eg, ... developing and productizing high- performance software and hardware technologies for AI at datacenter scale.Hardware Systems Engineer in RTP work closely… more
    Meta (06/25/25)
    - Save Job - Related Jobs - Block Source
  • Technical Program Manager, AI Network Infra

    Meta (Menlo Park, CA)
    AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible ... deliver on shared goals 10. The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network… more
    Meta (05/08/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, SystemML - AI Networking

    Meta (Menlo Park, CA)
    …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more
    Meta (07/21/25)
    - Save Job - Related Jobs - Block Source
  • Technical Sourcing Manager, Advanced Thermal…

    Meta (Fremont, CA)
    …and associated system design trade-offs, particularly for AI and High Performance Computing ( HPC ) systems 21. Experience interfacing with internal ... chain organizations related to data center products, infrastructure, rack design, AI , Compute Hardware, or Mechanical Engineering 12. Proven experience building and… more
    Meta (06/25/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, Accelerator Systems

    Meta (Menlo Park, CA)
    …13. Full-stack experience and understanding of AI / HPC systems , from HW/infrastructure through the application layer, performance optimizations, including ... learning domains: hardware accelerators, AI Infrastructure, and/or high performance computing ( HPC ), particularly pertaining to interconnect and collective.… more
    Meta (05/01/25)
    - Save Job - Related Jobs - Block Source
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems . ... Candidates must demonstrate deep technical mastery of Quadric's product ecosystem including HPC Hardware (IP, Chips, Boards), SDK, and various algorithms (NN, DSP,… more
    quadric.io, Inc (06/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr Staff Engineer, ML Infrastructure…

    LinkedIn (Mountain View, CA)
    …parallel file systems , object storage, NVMe over Fabric) to meet performance and capacity requirements for ML workloads. Collaborate with network and storage ... our large-scale GPU infrastructure for machine learning (ML) and AI workloads. In this role, you will be the...8+ years of experience designing and managing large-scale, distributed systems or HPC environments, with at least… more
    LinkedIn (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer, SystemML - Scaling…

    Meta (Menlo Park, CA)
    …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more
    Meta (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Principal Optical System Architect

    Microsoft Corporation (Mountain View, CA)
    …link budget analysis, transceiver technologies, and integration of optics into high- performance AI infrastructure. + Experience designing high performance ... design, leading the way for the next generation of systems and AI super computers. Our mission...and definitionfor industry leading platforms focused on GPU and AI accelerator-basedsolutions, with a focus on high performance more
    Microsoft Corporation (07/11/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Technical Lead

    Cisco (San Jose, CA)
    …agile team engaged in the design, development and execution of tests to qualify network performance for AI /ML capability. You will be a part of our solutions ... to build the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application. We are uniquely… more
    Cisco (06/25/25)
    - Save Job - Related Jobs - Block Source
  • Advisor - Fragment-based Computational Design

    Lilly (San Francisco, CA)
    …contributor to an interdisciplinary initiative spanning structural biology, biophysics, chemistry, and AI systems , with the mission of transforming how fragments ... PanDDA, and the use of scripting to automate their use, ideally in a High- Performance Computing ( HPC ) environment. + Proven track record of FBLD. + Proven… more
    Lilly (05/16/25)
    - Save Job - Related Jobs - Block Source
  • Sr. GTM Specialist Solutions Architect,…

    Amazon (San Francisco, CA)
    …- 5+ years of technical experience in High Performance Computing, AI /ML, Math, Quantum Information Systems and Technologies, or similar accelerated computing ... that helps Startups adopt AWS' Accelerated Computing portfolio (ie HPC , AIML, big data), among others. You will 1/Be...businesses. Mentorship & Career Growth: We're continuously raising our performance bar as we strive to become Earth's Best… more
    Amazon (06/13/25)
    - Save Job - Related Jobs - Block Source
  • ML Engineer, Early Stage Project, X

    Google (Mountain View, CA)
    …with seismic and/or DAS + Large scale optimization/inversion experience + High performance computing ( HPC ) experience. + GCP experience. + Experience in ... eg unit testing, CI/CD and production operations + Use AI for code tools + Work effectively with cross-functional...infrastructure-as-code, eg Terraform + Exposure to productions systems that rely heavily on ML models, and/or experience… more
    Google (05/10/25)
    - Save Job - Related Jobs - Block Source