  • AI / HPC Systems

    Meta (Nashville, TN)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expect a lossless fabric interconnect. To improve performance of these systems we constantly look…
    Meta (03/22/25)
  • HPC / AI - Kubernetes Engineer

    Deloitte (Nashville, TN)
    …day-to-day operations of the High-Performance Computing (HPC) and AI infrastructure, ensuring all systems meet or exceed requirements for scalability, ... Responsibilities: + System support and management of infrastructure for HPC and AI systems, this...system performance, ensuring the efficient execution of AI models and HPC applications. Implement techniques…
    Deloitte (04/25/25)