• AI / HPC Systems

    Meta (Olympia, WA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more
    Meta (03/22/25)
    - Save Job - Related Jobs - Block Source
  • Hardware Systems Engineer, AI NPI

    Meta (Bellevue, WA)
    …end-to-end system validation strategy (hardware and software), with a focus on various AI / HPC hardware systems in datacenter applications. 2. Lead the ... algorithms, and OOP). **Preferred Qualifications:** Preferred Qualifications: 17. Proficiency in High- Performance Computing ( HPC ) or AI system architecture… more
    Meta (05/07/25)
    - Save Job - Related Jobs - Block Source
  • Systems Development Eng (AWS Generative…

    Amazon (Seattle, WA)
    …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
    Amazon (03/05/25)
    - Save Job - Related Jobs - Block Source
  • Technical Program Manager, AI Network Infra

    Meta (Bellevue, WA)
    AI product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible ... deliver on shared goals. 10. The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network… more
    Meta (05/08/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Seattle, WA)
    …operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. AWS Infrastructure Services owns the design, planning, ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...the current customer experience as well as developing improved systems for future designs. You will work directly with… more
    Amazon (05/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer, Annapurna…

    Amazon (Seattle, WA)
    Description We are seeking an experienced engineer to work on distributed AI /ML systems . This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most...building networking solutions that for Machine Learning (ML) and High- Performance Computing ( HPC ) workloads on AWS. We… more
    Amazon (03/14/25)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect, Networking - Cloud Service…

    NVIDIA (Redmond, WA)
    …workshops, etc. + Analyze and develop solutions for customer performance issues for both AI workload and systems performance . What we need to see: + ... networking and help develop accelerated computing networking solutions for AI /ML and HPC with our Hyperscaler customers....systems in general including but not limited to performance testing/tuning, benchmarking, etc. + Strong systems more
    NVIDIA (04/09/25)
    - Save Job - Related Jobs - Block Source
  • Technology Evangelist (Cloud)

    Pacific Northwest National Laboratory (Richland, WA)
    …content (eg, blogs, whitepapers, presentations). + Specialized technical/functional (eg, Cloud/ HPC computing, Security, AI ) experience. Marketing Prowess + ... content (eg, blogs, whitepapers, presentations). + Specialized technical functional (eg, Cloud/ HPC computing, Security, AI ) experience. + Experience guiding… more
    Pacific Northwest National Laboratory (05/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Runtime Software Development Engineer,…

    Amazon (Seattle, WA)
    …As a Runtime Software Development Engineer you will have experience with high- performance Linux drivers, HPC technologies including: libfabric, MPI, and ... Description At AWS AI our vision is to make deep learning...for customers to quickly get started with running high performance and cost-effective inference and training. The Neuron team… more
    Amazon (03/19/25)
    - Save Job - Related Jobs - Block Source
  • Sr Product Manager - Technical

    Amazon (Seattle, WA)
    …digital transformation across several customer workloads including AI /ML, generative AI , databases, Big Data analytics, SAP, HPC , Edge, and more. ... Customers around the world rely on Amazon EC2 to provide elastic, high performance , secure, and cost effective resources to scale their infrastructure to keep up… more
    Amazon (03/06/25)
    - Save Job - Related Jobs - Block Source
  • GenAI Specialist Solutions Architect, Amazon…

    Amazon (Seattle, WA)
    …modernizing customer requirements to the cloud - Practical experience in High Performance Computing ( HPC ) and/or distributed training, performance profiling ... responsibilities We are looking for a strong Solution Architect with a Data & AI background to enable new capabilities for our customers to deploy GenAI workloads on… more
    Amazon (03/07/25)
    - Save Job - Related Jobs - Block Source