• HPC / AI - Kubernetes

    Deloitte (Hermitage, TN)
    HPC / AI Engineer (Federal) Job Summary: The HPC AI Engineer will be responsible for managing the day-to-day operations of the High-Performance ... Computing ( HPC ) and AI infrastructure, ensuring all systems...of experience in the design, support, and management of Kubernetes + 3+ years of In-depth experience of at… more
    Deloitte (04/25/25)
    - Save Job - Related Jobs - Block Source
  • AI Infrastructure Engineer

    Cisco (Research Triangle Park, NC)
    …and communicate advanced technical concepts. A talented and passionate engineer comfortable working in high-pressure, large-scale enterprise environments. What You ... and managing the internal NVIDIA DGX and Cisco-UCS based AI platforms at Cisco. You will provide leadership in...* 7+ years of previous experience deploying and administrating HPC clusters * Familiar with GPU resource scheduling managers… more
    Cisco (02/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior HPC Technical Support…

    NVIDIA (Durham, NC)
    We are seeking a motivated Senior HPC Support Engineer - Ethernet, passionate about data center and networking technologies, to provide comprehensive solutions ... (multi-distro), with the focus on NVIDIA Ethernet Switching technologies and our AI End-to-End Solutions. + Responding to customer product support inquiries via… more
    NVIDIA (02/05/25)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect, HPC Systems…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer . Do you want to be part of a team that brings new Artificial Intelligence ... ( AI ) hardware and software technologies to production in customer...GPU server and networking system deployments as Solution Architect Engineer . Guide customer discussions on network design, compute/storage and… more
    NVIDIA (04/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Make the choice, join our diverse team today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation of ... automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of...You will also be maintaining and building deep learning AI - HPC GPU clusters at scale and supporting… more
    NVIDIA (03/26/25)
    - Save Job - Related Jobs - Block Source
  • AI and ML Infra Software Engineer

    NVIDIA (Santa Clara, CA)
    …you can make a lasting impact on the world. We are currently hiring an AI /ML Infrastructure Software Engineer at NVIDIA to join our Hardware Infrastructure team. ... As an Engineer , you will play a crucial role in boosting...related field, with 5+ years of proven experience in AI /ML and HPC workloads and infrastructure. +… more
    NVIDIA (04/09/25)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Marketing Engineer

    NVIDIA (Santa Clara, CA)
    …how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for AI Infrastructure, you will join a dedicated team that is passionate ... of experience. + Proficiency in Python and C++ for AI and HPC applications. + Experience using...orchestration, and deploying multi-node GPU clusters using Slurm and Kubernetes . + Solid understanding of network protocols, distributed system… more
    NVIDIA (04/30/25)
    - Save Job - Related Jobs - Block Source
  • AI /ML Site Reliability Engineer

    Lockheed Martin (King Of Prussia, PA)
    …for you\. Job Description We are seeking an experienced Site Reliability Engineer \(SRE\) to join our team, responsible for designing, building, and maintaining ... the infrastructure for a new Artificial Intelligence \( AI \) and Machine Learning \(ML\) environment\. As an SRE,...in Spring of 2025 **Basic Qualifications:** \+ Experience with HPC hardware such as GPU\-based systems \(e\.g\., NVIDIA Tesla,… more
    Lockheed Martin (03/30/25)
    - Save Job - Related Jobs - Block Source
  • AI Predictive Machine Learning…

    Skyline Products (CO)
    …processes and improve product offerings. We are looking for a Remote Machine Learning Engineer to join our team, where you will play a key role in developing ... cutting-edge AI and machine learning models that drive innovative solutions...+ Familiarity with containerization and orchestration tools (eg, Docker, Kubernetes ). + Strong problem-solving and analytical skills, with the… more
    Skyline Products (02/05/25)
    - Save Job - Related Jobs - Block Source
  • AI Engineering Manager/Solutions Architect…

    Deloitte (San Francisco, CA)
    …in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI /GenAI infrastructure and design patterns + Define and lead technology ... AI Engineering Manager/Solutions Architect - SFL Scientific SFL...machine learning & automation applications such as ELT functions, HPC /compute infrastructure, hybrid cloud solutions, database management, and optimization… more
    Deloitte (02/15/25)
    - Save Job - Related Jobs - Block Source
  • Associate Director, Sr Principal Systems…

    Bristol Myers Squibb (Princeton, NJ)
    …. **Summary:** Bristol Myers Squibb is looking for an experienced Sr Principal Systems Engineer in HPC / AI infrastructure to work with our technology teams ... and various stakeholders to design, manage, and support cutting-edge HPC / AI infrastructure platforms to serve our community...Collaborating with cross functional teams within BMS, the systems engineer would work our teams to define and execute… more
    Bristol Myers Squibb (04/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Software Engineer , NCCL…

    NVIDIA (Santa Clara, CA)
    …test design + Experience working with engineering or academic research community supporting HPC or AI + Practical experience with high performance networking: ... runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner...applications. We are looking for a motivated Partner Enablement Engineer to guide our key partners and customers with… more
    NVIDIA (04/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Storage and Data Production Engineer

    NVIDIA (Santa Clara, CA)
    …distributed storage systems, and ensuring low-latency data access for high-performance computing ( HPC ) and AI /ML workloads. Production Engineers at NVIDIA ensure ... technologies, working on high-performance storage solutions that power the next generation of AI , HPC , and cloud computing. NVIDIA is leading in groundbreaking… more
    NVIDIA (04/05/25)
    - Save Job - Related Jobs - Block Source
  • Sr Staff Engineer , ML Infrastructure…

    LinkedIn (Mountain View, CA)
    …systems. Experience with containerization (Docker, Singularity) and job schedulers ( Kubernetes , Slurm, or other HPC schedulers). Excellent communication ... About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU...our large-scale GPU infrastructure for machine learning (ML) and AI workloads. In this role, you will be the… more
    LinkedIn (04/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC , datacenters and networking in addition to our traditional OEM business. ... NVIDIA is also well positioned as the ' AI Computing Company', and NVIDIA GPUs are the brains...+ Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/ Kubernetes /Docker) - huge plus Ways to stand out from… more
    NVIDIA (04/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer , Foundation Model…

    NVIDIA (Santa Clara, CA)
    …GPU clusters, HPC environments, and job scheduling/orchestration tools (eg, SLURM, Kubernetes ). Ways to stand out from the crowd: + Master's or PhD's degree ... NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the… more
    NVIDIA (03/08/25)
    - Save Job - Related Jobs - Block Source
  • Staff DevOps Engineer - Hybrid

    Caris Life Sciences (Tempe, AZ)
    …patient deserves answers as unique as their DNA. Backed by cutting-edge molecular science and AI , we ask ourselves every day: _"What would I do if this patient were ... Caris is where your impact begins.** **Position Summary** The Staff DevOps Engineer is responsible for designing, implementing, and maintaining scalable and secure… more
    Caris Life Sciences (04/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Research Engineer for Reinforcement…

    NVIDIA (Santa Clara, CA)
    …GPU clusters, HPC environments, and job scheduling/orchestration tools (eg, SLURM, Kubernetes ). Ways to stand out from the crowd: + Master's or PhD's degree ... NVIDIA is searching for a senior or principal engineer who specializes in large-scale reinforcement learning and policy learning in the Generalist Embodied Agent… more
    NVIDIA (03/08/25)
    - Save Job - Related Jobs - Block Source
  • Solutions Architect, Financial Services

    NVIDIA (CA)
    …KubeFlow, data center deployments etc. Experience working with enterprise developers building AI , HPC , or data analytics applications NVIDIA is widely considered ... (Capital Markets, Consumer Finance, Payments) to accelerate High-Performance Computing and AI workloads across various use cases. We're seeking an inquisitive,… more
    NVIDIA (04/19/25)
    - Save Job - Related Jobs - Block Source