• Senior AI-HPC Cluster…

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop… more
    NVIDIA (04/02/25)
  • Senior AI-HPC Storage…

    NVIDIA (Santa Clara, CA)
    …Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/HPC infrastructure supplemented with cloud computing to support the growing needs… more
    NVIDIA (02/05/25)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …efficiency, and performance and drive foundational improvements and automation to improve engineer's productivity. As a Site Reliability Engineer, you are ... + Manage and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to automate deployment,… more
    NVIDIA (04/04/25)
  • Senior HPC Technical Support…

    NVIDIA (Durham, NC)
    We are seeking a motivated Senior HPC Support Engineer - Ethernet, passionate about data center and networking technologies, to provide comprehensive ... problems for customers installing our products and supporting systems using Linux Operating Systems (multi-distro), with the focus on NVIDIA Ethernet Switching… more
    NVIDIA (02/05/25)
  • Sr Core Infrastructure Engineer HPC

    Children's Mercy Kansas City (Kansas City, MO)
    …we can improve the lives of children beyond the walls of our hospital. Overview Senior Core Infrastructure Engineer - HPC plans, designs, implements and ... and turning them into working systems and services. The Senior Core Infrastructure Engineer - HPC...and skills to perform advanced management and troubleshooting for Linux Servers in a hybrid environment. Requires knowledge of… more
    Children's Mercy Kansas City (03/28/25)
  • Sr. Software Development Engineer

    Amazon (Cupertino, CA)
    Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental ... C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience with...systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
    Amazon (03/21/25)
  • Software Development Engineer, Nitro High…

    Amazon (Sunnyvale, CA)
    …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...with systems knowledge and experience in area such as Linux OS boot sequencing, Kernel, Hypervisor (Xen or KVM),… more
    Amazon (04/29/25)
  • HPC Operations Manager - Hardware…

    NVIDIA (Santa Clara, CA)
    …to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering ... intelligence to autonomous cars. We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to… more
    NVIDIA (03/12/25)
  • Senior Linux IT Systems…

    Lockheed Martin (Orlando, FL)
    …IT Systems Engineer with a focus on supporting Red Hat-based Linux distributions, Kubernetes and High Performance Computing (HPC) clusters. * Linux ... * Maintain operating system cyber compliance, upgrades, and patching. (Windows/Linux/VMWare) * Perform guidance and management for hardware, including… more
    Lockheed Martin (04/06/25)
  • Senior Systems Administrator - Windows/…

    Mount Sinai Health System (New York, NY)
    …Mount Sinai. The Administrator is the principal technology expert for Windows and Linux systems, and help support high-performance computing (HPC) environment in ... and a research data services team. The Senior Systems Administrator/Engineer, as a member of the Scientific Computing and...TSM system is integrated with the 25,000-core, 30 petabyte HPC system. This position reports to the Director for… more
    Mount Sinai Health System (03/25/25)
  • Linux System Administrator IV (Server)

    Leidos (Annapolis Junction, MD)
    …Administrator IV - Server for a new customer on a strategic High-Performance Computing (HPC) program. The Senior System Administrator will need to be a ... judgment, and the ability to work within a team to mature the HPC capabilities of our customer. Primary Responsibilities: + Responsible for overseeing the most… more
    Leidos (04/10/25)
  • Senior High Performance Computing…

    SLAC National Accelerator Laboratory (Menlo Park, CA)
    Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular SLAC Job Postings About SLAC: The SLAC National ... is open to on-site and hybrid work options. Position Overview: As a Senior High Performance Computing Engineer in the Scientific Computing Services… more
    SLAC National Accelerator Laboratory (04/26/25)
  • Senior System Software Engineer

    NVIDIA (Santa Clara, CA)
    …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... to guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking (Infiniband, RoCE, Ethernet). This is an… more
    NVIDIA (04/22/25)
  • Senior DevOps Engineer - Accelerated…

    NVIDIA (Westford, MA)
    …Work (https://www.glassdoor.com/Award/Best-Places-to-Work-LST_KQ0,19.htm) by Glassdoor. We are looking for a Senior DevOps Engineer to join our team, although ... Site Reliability Engineer, Build and Release Engineer, Continuous Integration Engineer.. all can be valid titles for this role. Our team builds software that… more
    NVIDIA (04/25/25)
  • Senior Simulation & Analysis…

    Belcan (Greensboro, NC)
    Senior Simulation & Analysis Engineer Job Number:...Automotive experience a plus * Experience with working on Linux/HPC systems * A minimum of 6 years" ... 357016 Category: Design Analysis Description: Job Title: Senior Simulation & Analysis Engineer Location: Greensboro,...dynamics simulations is a plus * Highly experienced in Linux OS and use of HPC systems… more
    Belcan (04/28/25)
  • Senior Software Engineer, GPU…

    NVIDIA (Santa Clara, CA)
    …the next wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and ... communication runtimes for Deep Learning frameworks (eg NCCL for TensorFlow/Pytorch) and HPC programming interfaces (eg UCX for MPI/OpenSHMEM) on GPU clusters. +… more
    NVIDIA (04/25/25)
  • Associate Director, Sr Principal Systems…

    Bristol Myers Squibb (Princeton, NJ)
    … Summary: Bristol Myers Squibb is looking for an experienced Sr Principal Systems Engineer in HPC/AI infrastructure to work with our technology teams and ... Collaborating with cross functional teams within BMS, the systems engineer would work our teams to define and execute...managers and schedulers, ideally Slurm but experience with other HPC schedulers should be acceptable. + Linux… more
    Bristol Myers Squibb (04/25/25)
  • Systems Engineer/Administrator Staff

    Lockheed Martin (Springfield, VA)
    …be able to fully utilize modern HPC systems. We are seeking a senior level Systems Engineer/Administrator Staff with Linux and Software Configuration ... corporate responsibility. Your Mission is Ours. Lockheed Martin provides Red Hat Enterprise Linux (RHEL)/SE Linux based HPC services throughout the… more
    Lockheed Martin (04/09/25)
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. ... improvement. This candidate must have enterprise server integration, strong Linux experience, reliability testing with various telemetries, scale out cluster,… more
    NVIDIA (04/16/25)
  • Senior Datacenter Product Development…

    NVIDIA (Santa Clara, CA)
    …Ethernet, I3C/I2C, SPI, USB, etc. + In depth understanding of HPC server architecture and Out-of-Band management + Strong problem-solving and trouble-shooting ... in defining test and validation specifications for complex HW systems or HPC servers + Motivated to continually improve/optimize processes + Self-initiative, strong… more
    NVIDIA (04/19/25)