• IT InfiniBand/GPU - Sr Staff Systems

    Cadence Design Systems, Inc. (San Jose, CA)
    …(not remote): San Jose, CA Must Haves + 15+ years of experience in system administration and engineering. + Minimum five years overall experience in technical ... or AMD's ROCm + Experience with H100, AMD MI210, GPU servers in Cluster + Customer deployments ... schedulers, and parallel computing. Deployment and operation of large-scale systems; resilient system design; and clustering of…
    Cadence Design Systems, Inc. (03/01/24)
  • Sr Linux System Administrator, SME

    BAE Systems (Vicksburg, MS)
    … consisting of Linux-based application and license servers, virtual machines, and GP/GPU cluster-based systems. Coordination with network administrators ... to support a team of administrators with standard Linux system administration duties at ERDC in Vicksburg, ... Unix operating systems concepts as well as systems administration experience + Ability to develop…
    BAE Systems (04/26/24)
  • Lead Systems Administrator

    Leidos (Dayton, OH)
    … consisting of Linux-based application and license servers, virtual machines, and GP/GPU cluster-based systems. Coordination with network administrators is ... Administrator to support a team of administrators with standard system administration duties at the Engineering Research ... Unix operating systems concepts as well as systems administration experience + Ability to develop…
    Leidos (02/01/24)
  • TS/SCI (FSP) Lead Systems Engineer

    Insight Global (Princeton, NJ)
    …in the Linux operating system and have significant experience with CPU/GPU-based systems, high-performance storage technologies (e.g., Lustre), HPC or High ... a full-scope poly * 6 years of experience in systems administration * Possess advanced, subject matter ... High Performance Computing (HPC) systems or large cluster computing, including GPU-based systems
    Insight Global (04/11/24)
  • VFX Maintenance Engineer/Sys Admin (Disney…

    The Walt Disney Company (Los Angeles, CA)
    …Suite. + A minimum of 5 years Linux, Mac OS and Windows system administration experience including networking and scripting. + Object-orientated programming & ... is preferred. + Strong understanding of storage and networking, NFS, NAS storage cluster, StorNext file system. + 3D rendering workflows including Redshift or…
    The Walt Disney Company (04/10/24)
  • Systems Architect

    Colorado State University (Fort Collins, CO)
    …servers and services. + Automate, provision, and perform a variety of systems administration tasks such as software installation, patching, upgrades, security ... position designs, builds, and develops computing systems, distributed computing systems, clusters, services, and system architectures to support the research…
    Colorado State University (04/03/24)
  • System Administrator

    Stanford University (Stanford, CA)
    …our clients from hardware purchase, hardware installation, and operating system installation, to cluster management, configuration management, application ... System Administrator School of Engineering, Stanford, California, United ... place through the power of engineering principles, techniques and systems. Computer Science Department Founded in 1965, the Stanford Computer…
    Stanford University (02/28/24)
  • Solutions Architect - AI and HPC Cloud

    NVIDIA (Santa Clara, CA)
    …hosts a heterogeneous mix of machines and devices with various operating systems (Windows/Linux/Android), a multitude of hardware platforms both NVIDIA GPUs and ... of NVIDIA products. + Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center…
    NVIDIA (03/21/24)
  • Senior Principal Engineer Site Reliability

    Dell Technologies (Austin, TX)
    …Hyper-Converged infrastructure along with fluency in AI/ML pipelines, Nvidia GPU optimization, InfiniBand networking, Machine Learning operating systems ... Service Delivery will be responsible for providing the primary management, administration, support, and ongoing maintenance of customer Platforms within a 24x7x365…
    Dell Technologies (04/21/24)
  • Production Support Engineer

    Leidos (Bethesda, MD)
    …of the following: + Strong understanding of cloud-based systems + System, database, data stores, and network administration and optimization + ... subsystems + Experience working within technology platforms such as ElasticSearch, Kubernetes cluster, storage systems, etc. and writing code/scripts + Hands-on…
    Leidos (04/21/24)
  • Senior Manager, Professional Services HPC…

    NVIDIA (Santa Clara, CA)
    …interconnect infrastructure (InfiniBand and Ethernet). + Expertise with HPC system software cluster management/provisioning tools, including job schedulers ... Python, etc.) and experience with programming fundamentals. + Expertise with administration, supervising and maintaining secure Linux/Unix operating systems
    NVIDIA (04/06/24)
  • Senior AI Research Computing Engineer

    Harvard University (Cambridge, MA)
    …HPC clusters. + Hands-on experience with HPC systems, including storage, cluster computing, network, database. + Experience with GPU computing platforms for ... cloud-bursting workflows is a plus. + Proficient with git and version control systems. + Demonstrated team performance skills with a service mindset and clear…
    Harvard University (04/13/24)
  • Senior HPC Engineer, Professional Services

    NVIDIA (TX)
    …for hardware and software products. + Knowledge and experience with Linux System Administration, process management, package management, task scheduling, kernel ... team building many of the largest and fastest AI/HPC systems in the world! NVIDIA is looking for someone ... scope of these efforts includes a combination of Networking, System Design and Automation and being the face to…
    NVIDIA (04/17/24)
  • High Performance Computing Engineer III - OIT

    Emory Healthcare/Emory University (Atlanta, GA)
    …various AWS products and their applications + Strong knowledge of the Slurm cluster management software + Experience with specialized computing, like GPU, ... with standard practices in project management (PMI), service management (ITIL), and systems development lifecycle + Strong and precise communications skills and the…
    Emory Healthcare/Emory University (03/19/24)