- Lilly (Indianapolis, IN)
- …life better for people around the world. Come help us unlock the power of HPC and AI based POGPU and Accelerated Compute infrastructure! The Cloud and ... Connectivity organization is seeking experts and leaders in AI and High-Performance Computing ( HPC ), and Nvidia DGX server management. This role will also focus… more
- Meta (Menlo Park, CA)
- …and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI / HPC Networks at scale 2. Provide continual… more
- Mayo Clinic (Rochester, MN)
- …or reimbursement account for flexible coverage. + Vision: Affordable plan with national network . + Pre-Tax Savings: HSA and FSAs for eligible expenses. + Retirement: ... & Speciality Services area is seeking a highly skilled and motivated Tech Spec I HPC Engineer to join the HPC Team. The ideal candidate will have specialized… more
- NVIDIA (Santa Clara, CA)
- …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... runtime designs, and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX,… more
- Federal Reserve Bank (Kansas City, MO)
- …file systems (such as ceph and IBM Spectrum Scale/GPFS) and storage solutions. Development + Design and implement innovative HPC solutions to address evolving ... and accelerator technologies (CUDA, OpenACC). + Experience supporting machine learning and AI workloads on HPC systems. **Additional Information** How We Work… more
- Texas A&M University System (College Station, TX)
- …firmware patching, and performance tuning.* Oversee networking, security, and infrastructure for HPC systems.* Lead the development of specialized HPC ... Job Title Senior HPC Engineer Agency Texas A&M University Department Technology...with container orchestration tools such as Kubernetes* Knowledge of Run: ai for AI workload management* Proficiency with… more
- NVIDIA (Santa Clara, CA)
- …at NVIDIA. We deliver libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of ... our communication libraries. The DL and HPC applications of today have a huge compute demand...space. Are you ready for to contribute to the development of innovative technologies and help realize NVIDIA's vision?… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC . We are looking for a Distinguished Software Architect to help co-design our ... next generation data center platforms. DL and HPC applications have a huge compute demand already and...seen before. Are you ready to contribute to the development of innovative technologies and help realize NVIDIA's vision?… more
- NVIDIA (Santa Clara, CA)
- …join our mission in integrating genomic solutions into mainstream healthcare. As a healthcare HPC engineer, you will join a dynamic development team focused on ... healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as...understand their current and future challenges and provide outstanding HPC solutions. + Collaborate closely with hardware engineering, CUDA… more
- GliaCell Technologies (Annapolis Junction, MD)
- …Application Development , Big Data, Cloud Technologies, Analytics, Machine Learning, AI , and DevOps Containerization. We also provide customer solutions in the ... Are you a Principal HPC Software Engineer who is ready for a...delivering stable and reliable software solutions using Agile Software Development principles. These provide us the capability to deliver… more
- Meta (Menlo Park, CA)
- …The ideal candidate will have experience in AI / HPC product development and operations, demonstrated experience in the Network communications stack for ... operate in a multi-organization landscape. **Required Skills:** Technical Program Manager, AI Network Infra Responsibilities: 1. Lead technical program… more
- NVIDIA (Santa Clara, CA)
- …are looking for a passionate engineer who will solve networking problems for scalable AI clusters. This is a hands-on network engineering position focused on the ... We are seeking a highly skilled Principal Network Engineer to join our dynamic team to...and deployment of global-scale DCs inter-connects and fabric for HPC , AI , and GPU computing clusters. +… more
- Oracle (Seattle, WA)
- …to be the go-to experts on RDMA cluster architecture and its relationship to AI /ML/ HPC performance. We apply our deep understanding of these unique workload ... so our customers can push the cutting edge in AI /ML and other areas of HPC . Join...out performance studies on GPU clusters with focus on AI /ML workload performance, network performance and tuning.… more
- Oracle (Springfield, IL)
- …Oracle's Forward Deployed Engineer (FDE) team is hiring a Senior Principal Software Development Engineer - AI Data Platform to help global customers unlock ... provide expert architectural guidance focused on designing, optimizing, and scaling modern AI /ML-centric data platforms. As a key member of Oracle's Analytics and … more
- Bloomberg (New York, NY)
- …maintaining system software that enables communication between GPUS, CPUs, and storage in scale-out AI and HPC systems. This role will also be responsible for ... overseeing the ongoing monitoring, support, and maintenance of our HPC / AI clusters, ensuring peak performance and reliability. **We'll trust you to:** + Design,… more
- Amazon (Austin, TX)
- …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...diverse AWS Hardware Engineering team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
- Amazon (Austin, TX)
- …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Factories are built to accelerate AI and HPC workloads. At their core the Digital Twin (physics-based model used to design, validate, and operate ... multi-physics simulation, and digital twin integration to lead the development and operation of NVIDIA's AI Factory...to stand out from the crowd + Background in AI / HPC data center cooling, including immersion and… more
- Oracle (Nashville, TN)
- …at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging...field. + 12+ years of total experience in software development . + Proven industry expert in Control Plane, Data… more
- Meta (Menlo Park, CA)
- …many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network and storage. The team invests significantly ... develop and help productionize high performance software & hardware technologies for AI at datacenter scale. We achieve this via concurrent design and optimization… more