- NVIDIA (Santa Clara, CA)
- …love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis Engineer to join ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...Deep Learning LLM training focused on collectives communication and networking . You will interact with many types of hardware… more
- NVIDIA (Santa Clara, CA)
- …our team, you'll design and shape the architectures that connect the world's most powerful AI clusters. As an HPC Networking Product Architect at NVIDIA, ... scalability. + Experience working with benchmarking tools and performance analysis for large-scale HPC / AI networking deployments. + Understanding of DPU (or… more
- NVIDIA (Santa Clara, CA)
- …intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and implementation ... doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking , and storage. +… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... of experience crafting and operating large scale compute infrastructure. + Experience with AI / HPC job schedulers and orchestrators, such as Slurm, K8s or LSF.… more
- Lilly (Indianapolis, IN)
- …life better for people around the world. Come help us unlock the power of HPC and AI based POGPU and Accelerated Compute infrastructure! The Cloud and ... Connectivity organization is seeking experts and leaders in AI and High-Performance Computing ( HPC ), and Nvidia...Nvidia DGX systems using Mission Control, Base Command and Run: AI , and optimizing Spectrum X networking and… more
- NVIDIA (Santa Clara, CA)
- …group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, develop, and ... deploy solutions in networking hardware, programming environments, and system software to make...+ Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI,… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M University Department Technology Services - IT Enterprise Operations Proposed Minimum Salary Commensurate Job ... faculty and staff providing cutting-edge research and super computing needs. As a Senior High Performance Computing Engineer ( HPC ), you will provide technical… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking , and storage. + ... tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as...supporting EDA workloads and tools. + Familiarity with High-Speed Networking pertaining to HPC including InfiniBand, RDMA… more
- Massachusetts Institute of Technology (Cambridge, MA)
- Senior HPC Systems Engineer + Job...and optimizing HPC clusters, storage systems, and networking for AI /ML workloads. Join a collaborative, ... Email a Friend Save Save Apply Now Posting Description SENIOR HPC SYSTEMS ENGINEER, The Massachusetts Green...and container orchestration tools like Docker and Kubernetes; and experience in cloud-based HPC or AI /ML workloads.… more
- NVIDIA (Santa Clara, CA)
- …like NCCL, NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC . We're seeking a Senior Software Architect to help co-design next-gen data ... (eg NVLink, PCIe) within a node and with high-speed networking (eg InfiniBand, Ethernet) across nodes. Efficient and fast...+ Design and implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative… more
- Amazon (Seattle, WA)
- …large analytical problems as massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...- Knowledge of the underlying infrastructure requirements such as Networking , Storage, and Hardware Optimization. - Experience in a… more
- University of Rochester (Rochester, NY)
- …of computing and/or infrastructure configuration management at large scales + High performance networking in an HPC setting + Parallel archival storage systems + ... Rochester's Laboratory for Laser Energetics seeks group leader for the High Performance Computing ( HPC ) group. The HPC group provides production HPC systems… more
- NVIDIA (Santa Clara, CA)
- …communication and planning abilities. Experience working with High Performance Computing ( HPC ), GPUs, and high-performance networking (RDMA, Infiniband, RoCE) ... NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a...strong programming background, knowledge of datacenter hardware, operations, and networking , familiarity with software testing and deployment, familiarity with… more
- Johns Hopkins University (Baltimore, MD)
- …daily operation and upkeep of Johns Hopkins University's high-performance computing & AI environments. This role helps maintain the reliability and availability of ... updates, and helping with node configuration under the direction of senior staff. Work involves resolving tickets, performing routine maintenance, and participating… more
- NVIDIA (Santa Clara, CA)
- …Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ground-breaking AI & ... + Understanding of fast, distributed storage systems like Lustre and GPFS for AI / HPC workload. + Experience with networking and communications libraries like… more
- NVIDIA (Santa Clara, CA)
- …is building the world's most groundbreaking and innovative accelerated computing platforms for AI and HPC . Because of our work, scientists, researchers, and ... Cluster Design and Architecture team with a focus on networking technologies. As AI workloads scale to...troubleshooting + Proven expertise in designing large-scale distributed systems, AI clusters, or HPC infrastructure + Ability… more
- NVIDIA (Santa Clara, CA)
- At NVIDIA, we are pioneers in making the impossible achievable, particularly within AI , ML, and HPC . Joining our team as a Storage & Networking Product ... networking architectures for storage environments, ensuring low-latency data paths for AI /ML and HPC workloads. + Configure and tune RDMA, NVMe-over-Fabrics,… more
- NVIDIA (Santa Clara, CA)
- …to help design and deploy cutting-edge NVIDIA networking platforms to run AI and HPC workloads + Address sophisticated and highly visible customer issues ... to join our team! We are searching for a senior networking application engineer with domain expertise...Infiniband and/or NVLINK to help support our groundbreaking, innovative networking technologies that make AI workloads in… more
- NVIDIA (Santa Clara, CA)
- …team and see how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for Datacenter Networking , you will join a dedicated team ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...about delivering outstanding developer and user experiences on NVIDIA's networking hardware and software products. This position in Santa… more
- NVIDIA (Santa Clara, CA)
- …to join the Solutions Architecture team in building the world's largest and fastest AI / HPC systems using NVIDIA Networking . This dynamic role requires ... and interpersonal skills to analyze, define, implement, and troubleshoot large-scale networking projects with customers and internal teams. What you'll be doing:… more