- Cisco (Research Triangle Park, NC)
- …support for your customers. **Why Cisco?** At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and ... and processes available. Who You Are You are an experienced Lead Engineer in artificial intelligence, machine learning, data analytics, software engineering, and… more
- NVIDIA (Santa Clara, CA)
- …and intelligence. Make the choice to join us today! As a member of the GPU AI / HPC Infrastructure team, you will provide leadership in the design and ... 5+ years of experience designing and operating large scale compute infrastructure + Experience with AI / HPC advanced job schedulers, such as Slurm, K8s, PBS,… more
- NVIDIA (Santa Clara, CA)
- …Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation ... innovation and collaboration. Within this mission, our team, Managed AI Superclusters (MARS) builds and scales the infrastructure...systems covering metrics, logs, traces, and events for GPU-powered AI and HPC workloads. + Build large-scale… more
- NVIDIA (Santa Clara, CA)
- …+ Minimum of 6 years of experience crafting and operating large scale compute infrastructure . + Experience with AI / HPC job schedulers and orchestrators, such ... learning and staying ahead of new technologies and effective approaches in the HPC and AI /ML infrastructure fields. Ways to stand out from the crowd: +… more
- NVIDIA (Santa Clara, CA)
- …generative machine learning models in digital biology and beyond + Collaborate with multiple HPC , AI infrastructure , and research teams + Develop tools to ... NVIDIA has become the platform upon which every new AI -powered application is built. We are seeking a Sr. HPC Performance engineer to join our team of… more
- Johns Hopkins University (Baltimore, MD)
- …will design, build, and support Johns Hopkins University's high-performance computing and AI research infrastructure . This role integrates elements of both ... and Design** + Develop and refine deployment strategies for scientific software on HPC and AI systems. + Design computational workflows, selecting optimal… more
- Meta (New York, NY)
- …host networking, communications lib and scheduling infrastructure . **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI . This results in a dramatic… more
- Meta (Menlo Park, CA)
- …and host networking, comms lib and scheduling infrastructure . **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active member ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI . This results in a dramatic… more
- Johns Hopkins University (Baltimore, MD)
- …mission. This position focuses on the reliable operation, configuration, and optimization of HPC and AI systems, including multi-node CPU and GPU clusters, ... + Expertise with architecting, operating, and debugging large scale HPC network and storage infrastructure , including MPI,...Authority + Systems Integration - Authority Classified Title: Sr. HPC Systems Engineer Job Posting Title (Working… more
- University of Pennsylvania (Philadelphia, PA)
- …system benchmarking and develop automated testing to ensure a robust and efficient HPC infrastructure . + Maintain job scheduling systems and enforce storage ... programs and resources, and much more. Posted Job Title HPC Systems Engineer Job Profile Title Systems...to join the team. PARCC's main cluster (Betty), delivers HPC , data-intensive science and Artificial Intelligence ( AI )… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is hiring engineers to scale up its AI Infrastructure . We expect you to have a strong programming background, knowledge of datacenter hardware, ... hardware fleet management systems. Proven operational excellence in designing and maintaining AI infrastructure NVIDIA is widely considered to be one of… more
- NVIDIA (Santa Clara, CA)
- …continual learning and staying ahead of new technologies and effective approaches in the HPC infrastructure fields. Ways to stand out from the crowd: + ... world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and...tools such as BCM or Ansible. + Experience with AI / HPC job schedulers and orchestrators, such as… more
- Google (Kirkland, WA)
- Staff Software Engineer , HPC Solutions _corporate_fare_ Google _place_...by leading the convergence of AI and HPC . The AI and Infrastructure ... We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M University Department Technology Services - IT Enterprise Operations Proposed Minimum Salary Commensurate Job ... cutting-edge research and super computing needs. As a Senior High Performance Computing Engineer ( HPC ), you will provide technical expertise and consultation for… more
- Capgemini (Phoenix, AZ)
- …the job you're considering** Capgemini is seeking an experienced Senior Cloud & HPC Engineer to accelerate research computing initiatives in the biotech and ... secure, and efficient solutions for research workflows on cloud and High-Performance Computing ( HPC ) environments. **Your role** + Design and maintain HPC and… more
- Capgemini (Boston, MA)
- …_Developer_ **Organization:** _ERD PPL US_ **Title:** _Senior Cloud & HPC Engineer -BioTech Research_ **Location:** _DC-Washington_ **Requisition ID:** _082255_ ... for research workflows on Google Cloud Platform (GCP) and High-performance Computing ( HPC ) environments. Your work will directly enable scientists to push the… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI /ML systems. This role involves working on collective operations - the fundamental ... operations that enable AI to scale across multiple accelerators & servers. Most...systems is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
- GliaCell Technologies (Annapolis Junction, MD)
- Are you a Principal HPC Software Engineer who is ready for a new challenge that will launch your career to the next level? + Tired of being treated like a ... work with some amazingly talented people Job Description: GliaCell is seeking a Principal HPC Software Engineer on one of our subcontracts. This is a full-time… more
- University of Maine System (Orono, ME)
- …of the Graduate School. The position will have an active role in shaping what HPC resources are available, who can access those resources, how to get data to those ... under construction, that will focus on revolutionizing manufacturing through AI -enabled, large-scale bio-based advanced manufacturing. Typical hiring range is… more
- Amazon (Arlington, VA)
- …technologies in a multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC services. - Experience ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...the cloud computing delivery model as it relates to HPC . - Knowledge of the underlying infrastructure … more