- Meta (Austin, TX)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system , including … more
- Deloitte (Austin, TX)
- …and building secure networks and modern data centers, to enabling the adoption of AI or high- performance computing ( HPC ), you'll gain firsthand experience ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI , deploying high- performance computing ( HPC ) or edge… more
- Texas A&M University System (College Station, TX)
- …patching, and performance tuning.* Oversee networking, security, and infrastructure for HPC systems .* Lead the development of specialized HPC computing ... expertise and consultation for the design and deployment of HPC systems . Get in on the ground...the following:* Experience with High Performance Computing ( HPC ) environments* Advanced Linux system administration skills*… more
- Amazon (Austin, TX)
- …/ aerospace, financial services or pharmaceuticals. - Experience in architecting an HPC system and infrastructure with orchestration schedulers (eg Slurm, ... to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a...in a multi-user environment. - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -… more
- Amazon (Austin, TX)
- …computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more
- Micron Technology, Inc. (Richardson, TX)
- …infrastructure. + Coordinate the management of enterprise SAN, NAS, and cloud storage systems to ensure reliability and performance . + Implement new storage ... learn, communicate and advance faster than ever. As an HPC Staff Engineer at Micron, you will join a...storage environments, including enterprise SAN NAS and cloud storage systems across the company's global infrastructure! Your role will… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... role offers the opportunity to contribute to ground breaking memory solutions for AI , HPC , and data-centric systems . **Responsibilities** + Contribute to… more
- Amazon (Austin, TX)
- …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...the limits of performance , efficiency, and scalability in the cloud, this is… more
- Oracle (Austin, TX)
- …HPC Infrastructure solutions and build an imaging service for Large Scale Compute/ HPC / AI /ML Customer Workloads and performance while providing strong ... is your opportunity to play a role in the Compute/ HPC / AI /ML industry movement on the Windows platform....of developing and shipping enterprise distributed and/or cloud native systems 3. Strong grasp of system design… more
- Micron Technology, Inc. (Richardson, TX)
- …designing and optimizing High Bandwidth Memory (HBM) products for AI /ML, high- performance computing ( HPC ), and data-centric systems , collaborating across ... Introduction** Micron's Heterogeneous Integration Group (HIG) is shaping the future of AI and accelerated computing by developing advanced memory solutions. The team… more
- Oracle (Austin, TX)
- …Responsibilities + Lead architecture, system design, and implementation for high- performance RDMA solutions across OCI's AI / HPC platforms, including ... If you thrive at the intersection of large-scale distributed systems , high-speed networking, and AI workloads, this... performance tuning at scale. + Familiarity with AI / HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or… more
- Amazon (Austin, TX)
- …and operating AWS cloud offerings that enable high performance and scalability in AI /ML and HPC workloads. You are intrigued by the continuous release of ... Want to do industry leading work delivering continuous price performance improvements in the cloud for AI ...have tremendous interest in cloud scale and curious how systems and software decisions impact the user. You insist… more
- Amazon (Austin, TX)
- …design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI /ML and HPC workloads. If you're passionate about pushing ... Do you want to shape the future of Generative AI at AWS? Join the team building the foundation...the limits of performance , efficiency, and scalability in the cloud, this is… more
- Oracle (Austin, TX)
- …designs, troubleshooting, and best practices. + Stay current with emerging trends in AI infrastructure, agent frameworks, HPC systems , and cloud-native ... learning, LLM applications, and agentic AI . Our team builds real-world AI systems and deploys scalable, production-ready solutions across Oracle's enterprise… more
- SHI (Austin, TX)
- …platforms and compatible OEM solutions for AI training, inference, and high- performance computing ( HPC ) workloads. + Work with and assess customer ... HPC workloads. + Proven expertise with NVIDIA DGX systems , GPU technologies (eg, CUDA, NCCL), and AI...systems . + Experience creating BOMs, sizing infrastructure for AI workloads, and performing cost/ performance analysis. +… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more
- Deloitte (Dallas, TX)
- …Engineer, Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack The wage range for this role takes into ... drug discovery, optimizing population health and clinical trials, autonomous systems and edge AI , and renewable energy....on prem + Adopt best engineering practices in automation, HPC and AI /GenAI infrastructure and design patterns… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high- performance GPU platform designed to support AI /ML/ HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI /ML/ HPC workloads across thousands of GPUs, leveraging… more