- Ford Motor Company (Austin, TX)
- …leadership position is focused mostly on the support of our AI / ML platform , but shares in workload across the HPC service as most of our platforms ... leverage shared resources. Key to our AI / ML platform are Kubernetes and...experience + 4 years of experience managing high-performance computing ( HPC ) and AI / ML infrastructure platforms,… more
- Amazon (Austin, TX)
- …the following programming languages: C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI / ML frameworks. Preferred Qualifications - An advanced ... National Super Computing Centers , Government agencies , and/or AI / ML , CAE , Weather and accelerated...map these to solutions. - Experience in architecting an HPC platform with scheduling middleware (eg Slurm,… more
- Oracle (Austin, TX)
- …We are seeking a Principal Software Developer (IC4) with deep expertise in AI / ML system design, large-scale data engineering, and applied intelligence to help ... - operating at OCI's hyperscale. **Responsibilities** + Design and build distributed AI / ML services that enable anomaly detection, event correlation, RCA… more
- Amazon (Austin, TX)
- …delivering and operating AWS cloud offerings that enable high performance and scalability in AI / ML and HPC workloads. You are intrigued by the continuous ... for AWS Cloud. About the team The Hardware Engineering AI / ML development team is a...is the world's most comprehensive and broadly adopted cloud platform . We pioneered cloud computing and never stopped innovating… more
- Amazon (Austin, TX)
- …delivering and operating AWS cloud offerings that enable high performance and scalability in AI / ML and HPC workloads. Utility Computing (UC) AWS Utility ... Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build...management of Compute, Database, Storage, Internet of Things (IoT), Platform , and Productivity Apps services in AWS, including support… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of...governance + Cloud infrastructure: OCI, AWS, Azure, Google Cloud Platform (GCP) + Operating Systems: Linux, MacOS + Scripting… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of...highly distributed service infrastructure. + Experience working in cloud platform (s) (AWS, OCI, GCP, Azure etc). + Experience in… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... triage automation, and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of GPUs, leveraging technologies… more
- Deloitte (Austin, TX)
- We are seeking an accomplished HPC / AI Platform Engineering Manager to lead the design, implementation, and optimization of advanced computing environments ... ideal for a hands-on technologist with deep expertise in HPC systems, GPU-accelerated infrastructure, and large-scale AI ...infrastructure blueprints supporting secure, high-throughput AI workloads. AI / ML & LLM Platform Enablement… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... automation, and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of GPUs, leveraging technologies like… more
- Meta (Austin, TX)
- …1. Lead technical program management of next-generation Artificial Intelligence/Machine Learning ( AI / ML ) platform (s) for Meta's Network Infrastructure in ... product introductions and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps . They will be responsible for… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... continues to meet the rapidly evolving demands of both Enterprise and AI / ML customers. + Ensure reliability and customer satisfaction through proactive issue… more
- Oracle (Austin, TX)
- …triage automation, and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of GPUs, leveraging technologies ... to scale and optimize Monitoring and Repair solutions for AI infrastructure components like GPU control plane and GPU...governance + Cloud infrastructure: OCI, AWS, Azure, Google Cloud Platform (GCP) + Operating Systems: Linux, MacOS + Scripting… more
- Oracle (Austin, TX)
- …senior software engineers and applied ML developers building the next-generation AI -driven operations platform for OCI. + Partner with Network Engineering, ... fabric** , supporting millions of devices, multi-region interconnects, and high-performance compute ( HPC / AI /GPU) environments. + Integrate ML and LLM-based… more
- Oracle (Austin, TX)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI / ML / HPC workloads. This is your chance to be part of ... automation, and diagnostic services. These are essential for running distributed AI / ML / HPC workloads across thousands of GPUs, leveraging technologies like… more