- Meta (New York, NY)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system, including …
- Bloomberg (New York, NY)
- …and maintenance of our HPC/AI clusters, ensuring peak performance and reliability + Drive system upgrades, customization, and seamless integration ... enables communication between GPUs, CPUs, and storage in scale-out AI and HPC systems. This...overseeing the ongoing monitoring, support, and maintenance of our HPC/AI clusters, ensuring peak performance …
- Deloitte (New York, NY)
- …and building secure networks and modern data centers, to enabling the adoption of AI or high-performance computing (HPC), you'll gain firsthand experience ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI, deploying high-performance computing (HPC) or edge…
- IBM (New York, NY)
- …in system design * Experience with GPU Systems * Familiarity with HPC system performance evaluation. * Familiarity with system architectures * ... technical areas in the context of hybrid cloud, AI systems, networking, security, high-speed networked-storage, accelerators, and HPC principles. The…
- Oracle (Trenton, NJ)
- …HPC Infrastructure solutions and build an imaging service for Large Scale Compute/HPC/AI/ML Customer Workloads and performance while providing strong ... is your opportunity to play a role in the Compute/HPC/AI/ML industry movement on the Windows platform....of developing and shipping enterprise distributed and/or cloud native systems 3. Strong grasp of system design…
- Mount Sinai Health System (New York, NY)
- …Computational Scientist works with scientists and researchers to effectively use MSSM HPC (high performance computing) clusters; researches and deploys codes on ... computing environment or equivalent experience. + Experience in batch HPC cluster environment with a parallel file system...System is one of the largest academic medical systems in the New York metro area, with more…
- Oracle (Trenton, NJ)
- …Responsibilities + Lead architecture, system design, and implementation for high-performance RDMA solutions across OCI's AI/HPC platforms, including ... If you thrive at the intersection of large-scale distributed systems, high-speed networking, and AI workloads, this... performance tuning at scale. + Familiarity with AI/HPC stacks and workloads: NCCL/RCCL/MPI, Slurm or…
- SHI (Trenton, NJ)
- …platforms and compatible OEM solutions for AI training, inference, and high-performance computing (HPC) workloads. + Work with and assess customer ... HPC workloads. + Proven expertise with NVIDIA DGX systems, GPU technologies (e.g., CUDA, NCCL), and AI...systems. + Experience creating BOMs, sizing infrastructure for AI workloads, and performing cost/performance analysis. +…
- Oracle (Trenton, NJ)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the ... AI revolution, creating systems that allow customers...and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging…
- Deloitte (Morristown, NJ)
- …Engineer, Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack The wage range for this role takes into ... drug discovery, optimizing population health and clinical trials, autonomous systems and edge AI, and renewable energy....on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns…
- Deloitte (Morristown, NJ)
- …Engineer, Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack The wage range for this role takes into ... in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns + Define and lead technology…
- Oracle (Trenton, NJ)
- …network fabric, supporting millions of devices, multi-region interconnects, and high-performance compute (HPC/AI/GPU) environments. + Integrate ML ... Development Team within OCI's Network Availability organization. This team builds the AI, analytics, and automation systems that power OCI's self-healing cloud…
- Oracle (Trenton, NJ)
- …conversational search, and summarization. + Work with Oracle Vector Database and other retrieval systems to optimize AI performance. + Build and optimize ... Generative AI, and intelligent agent-driven applications. **Responsibilities** **AI & LLM System Development** + Design,...close gaps. + Stay current with emerging trends in AI infrastructure, agent frameworks, HPC systems…
- Oracle (Trenton, NJ)
- …networking, HPC, or GPU infrastructure. + Expertise in designing data feedback systems that improve AI model performance through continuous learning. + ... a Principal Software Developer (IC4) with deep expertise in AI/ML system design, large-scale data engineering, and...platforms. In this role, you will design and deliver AI-powered systems for predictive incident detection, automated…
- SHI (Somerset, NJ)
- …architecture, presales engineering, or datacenter solution design, including 5+ years dedicated to AI infrastructure or HPC systems. + Strong understanding ... objectives and technical specifications. - Intermediate + Ability to design comprehensive AI system structures and frameworks to meet complex requirements. -…
- Oracle (Trenton, NJ)
- …the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the ... AI revolution, working with systems that allow...scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing…
- Oracle (Trenton, NJ)
- …+ Optimize performance, scalability, and reliability of distributed data/AI systems. + Collaborate with cross-functional teams (engineering, product, ... Engineer (FDE) team is hiring a Senior Principal Software Development Engineer - AI Data Platform to help global customers unlock the full potential of their…