- Cisco (San Jose, CA)
- AI Infrastructure Engineer - HPC Apply (https://jobs.cisco.com/jobs/Login?projectId=1443781) + Location:San Jose, California, US + Alternate ... and communicate advanced technical concepts. A talented and passionate engineer comfortable working in high-pressure, large-scale enterprise environments. **What You… more
- Meta (Menlo Park, CA)
- …and host networking, comms lib and scheduling infrastructure . **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active member ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing uses cases of AI . This results in a dramatic… more
- Meta (Menlo Park, CA)
- …learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure , high performance computing, performance optimizations, or ... space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Tech-leading the… more
- Meta (Menlo Park, CA)
- …in exploring, developing and productizing high-performance software and hardware technologies for AI at datacenter scale.Hardware Systems Engineer in RTP work ... and optimize these systems in production. **Required Skills:** Hardware Systems Engineer , AI Systems Responsibilities: 1. Interface with external vendors… more
- LinkedIn (Mountain View, CA)
- …practices across the company. Strategic Roadmapping Influence the long-term roadmap for ML/ AI infrastructure , factoring in technology trends, product needs, and ... About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU... to design, build, and maintain our large-scale GPU infrastructure for machine learning (ML) and AI … more
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production. **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Drive interfacing with ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP)...a leading contributor 18. 3+ years of experience supporting AI or HPC systems and/or related systems,… more
- Meta (Menlo Park, CA)
- … Engineer , Sustaining Responsibilities: 1. Develop robust, industry leading practices for supporting AI and HPC infrastructure at scale 2. Interface with ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP)...hardware at scale 9. Experience in deploying and productionizing AI / HPC systems and/or related components at scale… more
- Deloitte (San Francisco, CA)
- …availability in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI /GenAI infrastructure and design patterns + Define and lead ... AI Engineering Manager/Solutions Architect - SFL Scientific Our...valuation modeling, cost optimization, restructuring, business design and transformation, infrastructure and real estate, mergers and acquisitions (M&A), and… more
- Meta (Menlo Park, CA)
- …of the following machine learning domains: hardware accelerators, AI Infrastructure , and/or high performance computing ( HPC ), particularly pertaining to ... **Summary:** Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies...HPC workloads). 13. Full-stack experience and understanding of AI / HPC systems, from HW/ infrastructure through… more
- Meta (Menlo Park, CA)
- …learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure , high performance computing, performance optimizations, or ... this role, you will be a member of the Network. AI Software team and part of the bigger DC...Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and… more
- Broadcom (San Jose, CA)
- …team developing high-performance package designs for ASICs for artificial intelligence ( AI ), networking, high-performance computing ( HPC ), and 5G base stations. ... **Job Description:** Broadcom is seeking an experienced package design engineer for complex flip-chip-BGA packages for industry-leading ASICs with high-speed… more
- Google (Mountain View, CA)
- …computing ( HPC ) experience. + GCP experience. + Experience in infrastructure -as-code, eg Terraform + Exposure to productions systems that rely heavily on ... software release lifecycle, eg unit testing, CI/CD and production operations + Use AI for code tools + Work effectively with cross-functional teams of engineers,… more
- Broadcom (San Jose, CA)
- …End-2-End congestion techniques, working experience on debugging Embedded Software, knowledge of HPC and AI /ML data center operational models, deep knowledge of ... Sign-In before you apply.** **Job Description:** Software Field Applications Engineer (FAE) is software technical lead for Broadcom ethernet controllers/Network… more