- Amazon (Cupertino, CA)
- …empowers you to own them to completion. The Core Networking team is looking for a Network Development Engineer to join our Network Fabric Engineering ... (NFE) team. As a Network Development Engineer, you will be responsible for building, deploying and scaling the Amazon networks that support AWS, customers,… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of...space. Are you ready to contribute to the development of innovative technologies and help realize NVIDIA's vision?… more
- NVIDIA (Santa Clara, CA)
- …to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering ... intelligence to autonomous cars. We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to… more
- Broadcom (San Jose, CA)
- …Develop, analyze, debug, and enhance software solutions leveraging Broadcom's proprietary Software Development Kit (SDK) optimized for AI/ML and HPC market ... users adopting the latest Broadcom switch platforms and emerging network technologies optimized for AI and HPC workloads. + Contribute to hardware and low-level software… more
- Amazon (Cupertino, CA)
- …have extensive experience in low-latency networking and collective operations, such as HPC network fabric or machine learning accelerator cluster systems. Also ... on building networking solutions for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS. We are seeking an experienced engineer… more
- Meta (Menlo Park, CA)
- …hardware requirements and specifications (e.g., configuring hardware components, GPU, memory, network for AI/HPC workloads). 14. Understanding of the transport ... **Summary:** Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies group, supporting the development of Meta's… more
- NVIDIA (Santa Clara, CA)
- …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. ... experience, reliability testing with various telemetries, scale out cluster, test plan development, track record in developing AI tools and NLP, DevOps, CI/CD… more
- Meta (Menlo Park, CA)
- …Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on ... (e.g., Large-Scale GenAI/LLM training) from the trainer down to the inter-GPU and network communication layer. And we are seeking engineers to work on the… more
- Meta (Menlo Park, CA)
- **Summary:** In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on… more
- Meta (Menlo Park, CA)
- …operating Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and ... actively seeking Software Engineers to help build and scale our rapidly evolving network infrastructure. We are looking for Software Engineers with a passion for… more
- quadric.io, Inc (Burlingame, CA)
- …code and conventional C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and hands-on users ... (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint… more
- quadric.io, Inc (Burlingame, CA)
- …code and conventional C++ DSP and control code. Role: The Field Application Engineer (FAE) will work closely with Business Development, Product, and Engineering ... co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of...expert-level skills with the in-house built product line including HPC Hardware (IP, Chips, Boards), SDK, Algorithms (NN, DSP,… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data centers are ... software and hardware technologies for AI at datacenter scale. Hardware Systems Engineer in RTP work closely with HW/SW co-design teams, hardware designers,… more
- Microsoft Corporation (Mountain View, CA)
- …business productivity. + Promote a culture of innovation and creativity, fostering the development of a high-performance HPC team. + Integrate the latest GenAI ... **Responsibilities** + Lead and manage a global team of HPC professionals, ensuring the delivery of high-quality solutions, roadmap...solutions into silicon development flows to enhance design and verification processes. +… more
- Amazon (Cupertino, CA)
- …cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, delivery, and ... You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers,...specific function, you will own and lead the design, development and root cause of a new segment of… more
- Broadcom (San Jose, CA)
- …please Sign-In before you apply.** **Job Description:** Software Field Applications Engineer (FAE) is software technical lead for Broadcom ethernet controllers/ ... Network Interface Cards targeted towards Enterprise, Server & Storage...engineering and factory applications team. Desired skills include driver development, embedded software development, strong coding skills… more
- Amazon (Sunnyvale, CA)
- …as a Technical Product Manager experience - 5+ years of technical (software development, network development, IT, other related) experience - Experience ... across a range of EC2 products across compute, storage, network and accelerated computing. You will be responsible for...including AI/ML, generative AI, databases, Big Data analytics, SAP, HPC, Edge, and more. This is a unique opportunity… more