- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
- Cisco (San Jose, CA)
- Principal Engineer - HPC, AI Infrastructure Apply (https://jobs.cisco.com/jobs/Login?projectId=1445895) + Location: San Jose, California, US + Area of ... maintain device drivers and runtime components for GPU and network components of the systems. + Working...PhD is a plus, especially with research in GPU systems, compilers, or HPC. **Message to applicants… more
- NVIDIA (Santa Clara, CA)
- …high-performance environments. + Published work, patents, or advanced certifications in networking or HPC systems. NVIDIA is widely considered to be one of the ... engine of modern Artificial Intelligence, Advanced Networking, and High Performance Computing (HPC) - the biggest technology breakthroughs of our time. We're on a… more
- NVIDIA (Santa Clara, CA)
- …familiarity with software testing and deployment, familiarity with distributed systems, and excellent communication and planning abilities. Experience working with ... High Performance Computing (HPC), GPUs, and high-performance networking (RDMA, InfiniBand, RoCE) are strongly preferred. We also welcome out-of-the-box thinkers who… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- NVIDIA (Santa Clara, CA)
- …HPC/AI clusters at scale, with hands-on expertise with network topologies and large-scale switch/router deployments. + Familiarity with network ... making the impossible achievable, particularly within AI, ML, and HPC. Joining our team as a Storage & Networking Product Engineer involves being part of a group that fosters… more
- Stanford University (Stanford, CA)
- …researchers from a variety of Stanford and SLAC organizations. The majority of the HPC systems are hosted in the Stanford Research Computing Facility (SRCF), ... Research Data Center Facility Engineer **Business Affairs: University IT (UIT), Stanford, California,...Stanford Research Computing. Research Computing offers High Performance Computing (HPC) hosting services, computational and data systems,… more
- NVIDIA (Santa Clara, CA)
- …wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and network ... crew that develops and maintains software for complex heterogeneous computing systems that power disruptive products in High Performance Computing and Deep… more
- NVIDIA (Santa Clara, CA)
- …Docker containers & Jenkins pipelines + Certifications in storage (e.g., SNIA) or HPC systems or Storage Performance experience with mdtest or FIO tool. ... be. We are looking for a Senior Software Validation Engineer to lead software validation activities in the Datacenter...streamlining our testing processes. + Validation of distributed Storage systems (e.g., Lustre) on AI/HPC Datacenter scale… more
- Meta (Menlo Park, CA)
- …Meta's global data center networks. Our work covers the entire network lifecycle, including hardware development, capacity planning, distributed and centralized ... control systems, modeling/provisioning/automation, monitoring/troubleshooting/analytics, and simulation/design/failure analysis. We are actively seeking Software… more
- quadric.io, Inc (Burlingame, CA)
- …battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the ... co-optimized software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of...C++ DSP and control code. Role: The Corporate Applications Engineer is the key bridge between development engineering and… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's Enterprise Product Engineering involves crafting, constructing, and maintaining vital systems efficiently and reliably. As a Senior Storage Product ... Engineer, you will take ownership of NVIDIA's Product Team's...environments. We focus on delivering high-performance, highly available storage systems that scale while enabling developers to innovate rapidly… more
- Broadcom (San Jose, CA)
- …experience. 2. Significant experience in RDMA protocol, QoS, Packet Classifications, Linux Systems programming, Linux kernel, Linux Network Drivers, Linux Kernel ... join the NIC product development team. As a Software Engineer, you will be responsible for designing and development...Experience analyzing and tuning performance for a variety of HPC workloads. 7. Excellent programming skills in C, C++… more
- NVIDIA (Santa Clara, CA)
- …GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. ... OS, FW and CUDA SW stack from design doc. + Installing and testing various systems OS, server firmware and SW stack. + Drive support for root cause analysis on… more
- NVIDIA (Santa Clara, CA)
- …a discipline that involves designing, building, and maintaining large-scale production systems with high efficiency and availability. It encompasses various areas, ... including software and systems engineering practices, storage, data management, and services. Production..., and ensuring low-latency data access for high-performance computing (HPC) and AI/ML workloads. Storage Production Engineers at NVIDIA… more
- Amazon (Cupertino, CA)
- …cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, delivery, and ... to help. You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital… more
- NVIDIA (Santa Clara, CA)
- …impact NVIDIA's ability to deliver robust, secure, and high-performing solutions for AI, HPC, and cloud-scale systems. You will: + Define End-to-End Test ... and data center offerings. If you are a dedicated engineer with a deep understanding of firmware and data center systems, and you thrive in an exciting, innovative environment,… more
- Google (Sunnyvale, CA)
- …analysis and experience in performance modeling of High-Performance Computing (HPC) interconnect topologies. + Knowledge of computer architecture (Tensor Processing ... Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have...enable cost effective performance and power of future ML systems such as fast iteration and innovation for ML… more