- SpaceX (Hawthorne, CA)
- Sr . HPC Systems Engineer (Top Secret Clearance) Hawthorne, CA Apply SpaceX was founded under the belief that a future where humanity is out exploring the ... extended hours and weekends as needed. COMPENSATION AND BENEFITS: Pay Range: SR . HPC Systems Engineer : $160,000.00-$220,000.00/per year Your actual level… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems . This role involves working on collective operations - the fundamental ... kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or... is valued, and experience with high-speed networking or HPC interconnects is valued highly. If you like solving… more
- NVIDIA (Santa Clara, CA)
- …doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC ...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/ HPC infrastructure supplemented with cloud computing to support the growing needs… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
- NVIDIA (Santa Clara, CA)
- …efficiency, and performance and drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are ... responsible for the big picture of how our systems relate to each other, we use a breadth...and support workload and resource schedulers in a large-scale HPC environment. + Automate Everything: Develop automation scripts to… more
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux… more
- Amazon (San Diego, CA)
- …in 2025. To achieve our ambitious goals, we're expanding our team and looking for a Senior Engineer to lead the development of a new EC2 Service critical to ... scale our current and next-generation Machine Learning (ML) and HPC Platforms. You will also be a technical leader...scale. With the extensive network and access to Principal, Sr . Principal and Distinguished Engineers across EC2, AWS and… more
- The Walt Disney Company (Emeryville, CA)
- We seek a Senior Storage Engineer who is passionate about building and maintaining data storage solutions in our creative studio environment, and who is ... on our studio. **RESPONSIBILITIES:** + Build and support our on-prem HPC storage systems + Develop software tools that enhance storage administration, automate… more
- LinkedIn (Mountain View, CA)
- …hybrid in LinkedIn's Sunnyvale, CA campus. About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU ... experience. 8+ years of experience designing and managing large-scale, distributed systems or HPC environments, with at least 3+ years focused on GPU-based ML… more
- Micron Technology, Inc. (San Jose, CA)
- …world to learn, communicate and advance faster than ever. Micron is hiring a Sr . Principal, System Architect - Pathfinding Engineer in the Compute and Networking ... their implementation over hardware configurations such as CPU, SOC, and GPU systems . + Identify bottlenecks in current technologies through detailed system and… more
- Amazon (Sunnyvale, CA)
- …performance across our product line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux OS boot sequencing, Kernel, ... High performance computing workloads. The Nitro High Memory and HPC team owns the purpose built platform development for...with supporting script and tests in Python and Lua. Senior Software Development Engineers work closely with EC2 Principal… more
- Qualcomm (San Diego, CA)
- …lookout for passionate innovators who thrive on building sophisticated, large-scale systems designed for peak performance and unwavering reliability. With your ... extensive experience in distributed systems architecture and networking, you'll collaborate closely with our...with Windows, Linux, and VMware Preferred: + Experience with HPC Grid data center environments + Experience in GPU… more
- Amazon (Cupertino, CA)
- …range of applications including databases, web services, games, video encoding, ML, and HPC workloads. This doesn't mean you have or will have all those skills, ... other open source projects - Develop analysis frameworks and automation systems Tool(s) Development - Enhance APerf (our open-source Rust-based performance tool)… more
- Amazon (Cupertino, CA)
- …cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. AWS Infrastructure Services owns the design, planning, delivery, and ... goal of improving the current customer experience as well as developing improved systems for future designs. You will work directly with vendors and ODM/JDM design… more
- Wells Fargo (San Francisco, CA)
- **About this role:** We are seeking a High-Performance Computing ( HPC ) Engineer with experience in Machine Learning to optimize and scale AI/ML workloads. The ... this role, you will:** + Design, develop, and optimize HPC solutions for large-scale ML workloads. + Optimize data...domains + Assure quality, security and compliance for supported systems and applications + Serve as a technical resource… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... is open to on-site and hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing Services… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- NVIDIA (Santa Clara, CA)
- …Experience with Cloud Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC workloads. + Familiarity ... diverse team today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the...automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of… more
- NVIDIA (Santa Clara, CA)
- …team and see how you can make a lasting impact on the world. As a Senior Technical Marketing Engineer for AI Infrastructure, you will join a dedicated team that ... advancement of datacenter GPUs and large scale GPU computing systems . What you will be doing: + Evaluate and...+ Proficiency in Python and C++ for AI and HPC applications. + Experience using large scale multi node… more