- NVIDIA (Santa Clara, CA)
- …for emerging AI workloads. From debugging performance bottlenecks in thousand-GPU distributed systems to influencing next-generation hardware design, we push ... : CUDA optimization, GPU programming, numerical libraries (cuBLAS, NCCL), or distributed computing. + Compiler engineering background: LLVM, GCC, domain-specific… more
- Amazon (East Palo Alto, CA)
- …as a mentor, tech lead or leading an engineering team - Experience with distributed computing and enterprise-wide systems Proficiency in at least one ... base. You'll bring a passion for innovation, data, search, analytics, and distributed systems. You'll also: Solve challenging technical problems, often ones… more
- NVIDIA (Santa Clara, CA)
- …with Go for Kubernetes controllers and operators development. + Deep understanding of distributed systems, parallel computing, and GPU architectures. + ... we're searching for engineers enthusiastic about building the next generation of scalable AI systems. As a Senior Applied AI Software Engineer on the Dynamo… more
- NVIDIA (Santa Clara, CA)
- …designing and building software, especially related to Go, Rust and C, experience with Systems Software and Distributed systems, as well as excellent ... NVIDIA is looking for a hardworking Sr. Systems Software Engineer to work on ... + Understanding of performance, security and reliability in complex distributed systems. Ways to stand out from… more
- Capital One (McLean, VA)
- Senior Lead Software Engineer, Distributed Systems (Go, Java, Kubernetes, AWS) **The Capital One machine learning platform organization manages our ... processing workloads. We are seeking a Senior Lead Software Engineer who is passionate about working on large scale distributed systems to help develop foundational capabilities… more
- NVIDIA (Santa Clara, CA)
- …software professional to contribute to design and development of accelerated and distributed implementations of Python APIs for numerical computing. In the ... team that is working to unlock the power of distributed GPU computing for domains such as ... production use + contribute to the development of runtime systems that underlay the foundation of multi-GPU computing… more
- NVIDIA (Santa Clara, CA)
- …from the crowd: + Technical competency in managing and automating large-scale distributed systems independent of cloud providers. Advanced hands-on experience ... apply today! For two decades, we have pioneered visual computing, the art and science of computer graphics. With ... part of a DGX Cloud team responsible for production systems that enable large scalable GPU clusters to be… more
- TP-Link North America, Inc. (Irvine, CA)
- …Master's degree is preferred. Work Experience + Over 5 years of experience in cloud computing, distributed systems, database systems, or related fields. ... a seamless, effortless lifestyle. OVERVIEW As a Senior Cloud Engineer - Distributed Database & Middleware, you ... to ensure project success. + Mentor team members in distributed systems, database governance, and performance tuning… more
- NVIDIA (Santa Clara, CA)
- …and fleet management engineering. + Experience with infrastructure automation and distributed systems design developing tools for running large scale ... knowledge in one or more of the following: Linux, Slurm, Kubernetes, Local and Distributed Storage, and Systems Networking. Ways to stand out from the crowd:… more
- Amgen (Washington, DC)
- …transform the lives of patients while transforming your career. **Senior High Performance Computing Engineer** **What you will do** Let's do this. Let's change ... + Experience in an Agile development environment. + Prior work with distributed computing and big data technologies (Hadoop, Spark). **Professional… more
- Amazon (Cupertino, CA)
- Description AWS Utility Computing (UC) provides product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC ... suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio. Annapurna Labs (our ... accelerators. This role is for a senior machine learning engineer in the Distributed Training team for AWS Neuron,… more
- Amazon (Sunnyvale, CA)
- …and Amazon. Our team is constantly innovating, finding new ways of building massively scalable distributed systems. We set a high bar to build and deliver highly ... computing / We are looking for a Software Development Engineer who is excited by the unique challenges in ... the launch. - Experience in designing and implementing large-scale distributed systems. - Demonstrated ability to distill… more
- Emory Healthcare/Emory University (Atlanta, GA)
- …with a variety of technical teams and with Emory faculty to help engineer well-designed high performance computing solutions that advance knowledge discovery, ... to provide both cloud and on-premises IT infrastructure to meet the growing computing and analysis needs of its research and teaching community, particularly in the… more
- Amazon (Cupertino, CA)
- Description AWS Utility Computing (UC) provides product innovations - from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic ... use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team ... and runtime engineers to create, build and tune distributed training solutions with Trn1. Experience training these large… more
- Amazon (Cupertino, CA)
- …offers growth opportunities in ML infrastructure, bridging the gap between frameworks, distributed systems, and hardware acceleration. About the team Annapurna ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: - The ML Distributed Training team works side by side with chip… more
- Amazon (Seattle, WA)
- Description AWS Utility Computing (UC) provides product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC ... suite of generative AI services and other cutting-edge cloud computing offerings across the AWS portfolio. Annapurna Labs (our ... use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team… more
- Google (Sunnyvale, CA)
- …or 1 year of experience with an advanced degree. + Experience in distributed computing or machine learning infrastructure. Preferred qualifications: + Master's ... academic or industry setting. + Experience building and supporting large scale distributed systems and infrastructure. + Familiarity with Kubernetes development,… more
- NVIDIA (Santa Clara, CA)
- Modern data centers are transforming into AI factories, and NVIDIA accelerated computing is the engine of artificial intelligence. Our data center platforms ... seeking a highly technical and creative Senior Technical Marketing Engineer to join our team to showcase the innovations ... world's largest AI models. This role will focus on distributed AI model training, ensuring that customers and partners… more
- Amazon (Seattle, WA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distributed Training team for AWS Neuron, responsible for development, ... Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip ... comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating - that's why customers… more
- SAIC (Chantilly, VA)
- **Description** JOB DESCRIPTION: SAIC is seeking a Grid Computing Engineer in Chantilly, VA to oversee a globally deployed, centrally managed, decentralized big ... data computing environment. The successful candidate leverages their strong verbal ... troubleshooting Information Technology (IT) infrastructure. + 5-years with virtual, distributed, and/or cloud data processing. + 3-years with service… more