We interpreted Mountain View, CA as Mountain View, CA. Other options include: Mountain View (Contra Costa County), CA
- Amazon (Cupertino, CA)
- …degree in computer science or equivalent - Preferred previous software engineer expertise with Pytorch/Jax/Tensorflow, Distributed libraries and Frameworks, ... Web Services (AWS) is looking for a Software Development Engineer II to build, deliver, and maintain complex products...customers and raise our performance bar. You'll design fault-tolerant systems that run at massive scale as we continue… more
- Amazon (Cupertino, CA)
- …degree in computer science or equivalent - Preferred previous software engineer expertise with Pytorch/Jax/Tensorflow, Distributed libraries and Frameworks, ... use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team...stable diffusion, Vision Transformers and many more. The ML Distributed Training team works side by side with chip… more
- eightfold.ai (Santa Clara, CA)
- …internal/external technical presentations. + Architect, design, develop, maintain, and support distributed systems in Eightfold's core infrastructure. + Build ... Eightfold offers the industry's first AI -powered Talent Intelligence SaaS Platform to transform how...secure, scalable and highly available + Expertise in building distributed systems at cloud scale + Familiarity… more
- NVIDIA (Santa Clara, CA)
- …expect you to have a strong programming background, a deep understanding of distributed systems , familiarity with software testing and deployment, and excellent ... Algorithms. + Understanding of performance, security and reliability in complex distributed systems . Familiarity with system level architecture, data… more
- LinkedIn (Mountain View, CA)
- …such as Scala or other relevant coding languages -Hands-on experience developing distributed systems or other large-scale systems . Preferred Qualifications: ... billions of user queries Model Training Infrastructure: As an engineer on the AI Training Infra team,...ML applications, LLM serving, GPU serving. -Experience with search systems or similar large-scale distributed systems… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
- LinkedIn (Sunnyvale, CA)
- …working with streaming solutions (Kafka, Samza, etc.) Suggested Skills: * Distributed Systems Design* Technical Leadership Experience* Ability to solve ... impact on all LinkedIn products, establishing the platform as a leader in the AI realm. Responsibilities: As a Staff Engineer at LinkedIn, you will be… more
- LinkedIn (Sunnyvale, CA)
- …internals MS or PhD in Computer Science or related technical disciplineExperience with distributed systems development or distributed ML workloads Suggested ... full-time engineering role based in Sunnyvale, CA Team Overview: Foundational AI Technologies (FAIT) organization stands as the innovation epicenter, addressing the… more
- Amazon (Santa Clara, CA)
- …programming language - Experience in developing highly scalable, fault-tolerant, distributed systems - Experience with multi-threaded asynchronous development ... Description AWS AI is looking for world-class software developers to...the space. Key job responsibilities As a Software Development Engineer in the SageMaker Engines team, you will be… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network… more
- NVIDIA (Santa Clara, CA)
- …performance critical applications + Experience implementing, tuning, and debugging runtimes and/or distributed systems for supercomputers or the cloud + Good ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with...develop and enhance the functionality and performance of runtime systems that underlay the foundation of distributed … more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
- LinkedIn (Mountain View, CA)
- …leading / building deep learning systems - Hands-on experience developing distributed systems or other large-scale systems Preferred Qualifications: - ... work for Training infrastructure. As a Senior Staff Software Engineer on the AI Training Infra team,...Rust, Scala - 5+ years of experience with large-scale distributed systems and client-server architectures - Co-author… more
- NVIDIA (Santa Clara, CA)
- We are seeking a highly motivated performance engineer to join our AI Applications organization to work on distributed cloud native accelerated video ... as part of the Metropolis ecosystem. As a performance engineer , you will work with the Application teams to...+ Experience in real-time streaming AI inference systems + A history of working on distributed… more
- Walmart (Sunnyvale, CA)
- …role which requires expertise at the intersection of large-scale distributed systems , machine learning, LLMs and more. Our AI assistants are rapidly ... Engineer to lead the next evolution of the AI assistant platform by defining and building highly scalable...assistant platform by defining and building highly scalable Generative AI systems and infrastructure. This will be… more
- NVIDIA (Santa Clara, CA)
- …ecosystem + Hands-on experience in performance optimization and benchmarking on large-scale distributed systems + Hands-on experience with NVIDIA GPUs, HPC ... roadmap in a fast-growing technology company that leads the AI revolution while helping deep learning users around the...with domain expert teams as they transition applications to distributed environments. What we need to see: + Masters… more
- NVIDIA (Santa Clara, CA)
- …of distributed storage services. + Design, implement an on-prem AI /HPC infrastructure supplemented with cloud computing to support the growing needs of ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...general procedures and practices, perform technology evaluations, related to distributed file systems . + Collaborate across teams… more
- Meta (Menlo Park, CA)
- …12. Experience in deep learning and PyTorch 13. Experience with distributed systems or on-device algorithm development 14. Experience contributing ... **Summary:** Meta is seeking a Research Engineer to join our Fundamental AI ...and machine learning techniques to build intelligent rich language systems that improve Meta's products and experiences 2. Assist… more
- Intuit (Mountain View, CA)
- …and services with an emphasis on performance. Google Cloud experience preferred + Distributed systems and data infrastructure + Familiarity with and experience ... Overview The Data Engineering teams build systems that bridge the gap between raw business...industry best practices + Experience in building and managing distributed compute infrastructure for AI /ML training at… more
- NVIDIA (Santa Clara, CA)
- …within Kubernetes + Deep understanding of cloud technologies, distributed compute systems , and distributed systems and microservices architecture + ... We are seeking a highly skilled Senior Infrastructure System Software Engineer with Kubernetes-based infrastructure experience to join our Omniverse Infrastructure… more