- Amazon (East Palo Alto, CA)
- …the team that architects, designs, and implements highly scalable distributed database systems with availability, reliability and performance guarantees. This is ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
- NVIDIA (Santa Clara, CA)
- …Data Center Servers. What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, ... increases with every GPU generation, developing efficient and reliable systems is an imperative. We are looking for a...System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, involved in NVIDIA's diverse… more
- Rubrik (Palo Alto, CA)
- …we want to talk to you! **About The Role:** Senior Site Reliability Engineers at Rubrik are systems /software engineers who ensure that Rubrik's ... and have the capacity for future growth. As a Senior Site Reliability Engineer, you will be...our customers + Design, implement and maintain relational database systems for performance and reliability + Manage… more
- Netflix (Los Gatos, CA)
- …for a software engineering manager to lead our Ecosystem Platform and Reliability team. Role We are seeking an experienced Software Engineering Manager to ... enabling brilliant Netflix experiences on partner devices and building foundational client systems and tools. Partner Enablements Apps group within CPT is looking… more
- NVIDIA (Santa Clara, CA)
- …AI training and inferencing. The responsibilities include implementing software and systems engineering practices to ensure high efficiency and availability ... and scale to foster innovation. We are seeking a Senior Site Reliability Engineer (SRE) to join...you'll be doing: + Develop software solutions to ensure reliability and operability of large-scale systems supporting… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high ... efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demand knowledge… more
- NVIDIA (Santa Clara, CA)
- …to run htol process using next generation oven while ensuring world class reliability . What you will be doing: + Developing, debugging, and managing test programs ... + Leading and providing technical guidance to lab technicians and various engineering groups to ensure proper and seamless bring up. + Working continuously… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. This ... hosts a heterogeneous mix of machines and devices with various operating systems (Windows/Linux/Android), a multitude of hardware platforms both NVIDIA GPUs and… more
- NVIDIA (Santa Clara, CA)
- …workflows. In this role, you will design, implement and support operational and reliability aspects of large scale distributed systems with focus on performance ... approaches for our GPU Compute Clusters. As a Site Reliability Engineer, you will help us with the strategic...live, on some of the largest and most complex systems in the world What we need to see:… more
- Tarana Wireless (Milpitas, CA)
- As a Senior Site Reliability Engineer, you will help us manage software that runs on the cloud and remotely manages millions of radio devices. You will work on a ... to support millions of connected devices + Monitoring of all live systems + Troubleshoot and triage production active issues Required Skills and Experience:… more
- LinkedIn (Mountain View, CA)
- …career growth. Join us to challenge yourself with work that matters. The Site Reliability TPM team at LinkedIn is seeking a Staff Technical Program Manager, a highly ... Platform and Autonomous Fleet programs, aligning with business objectives. * Collaborate with senior leadership to ensure program goals are in sync with the overall… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role ... operations of our growing IT ecosystem. You will collaborate closely with engineering teams to align infrastructure with their evolving needs, document best… more
- Abbott (Pleasanton, CA)
- …executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer** As senior member of Site Reliability Engineering , you will play ... software engineering teams and business stakeholders establish and evolve reliability goals and measure progress against those goals using SLIs/SLOs + Automates… more
- NVIDIA (Santa Clara, CA)
- … and efficiency. What We Need To See: + Extensive experience in a senior -level role within Site Reliability Engineering , particularly in managing storage ... As a Sr Manager in Site Reliability Engineering (SRE), you will lead...availability. This role spans various domains, including software and systems engineering , cloud-scale storage, data management, and… more
- Palo Alto Networks (Santa Clara, CA)
- …Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a team ... This includes automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab… more
- Insight Global (Santa Clara, CA)
- …Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, youll also be working...working in conjunction with various teams such as software engineering to deploy these new products and manage our… more
- Amazon (Cupertino, CA)
- …Linux/Unix environment experience - 5+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience - 5+ ... largest cloud computing infrastructures in the world, and managing systems at scale? If yes, come join us. Key...customers. A day in the life Lead the Hardware Engineering (HWEng) System Development (SysDE) effort to define and… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Datacenter Baseboards Product Development Engineering Manager. NVIDIA Corporation is a world leader in visual computing and artificial ... artificial intelligence and autonomous cars! Collaborating with your peers across various engineering groups, you will successfully launch new NVIDIA HGX and MGX AI… more
- Amazon (Sunnyvale, CA)
- …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... that cares about people just as much as products? Amazon Traffic Engineering builds innovative managed compute and networking solutions that empower Amazon Software… more
- Microsoft Corporation (San Jose, CA)
- We are looking for a ** Senior Systems Engineer** to join the team. As a Systems Engineering team member, you will work directly with engineers across ... To achieve this goal, the Cloud AI & Advanced Systems Engineering (CAASE) team is instrumental in...+ Collaborate with internal and external partners to ensure systems meet significant quality, reliability , and service… more