- NVIDIA (Santa Clara, CA)
- …the system and platform level to enhance reliability in our production flows. As a Senior Systems Reliability Engineering Lead, you will: + Lead with ... make groundbreaking impacts. What you will be doing: The Senior Systems Reliability position focuses...validation. + BS or MS in Electrical or Computer Engineering or a related field (or equivalent experience). +… more
- Amazon (East Palo Alto, CA)
- …the team that architects, designs, and implements highly scalable distributed database systems with availability, reliability and performance guarantees. This is ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
- EPAM Systems (San Jose, CA)
- …and develop, monitor, and alert on SLIs/SLOs **Requirements** + 5+ years of SRE or Systems Engineering experience + 2+ years as team lead or SRE champion + ... EPAM is hiring a **Remote Lead Site Reliability Engineer**. If you are looking for a high-impact, exciting role with a company that leads the globe in the digital… more
- Palo Alto Networks (Santa Clara, CA)
- …field or equivalent military experience required + 20+ years progressive experience in reliability engineering or reliability centered quality work for ... production and must develop, implement and improve processes and systems to support product quality and reliability ....controls and processes related to NPI product quality and reliability + Is integral in Engineering Development… more
- NVIDIA (Santa Clara, CA)
- …Data Center Servers. What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, ... increases with every GPU generation, developing efficient and reliable systems is an imperative. We are looking for a...System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, involved in NVIDIA's diverse… more
- Rubrik (Palo Alto, CA)
- Senior Site Reliability Engineers at Rubrik are systems/software engineers who ensure that Rubrik's infrastructure services run smoothly and have the ... capacity for future growth. As a Senior Site Reliability Engineer, you will be...our customers + Design, implement and maintain relational database systems for performance and reliability + Manage… more
- Rubrik (Palo Alto, CA)
- **About Team & About Role:** The Site Reliability Engineering team at Rubrik ensures reliability, availability and performance of our cutting-edge ... product stability and success. **What you'll do:** As a Senior Site Reliability Engineer, you will be...will be responsible for: + Manage and run backend systems like Kubernetes, MySQL and everything in between +… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high ... efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demands knowledge… more
- NVIDIA (Santa Clara, CA)
- …NVIDIA is looking to hire a deeply technical, creative, and Staff Site Reliability Engineer to build, support and maintain the next generation AI powered enterprise ... products that improve engineering efficiency, data security, and power our product development....shaping the technological future of our organization, ensuring our systems are scalable, reliable, and efficient. What you will… more
- LinkedIn (Mountain View, CA)
- …career growth. Join us to challenge yourself with work that matters. The Site Reliability TPM team at LinkedIn is seeking a Staff Technical Program Manager, a highly ... Platform and Autonomous Fleet programs, aligning with business objectives. * Collaborate with senior leadership to ensure program goals are in sync with the overall… more
- Tarana Wireless (Milpitas, CA)
- As a Senior Site Reliability Engineer, you will help us manage software that runs on the cloud and remotely manages millions of radio devices. You will work on a ... to support millions of connected devices + Monitoring of all live systems + Troubleshoot and triage production active issues Required Skills and Experience:… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a crucial role ... operations of our growing IT ecosystem. You will collaborate closely with engineering teams to align infrastructure with their evolving needs, document best… more
- Abbott (Pleasanton, CA)
- …executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer** As a senior member of Site Reliability Engineering, you will play ... software engineering teams and business stakeholders establish and evolve reliability goals and measure progress against those goals using SLIs/SLOs + Automates… more
- NVIDIA (Santa Clara, CA)
- …and efficiency. What We Need To See: + Extensive experience in a senior-level role within Site Reliability Engineering, particularly in managing storage ... As a Sr Manager in Site Reliability Engineering (SRE), you will lead...availability. This role spans various domains, including software and systems engineering, cloud-scale storage, data management, and… more
- Palo Alto Networks (Santa Clara, CA)
- …mission to protect our way of life in the digital age. As a Customer Reliability Engineer for Data Security, you will act as a liaison between our Customer Success ... organization and our product and engineering teams. In this role you will lead troubleshooting...product tools and telemetry + Communicate status of the systems via automation (customers) and targeted messages (TAC) +… more
- Amazon (Sunnyvale, CA)
- …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience ... Description The Region Flexibility Engineering (RFE) team builds and leverages foundational infrastructure capabilities, tools, and datasets needed to support the… more
- Zoom (San Jose, CA)
- …is not available for this position ** What you can expect As a senior level Product Resilience SRE, you will define, scope, plan, and schedule Disaster Recovery ... documents. Finally, you will communicate with stakeholders including security teams, senior managers, and customers. About the Team You will be part… more
- Celestica (San Jose, CA)
- …Position: Yes Region: Americas Country: USA **General Overview** **Job Title:** Senior Manager, Engineering Program / Project Management **Functional Area:** ... Engineering (ENG) **Career Stream:** Engineering Program / Project Management (EPM) **Role:** ...our customers' expectations are exceeded to improve yield, increase reliability, and deliver cost saving. They manage the development,… more
- Lucile Packard Children's Hospital Stanford (Palo Alto, CA)
- …technology groups that operate within the Information Services Department. Furthermore, as a senior member of the network engineering team they will pursue ... will report directly to the Network Manager and is responsible for hands-on engineering activities to ensure the successful operation of a mission critical network… more
- Sandia National Laboratories (Livermore, CA)
- …geographic salary differential. What Your Job Will Be Like: We are seeking a Senior Manager (job title: Senior Manager, R&D Scienc SG.) This group (8220) ... capabilities will be brought together to form a new organization: ND Engineering Services and Capabilities (8220). This new organization will include Sandia Programs… more