• Senior Customer Reliability

    Google (Sunnyvale, CA)
    Senior Customer Reliability Engineer , Reliability Incident Management _corporate_fare_ Google _place_ New York, NY, USA; Austin, TX, USA; +2 more; +1 ... use technology to connect with customers, employees and partners. As a Senior Customer Reliability Engineer , you will be a pivotal individual contributor on… more
    Google (10/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a ... crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build automation… more
    NVIDIA (11/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …demands robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
    NVIDIA (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Site…

    Google (Sunnyvale, CA)
    Senior Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving ... meet some of our SREs. + Read acareer profile (https://careers.google.com/stories/site- reliability -engineering-profile-google/) about why a software engineer chose… more
    Google (09/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer , Site…

    Google (Sunnyvale, CA)
    Senior Systems Engineer , Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and run… more
    Google (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Quality & Reliability Engineer

    Amazon (Cupertino, CA)
    …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna...future technologies. * Drive manufacturing process improvements to address reliability issues and concerns. * Qualify manufacturing lines and… more
    Amazon (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Advanced** Experience ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/site- reliability -engineering-profile-google/) about why a Software Engineer chose… more
    Google (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …great pride in providing excellent, comprehensive support to our customers! ​Sr Site Reliability Engineer in this role will significantly impact and contribute ... Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python + In-depth… more
    NVIDIA (10/28/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
    NVIDIA (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Reliability Methodology…

    NVIDIA (Santa Clara, CA)
    …that are groundbreaking in AI and computing. What you'll be doing: As a Reliability Methodology Engineer at NVIDIA, you will be responsible for ensuring our ... design, product, and test engineering teams to apply DFT methodologies to improve reliability screening specific to HTOL (Component level Hight Temp Op. Life Test).… more
    NVIDIA (10/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    General Motors (Sunnyvale, CA)
    …Groovy + On-call and fire-fighting experience + Experience with modern site reliability practice including but not limited to post mortem, SLO/SLI, Tracing, ... Synthetic monitoring, etc. **What Will Give You A Competitive Edge (Preferred Qualifications)** + You have experience managing Azure cloud platform and are the domain expert + You are well known for being customer focused. + You had demonstrated low tolerance… more
    General Motors (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Site…

    LinkedIn (Sunnyvale, CA)
    …Streams IO Reliability Team, you would be you will be a reliability -focused software engineer responsible for driving success of our Pubsub ecosystems, ... in Mountain View, CA. Come join the Streams IO Reliability Team responsible for maintaining one of the largest...Streaming ecosystems on the planet. The LinkedIn Streams IO Reliability Team team is responsible for maintaining LinkedIn's pubsub… more
    LinkedIn (11/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior DevOps Service Reliability

    NVIDIA (Santa Clara, CA)
    …engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary levels of support for our ... with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to… more
    NVIDIA (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Systems Engineer

    General Motors (Sunnyvale, CA)
    …of the Autonomous Vehicle (AV) software stack through automation, data-driven reliability insights, and systematic validation processes. The mission is to accelerate ... the velocity and stability of AV releases by unifying software engineering, reliability analysis, and release automation under one cohesive framework. In this… more
    General Motors (10/28/25)
    - Save Job - Related Jobs - Block Source
  • Senior System ASIC Engineer - Speed…

    NVIDIA (Santa Clara, CA)
    …and board designers, software/firmware engineers, HW/SW applications engineering, process/ reliability specialists, DFx engineers, ATE engineers, product managers, ... our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!… more
    NVIDIA (11/07/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy ... enable the right business decisions around improving quality and reliability of our services and products + Act as...availability and performance + Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • (USA) Senior Director, Site…

    Walmart (Sunnyvale, CA)
    …management, or related area., SRE certification (for example, IBM Cloud Site Reliability Engineer )., We value candidates with a background in creating ... ** **What you'll do ** **Location: Sunnyvale / Bentonville​** **Department: Reliability Engineering / Business Reliability Engineering (BRE)** **Reports To:… more
    Walmart (11/13/25)
    - Save Job - Related Jobs - Block Source
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... data for reporting, alerting, monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
    NVIDIA (11/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …a green card, will be considered. **Role Overview ** We're seeking a Senior Staff Software Engineer with strong full stack development experience (expertise ... sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how...plays a key role in shaping the quality and reliability of our products. **About the Team** The Public… more
    ServiceNow, Inc. (11/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …daily through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with L2, Other ... that will ensure the highest levels of availability and reliability of CPC applications. About Team: Our team works...probability of failure, frequency of failure) to measure site reliability . Monitors site reliability conditions and new… more
    Walmart (11/14/25)
    - Save Job - Related Jobs - Block Source