• Senior SRE

    IBM (San Jose, CA)
    …thrive. **Your role and responsibilities** We are seeking a Sr Customer Support / SRE to join our team who is responsible for delivering Astra Streaming (Apache ... **Required technical and professional expertise** * 5+ years of experience in SRE , DevOps, or Production Engineering for large-scale distributed systems. * Deep… more
    IBM (11/13/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Reliability Engineer

    Abbott (Pleasanton, CA)
    …and compliant with healthcare regulations-this is the role for you. As a Senior SRE , you'll work closely with engineering, QA, cybersecurity, and regulatory ... and scientists. **The Opportunity** We're looking for a strong ** Senior Site Reliability Engineer ( SRE )** who's ready to roll up their sleeves and make… more
    Abbott (09/20/25)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew ... Job Description Insight Global is looking for a seasoned SRE to join one of our largest technology clients'...cater to their infrastructure & systems needs. As an SRE , you'll also be working in conjunction with various… more
    Insight Global (09/09/25)
    - Save Job - Related Jobs - Block Source
  • Sr Manager, Site Reliability Engineering (InfoSec)

    Palo Alto Networks (Santa Clara, CA)
    …we all win with precision. **Your Career** We are actively seeking a highly motivated DevOps/ SRE Senior Manager to lead our Global InfoSec SRE team, based ... and resolving high-priority production maintenance issues and incidents. As the Senior Manager for the Infosec SRE Group, you will lead a team in a cutting-edge… more
    Palo Alto Networks (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    Eliassen Group (Concord, CA)
    …+ Recommended Jobs **Description:** **Hybrid | Concord, CA** We are seeking a Senior Site Reliability Engineer ( SRE ) to join our Digital Platform Engineering ... ** Senior Site Reliability Engineer** **Concord, CA** **Type:** Contract...**Responsibilities:** . Production Support & Escalation: Serve as a senior escalation point for Platform Engineers, providing expert troubleshooting… more
    Eliassen Group (10/28/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer ( SRE ) to advance our enterprise ... experience (or equivalent experience). + 10+ years of software engineering/DevOps/ SRE experience, with a significant focus on operational security, automation,… more
    NVIDIA (09/30/25)
    - Save Job - Related Jobs - Block Source
  • (USA) Senior Director, Site Reliability…

    Walmart (Sunnyvale, CA)
    …resilience and operational continuity at an unprecedented scale. We are seeking a Senior Director, Agentic AI to lead the development and deployment of intelligent, ... Objective (RPO) standards. **Technology Execution** * Build and Lead SRE SWAT Team o Lead Walmart's elite SRE... SRE SWAT Team o Lead Walmart's elite SRE SWAT Team, a battle-ready force specializing in rapid… more
    Walmart (11/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer, Site…

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer, Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Advanced** Experience owning ... in Computer Science or Engineering. **About the job** Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run large-scale,… more
    Google (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff DevOps Engineer - Cloud…

    ServiceNow, Inc. (Pleasanton, CA)
    …next-generation analytics to support ServiceNow's Cloud and AI growth. As our Senior Staff DevOps Engineer for Cloud Analytics & FinOps Engineering Platform, you ... SLIs/SLOs/SLAs for data platform services with error budget management, establish SRE practices including toil reduction and capacity planning, and create… more
    ServiceNow, Inc. (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    Senior Systems Engineer, Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving ... in Computer Science or Engineering. **About the job** Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run large-scale,… more
    Google (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    Senior Software Engineer, Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... systems is a true strategy, and a good one._ Site Reliability Engineering ( SRE ) is an engineering discipline that combines software and systems engineering to build… more
    Google (09/25/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer - DGX…

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external...while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of… more
    NVIDIA (11/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior ML Platform Engineer - Lepton

    NVIDIA (Santa Clara, CA)
    …the world's most powerful GPU systems. Join our top team and apply your SRE and software engineering skills to craft robust, user-friendly platforms for seamless ML ... reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, and resolve complex system issues… more
    NVIDIA (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, Network Site Reliability…

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer ( SRE ) to enhance our network infrastructure and operations. We are looking for a leader who ... ensuring a smooth user experience. The position focuses on managing Network SRE to streamline network operations, minimize manual tasks, and achieve service level… more
    NVIDIA (11/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior Customer Reliability Engineer,…

    Google (Sunnyvale, CA)
    Senior Customer Reliability Engineer, Reliability Incident Management _corporate_fare_ Google _place_ New York, NY, USA; Austin, TX, USA; +2 more; +1 more **Mid** ... Software Engineering, Customer Engineering or professional services. + Experience in applying SRE principles to improve the reliability and performance of systems. +… more
    Google (10/17/25)
    - Save Job - Related Jobs - Block Source
  • ( Senior ) Software Engineer,…

    pony.ai (Fremont, CA)
    …Pony.ai went public at NASDAQ in November 2024. Responsibilities As a ( Senior ) Kubernetes Engineer, you will: + Design, operate, and optimize Kubernetes clusters ... security policies, and operational guidelines. + Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven… more
    pony.ai (09/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Machine Learning Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …AI technologies that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer you will:** + Contribute to the design, development ... well, and remain reliable. + Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling. +… more
    ServiceNow, Inc. (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer, Cloud…

    NVIDIA (Santa Clara, CA)
    …well as managing vendor relationships. You will partner with engineering, SRE , product, and third-party infrastructure providers to achieve operational excellence. ... operational excellence best practices across all infrastructure providers, partnering with SRE , infra, product, and security teams + Define and operationalize… more
    NVIDIA (10/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …+ Build reusable tools, library, dashboards which can be used across DevOps/ SRE teams **What you'll bring:** + Bachelor's degree in Computer Science, Engineering ... or related discipline + 5+ years of hands-on related to SRE , Operations ; Development experience with Java Script, Java, Restful services, Git, Maven, Jenkins,… more
    Walmart (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Program Manager, DGX Cloud…

    NVIDIA (Santa Clara, CA)
    NVIDIA is seeking a Senior Technical Program Manager to lead the Infrastructure and Product Security and Compliance program for DGX Cloud. In this role, you will ... highest standards of trust, resilience, and governance. As a Senior TPM focused on Cloud Security, you will own...and processes, establishing security KPIs, dashboards, and "run safe" SRE practices. + Partner with the CISO organization to… more
    NVIDIA (11/14/25)
    - Save Job - Related Jobs - Block Source