  • Senior SRE

    IBM (San Jose, CA)
    …thrive. **Your role and responsibilities** We are seeking a Sr Customer Support / SRE to join our team who is responsible for delivering Astra Streaming (Apache ... **Required technical and professional expertise** * 5+ years of experience in SRE, DevOps, or Production Engineering for large-scale distributed systems. * Deep… more
    IBM (11/13/25)
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew ... Job Description Insight Global is looking for a seasoned SRE to join one of our largest technology clients'...cater to their infrastructure & systems needs. As an SRE, you'll also be working in conjunction with various… more
    Insight Global (09/09/25)
  • Sr Manager, Site Reliability Engineering (InfoSec)

    Palo Alto Networks (Santa Clara, CA)
    …we all win with precision. **Your Career** We are actively seeking a highly motivated DevOps/SRE Senior Manager to lead our Global InfoSec SRE team, based ... and resolving high-priority production maintenance issues and incidents. As the Senior Manager for the InfoSec SRE Group, you will lead a team in a cutting-edge… more
    Palo Alto Networks (09/17/25)
  • Site Reliability Engineer ( Senior

    MongoDB (San Francisco, CA)
    We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE ... that reinforce the platform's security posture. This is an SRE team, which means you can expect a highly...on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A… more
    MongoDB (10/07/25)
  • Senior Staff Site Reliability Engineer…

    NVIDIA (Santa Clara, CA)
    …robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our enterprise ... experience (or equivalent experience). + 10+ years of software engineering/DevOps/SRE experience, with a significant focus on operational security, automation,… more
    NVIDIA (09/30/25)
  • (USA) Senior Director, Site Reliability…

    Walmart (Sunnyvale, CA)
    …resilience and operational continuity at an unprecedented scale. We are seeking a Senior Director, Agentic AI to lead the development and deployment of intelligent, ... Objective (RPO) standards. **Technology Execution** * Build and Lead SRE SWAT Team o Lead Walmart's elite SRE SWAT Team, a battle-ready force specializing in rapid… more
    Walmart (11/13/25)
  • Senior Staff Software Engineer, Site…

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer, Site Reliability Engineering | Google | Sunnyvale, CA, USA; New York, NY, USA **Advanced** Experience owning ... in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale,… more
    Google (09/27/25)
  • Senior Systems Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    Senior Systems Engineer, Site Reliability Engineering, Google Cloud | Google | Sunnyvale, CA, USA **Mid** Experience driving progress, solving ... in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale,… more
    Google (11/14/25)
  • Senior Software Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    Senior Software Engineer, Site Reliability Engineering | Google | Sunnyvale, CA, USA **Mid** Experience driving progress, solving problems, and ... systems is a true strategy, and a good one._ Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build… more
    Google (09/25/25)
  • Senior Site Reliability Engineer - DGX…

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external...while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of… more
    NVIDIA (11/05/25)
  • Senior ML Platform Engineer - Lepton

    NVIDIA (Santa Clara, CA)
    …the world's most powerful GPU systems. Join our top team and apply your SRE and software engineering skills to craft robust, user-friendly platforms for seamless ML ... reproducibility and scalability across large-scale, distributed GPU clusters. + Apply SRE principles to diagnose, troubleshoot, and resolve complex system issues… more
    NVIDIA (11/04/25)
  • Site Reliability Engineer ( Senior

    MongoDB (San Francisco, CA)
    **The Team** Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support the ... Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to join the...secure and efficient communication between our services. As an SRE on the Fabric team, you will leverage your… more
    MongoDB (10/07/25)
  • Senior Manager, Network Site Reliability…

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader who ... ensuring a smooth user experience. The position focuses on managing Network SRE to streamline network operations, minimize manual tasks, and achieve service level… more
    NVIDIA (11/06/25)
  • Senior Customer Reliability Engineer,…

    Google (Sunnyvale, CA)
    Senior Customer Reliability Engineer, Reliability Incident Management | Google | New York, NY, USA; Austin, TX, USA; +2 more; +1 more **Mid** ... Software Engineering, Customer Engineering or professional services. + Experience in applying SRE principles to improve the reliability and performance of systems. +… more
    Google (10/17/25)
  • (Senior) Software Engineer,…

    pony.ai (Fremont, CA)
    …Pony.ai went public at NASDAQ in November 2024. Responsibilities As a (Senior) Kubernetes Engineer, you will: + Design, operate, and optimize Kubernetes clusters ... security policies, and operational guidelines. + Contribute to observability and SRE practices to ensure reliability at scale (SLOs, incident reviews, metrics-driven… more
    pony.ai (09/16/25)
  • Senior Staff Machine Learning Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …AI technologies that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer you will:** + Contribute to the design, development ... well, and remain reliable. + Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling. +… more
    ServiceNow, Inc. (09/27/25)
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …+ Build reusable tools, library, dashboards which can be used across DevOps/SRE teams **What you'll bring:** + Bachelor's degree in Computer Science, Engineering ... or related discipline + 5+ years of hands-on related to SRE, Operations; Development experience with JavaScript, Java, RESTful services, Git, Maven, Jenkins,… more
    Walmart (11/14/25)
  • Senior AI Infrastructure Engineer, Cloud…

    NVIDIA (Santa Clara, CA)
    …well as managing vendor relationships. You will partner with engineering, SRE, product, and third-party infrastructure providers to achieve operational excellence. ... operational excellence best practices across all infrastructure providers, partnering with SRE, infra, product, and security teams + Define and operationalize… more
    NVIDIA (10/24/25)
  • Senior Technical Program Manager, DGX Cloud…

    NVIDIA (Santa Clara, CA)
    NVIDIA is seeking a Senior Technical Program Manager to lead the Infrastructure and Product Security and Compliance program for DGX Cloud. In this role, you will ... highest standards of trust, resilience, and governance. As a Senior TPM focused on Cloud Security, you will own...and processes, establishing security KPIs, dashboards, and "run safe" SRE practices. + Partner with the CISO organization to… more
    NVIDIA (11/14/25)
  • Senior Program Manager, Infrastructure…

    General Motors (Mountain View, CA)
    **Job Description** **Senior Program Manager, Infrastructure Engineering** **Hybrid:** This role is categorized as hybrid. This means the successful candidate is ... 3 locations. **The Role** We are looking for a Senior Program Manager to lead program execution of a...status, risks, and impacts clearly and effectively. Incorporate CI/CD, SRE, and Dev Experience practices into program workflows. +… more
    General Motors (11/15/25)