• SRE : Level 5

    TEKsystems (Oakland, CA)
    Description CLIENT JD: We are looking for a Site Reliability Engineer ( SRE ) to join the IT AI Infrastructure team to deploy, manage, and optimize AI-powered ... including leadership. Skills Proven experience as a Site Reliability Engineer ( SRE ) or similar role. Strong understanding of AI technologies and platforms.… more
    TEKsystems (11/03/25)
    - Save Job - Related Jobs - Block Source
  • Lead Software Engineer ( SRE / Devops)

    Capital One (San Jose, CA)
    Lead Software Engineer ( SRE / Devops) **Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, ... or part-time status, exempt or non-exempt status, and management level . This role is expected to accept applications for...is expected to accept applications for a minimum of 5 business days. No agencies please. Capital One is… more
    Capital One (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Sr Manager, Site Reliability Engineering (InfoSec)

    Palo Alto Networks (Santa Clara, CA)
    …we all win with precision. **Your Career** We are actively seeking a highly motivated DevOps/ SRE Senior Manager to lead our Global InfoSec SRE team, based at our ... our team, especially given the global distribution of our SRE group. The InfoSec SRE group is...hours, ensuring prompt and effective resolution. **Your Experience** + ** 5 + years of proven industry experience** with DevOps methodologies,… more
    Palo Alto Networks (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to... 4, and 200,000 USD - 322,000 USD for Level 5 . You will also be eligible ... source cloud enabling technologies like Kubernetes and Public Cloud. SRE at NVIDIA ensures that our internal and external...while keeping an eye on capacity, latency and performance. SRE is also a mindset and a set of… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager II, Site Reliability…

    Google (Sunnyvale, CA)
    …experience. + 8 years of experience with data structures or algorithms. + 5 years of experience with software development in one or more programming languages. ... year of people management experience. **About the job** Site Reliability Engineering ( SRE ) combines software and systems engineering to build and run large-scale,… more
    Google (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer, Site Reliability…

    Google (Sunnyvale, CA)
    …degree in Computer Science, a related field, or equivalent practical experience. + 5 years of experience with programming in one or more programming languages. + ... Science or Engineering. **About the job** Site Reliability Engineering ( SRE ) combines software and systems engineering to build and...+ benefits. Our salary ranges are determined by role, level , and location. Within the range, individual pay is… more
    Google (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer - Observability…

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering ( SRE ) at NVIDIA is an engineering discipline to... 4, and 208,000 USD - 333,500 USD for Level 5 . You will also be eligible ... and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at NVIDIA ensures that our internal and external facing GPU cloud… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior ML Platform Engineer - Lepton

    NVIDIA (Santa Clara, CA)
    …USD for Level 4, and 224,000 USD - 356,500 USD for Level 5 . You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) ... a proven track record of building and managing production infrastructure. + SRE -oriented mindset with extensive experience in diagnosing system- level issues,… more
    NVIDIA (11/04/25)
    - Save Job - Related Jobs - Block Source
  • Sr. AWS Cloud & DevOps Architect - Remote

    McAfee, Inc. (San Jose, CA)
    …alerting solutions to maintain system health and security​ + Drive SRE practices by implementing strategies that improve reliability, availability, and scalability ... of cloud infrastructure. + Develop and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs),...and guide junior engineers in cloud architecture, DevOps, and SRE best practices. + Act as a subject matter… more
    McAfee, Inc. (10/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …reviews, assist in root cause identification, and write RCA reports. + Deliver SRE solutions in a globally distributed, multi-cloud hybrid environment - AWS, GCP, ... and On-prem. + Ensure the highest level of uptime and Quality of Service (QoS) for...Science or related technical field (or equivalent experience) with 5 + years in building and supporting critical services. +… more
    NVIDIA (09/16/25)
    - Save Job - Related Jobs - Block Source
  • Senior, Software Engineer

    Walmart (Sunnyvale, CA)
    …bring:** + Bachelor's degree in Computer Science, Engineering or related discipline + 5 + years of hands-on related to SRE , Operations ; Development experience ... solutions and services making a profound impact at every level of Walmart. As a key part of Walmart...years' experience in software engineering or related area.Option 2: 5 years' experience in software engineering or related area.… more
    Walmart (08/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Engineer, Cloud…

    NVIDIA (Santa Clara, CA)
    …USD for Level 4, and 224,000 USD - 356,500 USD for Level 5 . You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) ... well as managing vendor relationships. You will partner with engineering, SRE , product, and third-party infrastructure providers to achieve operational excellence.… more
    NVIDIA (10/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior Customer Reliability Engineer, Reliability…

    Google (Sunnyvale, CA)
    …management and response in a distributed systems environment. **Preferred qualifications:** + 5 years of experience in a technical role such as Site Reliability ... Customer Engineering or professional services. + Experience in applying SRE principles to improve the reliability and performance of...+ benefits. Our salary ranges are determined by role, level , and location. Within the range, individual pay is… more
    Google (10/17/25)
    - Save Job - Related Jobs - Block Source
  • Customer Reliability Engineer, Reliability…

    Google (Sunnyvale, CA)
    …management and response in a distributed systems environment. **Preferred qualifications:** + 5 years of experience in a technical role such as Site Reliability ... Customer Engineering or professional services. + Experience in applying SRE principles to improve the reliability and performance of...+ benefits. Our salary ranges are determined by role, level , and location. Within the range, individual pay is… more
    Google (10/17/25)
    - Save Job - Related Jobs - Block Source
  • Principal Database Architect

    Zoom (San Jose, CA)
    …and by driving improvements to automation. Broadly speaking, you are an exemplary SRE /DevOps leader who will guide our teams toward best practices. About the Team ... + Guiding projects that deliver automation, improve monitoring, and level up our DBA teams' ability to manage databases...15 years of experience as a Site Reliability Engineer ( SRE ) or DevOps Engineer role. + Have proven track… more
    Zoom (10/31/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer (Backend)

    Palo Alto Networks (Santa Clara, CA)
    …more. + Collaborate closely with the Product management, Development, Quality Assurance, SRE and Customer support teams on delivering the roadmap and improving ... ability to introduce monitoring/tracing of application logs. + Experience handling Devops, SRE , availability and reliability outcomes for a large cloud product. + … more
    Palo Alto Networks (09/03/25)
    - Save Job - Related Jobs - Block Source
  • Cloud Engagement Lead - West Coast

    EPAM Systems (San Jose, CA)
    …Professional Services to the BU Team + Collaborate with the Director, VP, and C- level management at clients and serve as a trusted Cloud technology advisor for them ... delivery with recent focus on Cloud Modernization projects + 5 + years of demonstrated track record of developing and...development, Cloud managed services, Cloud Governance, CCoE, DevOps, and SRE + Industry experience in one or many areas… more
    EPAM Systems (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX Cloud Software Engineer…

    NVIDIA (Santa Clara, CA)
    …USD for Level 4, and 208,000 USD - 333,500 USD for Level 5 . You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) ... Participate in the definition of our internal facing service level objectives and error budgets as part of our...coding (eg, physics or mathematics) or equivalent experience. + 5 + years of relevant experience in infrastructure and fleet… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer, Cloud Functions

    NVIDIA (Santa Clara, CA)
    …USD for Level 4, and 224,000 USD - 356,500 USD for Level 5 . You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) ... collaboration with teams across various departments with the goal of reducing SRE toil and improving hardware utilization + Collaborating with various organizations… more
    NVIDIA (10/31/25)
    - Save Job - Related Jobs - Block Source
  • Principal Engineer Software (Cloud Application)

    Palo Alto Networks (Santa Clara, CA)
    …ownership of their areas of focus and who are driven to pursue problems at every level . Collaboration is at the heart of our culture and we need engineers who can ... communicate at a high level and work well with multi-functional teams towards achieving...implementation and test + Lead cross-functionally with Product Management, SRE , Software, and Quality Engineering teams to deliver new… more
    Palo Alto Networks (09/09/25)
    - Save Job - Related Jobs - Block Source