• Reliability Systems Engineer

    Northrop Grumman (Sunnyvale, CA)
    …development through system retirement. We are looking for you to join our team as a ** Reliability / Systems Safety Engineer Level 3 /4** based out of ... Sunnyvale, CA **What you'll get to do:** The ** Reliability / Systems Safety Engineer Level 3 /4** we are seeking will be an individual who thrives in a… more
    Northrop Grumman (05/22/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Engineer - Operations…

    ServiceNow, Inc. (Santa Clara, CA)
    …make the world work better for everyone. We are seeking a **Senior Staff Engineer ** with a strong background in operations, reliability , and DevOps strategy to ... product engineering organization. You won't just implement tools-you'll shape **how we engineer for reliability ** . If you're passionate about bringing people,… more
    ServiceNow, Inc. (05/10/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, involved ... What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards,...well as the ability to communicate at a high level . + Self-motivating, independent, and committed to getting things… more
    NVIDIA (04/17/25)
    - Save Job - Related Jobs - Block Source
  • Staff Hardware Reliability Engineer

    Google (Mountain View, CA)
    …leadership level communication, project management, planning and organizational skills. As a Reliability Engineer , you will play a key role in creating new ... for new product development of complex designs and technologies systems . Own and oversee material, design, or reliability...+ benefits. Our salary ranges are determined by role, level , and location. Within the range, individual pay is… more
    Google (05/10/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of ... of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on… more
    NVIDIA (03/26/25)
    - Save Job - Related Jobs - Block Source
  • Silicon Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …the forefront of technological advancement. We are now looking for a Silicon Reliability Engineer . NVIDIA's Silicon Reliability Engineers are responsible for ... reliability stress hardware and software infrastructures and participate in a system- level High-Temperature Operating Life (HTOL) reliability test. + Build… more
    NVIDIA (05/23/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - Site…

    General Motors (Mountain View, CA)
    …our customers, including fleet management, energy optimization, transportation logistics, safety systems , and more. To fulfill our mission, we are actively expanding ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements...and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an… more
    General Motors (04/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big picture of how ... our systems relate to each other, we use a breadth...Issues: Perform comprehensive troubleshooting from bare metal to application level , ensuring system reliability and efficiency. +… more
    NVIDIA (04/04/25)
    - Save Job - Related Jobs - Block Source
  • Principal Site Reliability Engineer

    Palo Alto Networks (Santa Clara, CA)
    …a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer , you will be part of a team supporting the services running ... This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability . Our stack includes Kubernetes, Docker, GCP, AWS, Ansible,… more
    Palo Alto Networks (04/18/25)
    - Save Job - Related Jobs - Block Source
  • Sr Site Reliability Engineer (App…

    Palo Alto Networks (Santa Clara, CA)
    …large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the ... This includes automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab… more
    Palo Alto Networks (04/17/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Staff Software Engineer

    LinkedIn (Mountain View, CA)
    …Pub/Sub systems , Kubernetes, and platforms. Suggested Skills: -Distributed Systems -Technical Leadership -Infrastructure Reliability - Systems ... passion for distributed technologies and algorithms, API design and systems design, and your passion for writing code that...impact within our company. As a Sr. Staff Software Engineer , you will be a key technical leader and… more
    LinkedIn (04/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Site…

    Google (Sunnyvale, CA)
    …Preferred qualifications: + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering to ... that Google Cloud's services-both our internally critical and our externally-visible systems -have reliability , uptime appropriate to customer's needs and a… more
    Google (05/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Machine Learning Engineer

    ServiceNow, Inc. (Santa Clara, CA)
    …that unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
    ServiceNow, Inc. (05/13/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer , Site…

    Google (Sunnyvale, CA)
    systems . + Excellent problem-solving skills for monitoring and troubleshooting serving systems . Site Reliability Engineering (SRE) combines software and ... that Google Cloud's services-both our internally critical and our externally-visible systems -have reliability , uptime appropriate to customer's needs and a… more
    Google (05/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …is looking to hire a deeply technical, creative, and experienced Principal Site Reliability Engineer (SRE) with expertise in Content Delivery Networks (CDN). ... + Outstanding communication skills, problem-solving, negotiation, and interpersonal skills. + Expert- level knowledge of managing and debugging Unix/Linux systems more
    NVIDIA (04/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... - AWS, GCP, and On-prem. + Ensure the highest level of uptime and Quality of Service (QoS) for...as Code (IaC). + Deep understanding of Linux operating systems and TCP/IP fundamentals. + Expertise with at least… more
    NVIDIA (04/02/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager II, Site…

    Google (Sunnyvale, CA)
    …Computer Science or Engineering. + 1 year of people management experience. Site Reliability Engineering (SRE) combines software and systems engineering to build ... SRE ensures that Google's services-both our internally critical and our externally-visible systems -have reliability , uptime appropriate to users' needs and a… more
    Google (04/15/25)
    - Save Job - Related Jobs - Block Source
  • PhD Software Engineer , PhD, Early Career,…

    Google (Sunnyvale, CA)
    …multi-threading, or synchronization. Preferred qualifications: + Experience with performance, reliability , systems data analysis, visualization tools, or ... systems and products. As a Google PhD Software Engineer , you will work on a specific project critical...services around the world. We prioritize security, efficiency, and reliability across everything we do - from developing our… more
    Google (04/09/25)
    - Save Job - Related Jobs - Block Source
  • Hardware Systems Engineer

    Meta (Sunnyvale, CA)
    **Summary:** We are seeking a highly skilled Hardware Systems Engineer to join our team at Meta Reality Labs. As a key member of the hardware systems ... requirements and function correctly in the intended environment. As a Hardware Systems Engineer , you will build and maintain relationships with stakeholders… more
    Meta (05/14/25)
    - Save Job - Related Jobs - Block Source
  • Lead Systems Hardware Engineer

    Amazon (Sunnyvale, CA)
    …solutions for integration into Amazon's products, services, and operations. The Lead Systems HW Engineer is responsible for system impact assessment and ... Description We are seeking a highly skilled and innovative Lead Hardware Engineer to join our Amazon Device Climate Tech Accelerator program. This role will be… more
    Amazon (05/10/25)
    - Save Job - Related Jobs - Block Source