• Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... techniques and Infrastructure as Code (IaC). + Deep understanding of Linux operating systems and TCP/IP fundamentals. + Expertise with at least one major cloud… more
    NVIDIA (04/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Reliability Engineer

    Celonis (Redwood City, CA)
    …engineering and Site Reliability Engineering (SRE) principles to drive system reliability , scalability, and operational excellence across the organization. ... Engineering with modern Software Engineering practices to build resilient and scalable systems . + Lead reliability efforts for a fleet of 80+ FedRAMP-compliant… more
    Celonis (04/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Reliability

    NVIDIA (Santa Clara, CA)
    …efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering ... Center Servers. What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack,… more
    NVIDIA (04/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of ... of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on… more
    NVIDIA (03/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big picture of how ... our systems relate to each other, we use a breadth...comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define… more
    NVIDIA (04/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Reliability

    ServiceNow, Inc. (Santa Clara, CA)
    …to improve the reliability and performance of the infrastructure through improved system design. + Join a culture of intolerance to manual activity, resulting in ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
    ServiceNow, Inc. (05/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    MongoDB (San Francisco, CA)
    …any infrastructure, and our newest offering, Atlas Data Lake. The Cloud Site Reliability Engineering Team designs and builds the global infrastructure on which we ... with it, and increasing our internal visibility into the health of the system . We are strong believers in infrastructure-as-code and self-healing systems . The… more
    MongoDB (03/18/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    General Motors (Mountain View, CA)
    …testing for current, new and major programs + Lead development of software system team design content and software anomaly corrections. + Performs complex design ... analysis + Specifies and balances system requirements + Provide, communicate, and support common best...DevOps software upgrades for both development and production enterprise systems . + Advanced knowledge of Policy Management, Azure Identity… more
    General Motors (04/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …is looking to hire a deeply technical, creative, and experienced Principal Site Reliability Engineer (SRE) with expertise in Content Delivery Networks (CDN). ... projects. + Design and implement scalable, reliable, and efficient distributed systems . + Manage CDN infrastructures and ensure robust security configurations. +… more
    NVIDIA (04/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - Site…

    Walmart (Sunnyvale, CA)
    …rotation to secure the system from issues. **What you'll do:** Site Reliability Engineers are hybrid systems and software engineers who are responsible and ... Reliability Engineers, we are a team of hybrid systems and software engineers who take ownership of ...Chef and Puppet + Build and drive the automation systems that maintain system health + Eliminate… more
    Walmart (04/19/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer - Site…

    General Motors (Mountain View, CA)
    …our customers, including fleet management, energy optimization, transportation logistics, safety systems , and more. To fulfill our mission, we are actively expanding ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements...and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an… more
    General Motors (04/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Machine Learning…

    ServiceNow, Inc. (Santa Clara, CA)
    …unlock new work experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
    ServiceNow, Inc. (05/09/25)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    Abbott (Pleasanton, CA)
    …for diversity, working mothers, female executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer ** As senior member of Site ... serve people in more than 160 countries. **Staff Site Reliability Engineer ** **Working at Abbott** At Abbott,...delivery, and implementation of highly complex and critical software systems . Expertise in the value and principles of SRE… more
    Abbott (02/16/25)
    - Save Job - Related Jobs - Block Source
  • Sr Site Reliability Engineer (App…

    Palo Alto Networks (Santa Clara, CA)
    …Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of ... of critical business and production issues **Your Experience** + 4+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering + 2+ years… more
    Palo Alto Networks (04/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Technical Program Manager, Quality…

    NVIDIA (Santa Clara, CA)
    As a Senior Technical Program Manager (Quality & Reliability ) - Silicon Solutions team, you will play a pivotal role in bridging the technical and management ... and diagnostic software + Able to understand the silicon manufacturing to system integration, test and reliability challenges and correlation of key… more
    NVIDIA (03/13/25)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... and Driverless Cars to cater to their infrastructure & systems needs. As an SRE, youll also be working...Science, Information Technology, or related field, or equivalent experience. - System admin and Windows admin experience in an on… more
    Insight Global (04/06/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Design Engineer

    SanDisk (Milpitas, CA)
    …RESPONSIBILITIES: Main responsibilities of the role focus on validation of memory system design on Sandisk's enterprise SSD products + In-depth understanding of NAND ... Design and development of test cases for new memory system firmware designs + Development and validation of data...of NAND management FW features + Perform end-of-life (EOL) reliability verification tests + Perform failure analysis on EOL… more
    SanDisk (04/05/25)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    MongoDB (San Francisco, CA)
    …to build next-generation, AI-powered applications. We are looking for an experienced Staff Engineer for our SRE, InfraSec team, to guide the security of our ... on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A...low-level fundamentals, and how they work together in complex systems Communication and Leadership Skills: + Strong ability to… more
    MongoDB (05/07/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Power Validation…

    NVIDIA (Santa Clara, CA)
    …places to work in the world. We are now looking for a Senior System Power Validation & Applications Engineer in the Datacenter System Engineering Team. ... solutions of density, performance, transient response, manageability, scalability, manufacturability, reliability , security, protection, and cost. You will gain a… more
    NVIDIA (03/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior System Firmware…

    NVIDIA (Santa Clara, CA)
    …scale. We are looking for a talented and experienced Datacenter CPU RAS ( Reliability , Availability, and Serviceability) firmware engineer . As the CPU RAS ... We are looking for a: Sr Software Engineer , RAS Firmware - Platform Software. NVIDIA's invention...the most thoughtful people in the world. NVIDIA DGX systems deliver the world's leading solutions for enterprise AI… more
    NVIDIA (05/07/25)
    - Save Job - Related Jobs - Block Source