• Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
    NVIDIA (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within ... NVIDIA. This group works with various other groups within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure needs. These cloud services provide almost… more
    NVIDIA (09/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play ... a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    Tarana Wireless (Milpitas, CA)
    …internet speeds worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer , you will help us ... manage software that runs on the cloud and remotely manages millions of radio devices. You will work on a team and be a main point of contact during off shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage… more
    Tarana Wireless (08/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our enterprise security ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
    NVIDIA (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer

    Google (Sunnyvale, CA)
    Senior Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Durham, NC, USA; Raleigh, NC, USA; +3 more; +2 more **Mid** ... meet some of our SREs. + Read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a software engineer more
    Google (10/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a Software Engineer more
    Google (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    Palo Alto Networks (Santa Clara, CA)
    …actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more
    Palo Alto Networks (10/03/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
    NVIDIA (10/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    Rubrik (Palo Alto, CA)
    …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
    Rubrik (08/07/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Reliability Engineer

    Abbott (Pleasanton, CA)
    …working mothers, female executives, and scientists. **The Opportunity** We're looking for a strong ** Senior Site Reliability Engineer (SRE)** who's ready ... and compliant with healthcare regulations-this is the role for you. As a Senior SRE, you'll work closely with engineering, QA, cybersecurity, and regulatory teams to… more
    Abbott (09/20/25)
    - Save Job - Related Jobs - Block Source
  • JR- Senior Software Engineer

    General Motors (Sunnyvale, CA)
    …+ Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements...and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an… more
    General Motors (09/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, Network Site

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a ... be doing: + Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture...Artificial Intelligence, and Autonomous Vehicles. If you're a creative engineer who enjoys autonomy and shares our passion for… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... data for reporting, alerting, monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... that develops and maintains sophisticated internal cloud provisioning products. The team works with various other business units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their… more
    Insight Global (09/09/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Quality & Reliability Engineer

    Amazon (Cupertino, CA)
    …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that… more
    Amazon (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Manufacturing Engineer , Trainium…

    Amazon (Cupertino, CA)
    …between the system engineering team and the ODM and CM partners. As a Senior Manufacturing Engineer you will engage with an experienced cross-disciplinary staff ... and CMs. As part of the Manufacturing, Quality and Reliability Team in AWS Annapurna Labs focused on Machine...provider. We are seeking a talented and motivated Manufacturing Engineer with a proven track record of implementing best… more
    Amazon (08/29/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …equivalent role at a high-growth or web-scale technology company Suggested Skills + Site Reliability Engineering (SRE) + Leadership + Large scale infrastructure ... in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior Electric Utility Engineer

    Silicon Valley Power (Santa Clara, CA)
    ** Senior Electric Utility Engineer ** Print (https://www.governmentjobs.com/careers/cityofsantaclaraca/jobs/newprint/4679157)  ** Senior Electric Utility ... Services (AWS), and NVIDIA. **The Positions** SVP is seeking dynamic and innovative Senior Electric Utility Engineer candidates to fill three (3) vacancies in… more
    Silicon Valley Power (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior , Software Engineer

    Walmart (Sunnyvale, CA)
    …of orders daily through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with ... criteria (for example, probability of failure, frequency of failure) to measure site reliability . Monitors site reliability conditions and new … more
    Walmart (08/15/25)
    - Save Job - Related Jobs - Block Source