• Senior Customer Reliability

    Google (Sunnyvale, CA)
    Senior Customer Reliability Engineer , Reliability Incident Management _corporate_fare_ Google _place_ New York, NY, USA; Austin, TX, USA; +2 more; +1 ... use technology to connect with customers, employees and partners. As a Senior Customer Reliability Engineer , you will be a pivotal individual contributor on… more
    Google (10/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
    NVIDIA (09/17/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. ... This group works with various other groups within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure needs. These cloud services provide almost half a… more
    NVIDIA (09/11/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a ... crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build automation… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    Tarana Wireless (Milpitas, CA)
    …speeds worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer , you will help us manage software ... that runs on the cloud and remotely manages millions of radio devices. You will work on a team and be a main point of contact during off shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage environments in… more
    Tarana Wireless (08/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …demands robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
    NVIDIA (09/30/25)
    - Save Job - Related Jobs - Block Source
  • Senior Systems Engineer , Site…

    Google (Sunnyvale, CA)
    Senior Systems Engineer , Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to build and run… more
    Google (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Reliability Engineer

    Microsoft Corporation (Sunnyvale, CA)
    …system health and operational quality at scale. We are seeking a **Site Reliability Engineer ** within the Firmware Deployment team, you will be instrumental ... Your efforts in deploying and managing firmware updates will ensure the reliability and efficiency of Azure's hardware infrastructure. By focusing on stability and… more
    Microsoft Corporation (10/19/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Quality & Reliability Engineer

    Amazon (Cupertino, CA)
    …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna...future technologies. * Drive manufacturing process improvements to address reliability issues and concerns. * Qualify manufacturing lines and… more
    Amazon (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    Google (Sunnyvale, CA)
    Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Advanced** Experience ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/site- reliability -engineering-profile-google/) about why a Software Engineer chose… more
    Google (09/27/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    …for a passionate member to join our DGX Cloud Engineering Team as a Sr. Site Reliability Engineer . In this role, you will play a significant part in helping to ... and maintaining a high standard of perfection in service operability and reliability . + Design, build, and implement scalable cloud-based systems for PaaS/IaaS. +… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability

    NVIDIA (Santa Clara, CA)
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
    NVIDIA (10/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Reliability Engineer

    Abbott (Pleasanton, CA)
    …mothers, female executives, and scientists. **The Opportunity** We're looking for a strong ** Senior Site Reliability Engineer (SRE)** who's ready to roll ... and compliant with healthcare regulations-this is the role for you. As a Senior SRE, you'll work closely with engineering, QA, cybersecurity, and regulatory teams to… more
    Abbott (09/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior Reliability Methodology…

    NVIDIA (Santa Clara, CA)
    …that are groundbreaking in AI and computing. What you'll be doing: As a Reliability Methodology Engineer at NVIDIA, you will be responsible for ensuring our ... design, product, and test engineering teams to apply DFT methodologies to improve reliability screening specific to HTOL (Component level Hight Temp Op. Life Test).… more
    NVIDIA (07/31/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Software Engineer

    LinkedIn (Mountain View, CA)
    …community while making a real impact within our company. As a Sr. Staff Software Engineer , you will be a key technical leader and role model within the organization. ... Suggested Skills: + Distributed Systems + Technical Leadership + Infrastructure Reliability + Systems Infrastructure + Java/Golang/Rust/Python You will Benefit from… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • Senior System ASIC Engineer - Speed…

    NVIDIA (Santa Clara, CA)
    …and board designers, software/firmware engineers, HW/SW applications engineering, process/ reliability specialists, DFx engineers, ATE engineers, product managers, ... our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!… more
    NVIDIA (08/09/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy ... enable the right business decisions around improving quality and reliability of our services and products + Act as...availability and performance + Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • Principal Staff Site Reliability

    NVIDIA (Santa Clara, CA)
    …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... data for reporting, alerting, monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
    NVIDIA (08/21/25)
    - Save Job - Related Jobs - Block Source
  • Senior Manager, Network Site…

    NVIDIA (Santa Clara, CA)
    GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and operations. We are looking for a leader ... be doing: + Cultivate a top-performing team of Network Site Reliability Engineers through encouraging a culture of collaboration, accountability, and technical… more
    NVIDIA (08/08/25)
    - Save Job - Related Jobs - Block Source
  • Site Reliability Engineer

    Insight Global (Santa Clara, CA)
    …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... that develops and maintains sophisticated internal cloud provisioning products. The team works with various other business units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their… more
    Insight Global (09/09/25)
    - Save Job - Related Jobs - Block Source