- Zscaler (San Jose, CA)
- …+ 8+ years of experience working in infrastructure operations, DevOps, or site reliability roles + Demonstrated expertise in system observability, including ... speed and agility with a cloud-first strategy. We are seeking an experienced Senior Staff Infrastructure Operations Engineer to join our team. This critical… more
- MongoDB (San Francisco, CA)
- …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... join the Fabric team. This role is pivotal in building and maintaining the robust infrastructure necessary for secure and efficient communication between our services. As an SRE on the Fabric team, you will leverage your expertise in networking, distributed… more
- MongoDB (San Francisco, CA)
- We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Staff SRE, ... with a strong focus on security work, with ideally 2+ years in a senior or staff engineering role Security Mindset: + A comprehensive understanding of all facets… more
- NVIDIA (Santa Clara, CA)
- …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play ... a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build… more
- NVIDIA (Santa Clara, CA)
- …secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our enterprise security ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
- NVIDIA (Santa Clara, CA)
- …We take great pride in providing excellent, comprehensive support to our customers! Sr Site Reliability Engineer in this role will significantly impact and ... in Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python + In-depth… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Abbott (Pleasanton, CA)
- …working mothers, female executives, and scientists. **The Opportunity** We're looking for a strong ** Senior Site Reliability Engineer (SRE)** who's ready ... and compliant with healthcare regulations-this is the role for you. As a Senior SRE, you'll work closely with engineering, QA, cybersecurity, and regulatory teams to… more
- Google (Sunnyvale, CA)
- Senior Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving progress, solving ... meet some of our SREs. + Read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a software engineer … more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Advanced** ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a Software Engineer… more
- Google (Sunnyvale, CA)
- Senior Systems Engineer , Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Mid** Experience driving ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to… more
- Google (Sunnyvale, CA)
- Senior Customer Reliability Engineer , Reliability Incident Management _corporate_fare_ Google _place_ New York, NY, USA; Austin, TX, USA; +2 more; +1 ... years of experience in a technical role such as Site Reliability Engineering, Technical Solutions Engineering, or...to connect with customers, employees and partners. As a Senior Customer Reliability Engineer , you… more
- NVIDIA (Santa Clara, CA)
- …Support), you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other ... engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary levels of support for our… more
- Walmart (Sunnyvale, CA)
- …infrastructure management, or related area., SRE certification (for example, IBM Cloud Site Reliability Engineer )., We value candidates with a ... and household necessities **Qualifications** * 16+ years of experience in Site Reliability Engineering, Production Engineering, and Infrastructure Reliability… more
- General Motors (Sunnyvale, CA)
- …Go, or Groovy + On-call and fire-fighting experience + Experience with modern site reliability practice including but not limited to post mortem, SLO/SLI, ... Tracing, Synthetic monitoring, etc. **What Will Give You A Competitive Edge (Preferred Qualifications)** + You have experience managing Azure cloud platform and are the domain expert + You are well known for being customer focused. + You had demonstrated low… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... data for reporting, alerting, monitoring. + Collaborate with NVIDIA leadership, senior engineers, program managers, and product managers to develop compelling IT… more
- Insight Global (Santa Clara, CA)
- …fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior SRE Engineer . The position will be part of a fast-paced crew ... that develops and maintains sophisticated internal cloud provisioning products. The team works with various other business units such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their… more
- Genentech (South San Francisco, CA)
- …**Technical Leadership & Industry\Network Engagement:** + The Senior Principal Mechanical Engineer will be the SSF site expert on industry HVAC/Plumbing, BAS ... approaches which improve building systems performance, cost effectiveness, and reliability . + This role will lead the development of...meet expectations. + The Senior Principal Mechanical Engineer will also be the SSF site … more
- Amazon (Cupertino, CA)
- …designs cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that… more
- LinkedIn (Mountain View, CA)
- …equivalent role at a high-growth or web-scale technology company Suggested Skills + Site Reliability Engineering (SRE) + Leadership + Large scale infrastructure ... in Sunnyvale, CA or San Francisco, CA. **Responsibilities** + Serve as a senior technical leader driving the long-term reliability and observability strategy… more