- Zscaler (San Jose, CA)
- …speed and agility with a cloud-first strategy. We're looking for an experienced Staff Site Reliability Engineer to join our Government Cloud team, reporting ... Looking for (Minimum Qualifications)** + 5+ years of experience as a Site Reliability Engineer with expertise in Operations and Engineering. + Experience… more
- Zscaler (San Jose, CA)
- …+ 8+ years of experience working in infrastructure operations, DevOps, or site reliability roles + Demonstrated expertise in system observability, including ... strategy. We are seeking an experienced Senior Staff Infrastructure Operations Engineer to join our team. This critical role involves designing, implementing,… more
- The Clorox Company (Pleasanton, CA)
- …Interested? Join us to #IgniteYourCareer! Your role at Clorox: Are you an engineer ready to join a high performing team leading manufacturing projects from ideation ... Champion the integration of innovative technologies and best practices to enhance plant reliability , efficiency, and competitiveness. + Travel 50 - 75% to plant and… more
- NVIDIA (Santa Clara, CA)
- …automated, and secure production environments. We are seeking a deeply skilled Staff Site Reliability Engineer (SRE) to advance our enterprise security ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
- JPMorgan Chase (Palo Alto, CA)
- …You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer ** at JPMorgan Chase within the **Enterprise ... capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years applied experience. + Ability… more
- General Motors (Mountain View, CA)
- …or Mountain View, CA_ _three times per week, at minimum_ The Software Engineering Site Reliability Engineer is responsible for ensuring the reliability ... the future. + Collaborating with software development teams to ensure that reliability and scalability considerations are incorporated into the software design and… more
- NVIDIA (Santa Clara, CA)
- …impact on the world. We are seeking a highly skilled and experienced Network Site Reliability Engineer (SRE) to join our Enterprise Network Operations ... practice in network operations or related fields concentrating on automation & site reliability engineering. Familiarity with both enterprise and the data… more
- NVIDIA (Santa Clara, CA)
- …We take great pride in providing excellent, comprehensive support to our customers! Sr Site Reliability Engineer in this role will significantly impact and ... in Computer Science or related field. + 8+ years of experience in site reliability engineering and/or software development roles. + Fluency in Python + In-depth… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. ... This group works with various other groups within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure needs. These cloud services provide almost half a… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial ... role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build automation tools,… more
- Tarana Wireless (Milpitas, CA)
- …worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer , you will help us manage software that ... runs on the cloud and remotely manages millions of radio devices. You will work on a team and be a main point of contact during off shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage environments in AWS… more
- MongoDB (San Francisco, CA)
- …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... join the Fabric team. This role is pivotal in building and maintaining the robust infrastructure necessary for secure and efficient communication between our services. As an SRE on the Fabric team, you will leverage your expertise in networking, distributed… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** As a Site Reliability Operations Engineer within the Global Technology Platforms (GTP) Command and Control Center ... micro-services, tools, and processes that will ensure highest levels of availability and reliability of Walmart's technology stack. You're right for the job if you… more
- Google (Sunnyvale, CA)
- Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and decision ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a Software Engineer… more
- Google (Sunnyvale, CA)
- Senior Systems Engineer , Site Reliability Engineering, Google Cloud _corporate_fare_ Google _place_ Sunnyvale, CA, USA; New York, NY, USA **Mid** Experience ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Palo Alto Networks (Santa Clara, CA)
- …is the market leader in this space. We are seeking development heavy Site Reliability Engineers to design, build, maintain, and scale production services ... architecture to improve scalability in networking like BGP, OSPF, service reliability , capacity, and performance + Collaborate with development teams to ensure… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... architectures and identify opportunities for containerization to improve scalability, reliability , and efficiency. + Strong analytical skills with the ability… more
- Palo Alto Networks (Santa Clara, CA)
- …automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, ... GitLab CI/CD, GitOps, Prometheus, Grafana, Loki, Docker, GCP, Backstage, MySQL, PagerDuty, FireHydrant, Python, Bash, Java, NodeJS and Go. **Your Impact** + **Design, build, and operate** reliable, secure Cloud infrastructure across multi-cloud environments… more