- MongoDB (San Francisco, CA)
- …MongoDB to build next-generation, AI-powered applications. We are looking for an experienced Lead for our SRE, InfraSec team, to guide the security of our ... cloud-based infrastructure. As a Lead SRE, you will be very hands-on technically while...of SREs. The InfraSec team collaborates closely with other engineering teams to ensure that our infrastructure adheres to… more
- NVIDIA (Santa Clara, CA)
- …a related technical field, or equivalent experience. + 10+ overall years of experience in Site Reliability Engineering, DevOps, or a similar role, with at ... doing: + Recruit, develop, and inspire a team of Site Reliability Engineers, fostering a strong culture...ELK Stack, Splunk, Jaeger, etc. + Demonstrated ability to lead and mentor engineering teams, fostering a… more
- General Motors (Mountain View, CA)
- …of the systems they are working on. We believe in setting a high bar for engineering managers who can lead by example in both technical expertise and people ... OR IN THE FUTURE. **The Role** As an SRE Engineering Manager, you will be expected to not only...tools and software to automate operational processes, improve system reliability, and reduce manual intervention. + Lead,… more
- Google (Sunnyvale, CA)
- …Computer Science or Engineering. + 1 year of people management experience. Site Reliability Engineering (SRE) combines software and systems ... grow. To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/sre/book.html) or...a Software Engineer chose to join SRE. As an Engineering Manager, you'll lead a team and… more
- Google (San Francisco, CA)
- …2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems. Site Reliability Engineering (SRE) combines software and ... grow. To learn more: check out our books on Site Reliability Engineering (https://landing.google.com/sre/book.html) or...or service operations and quality. + Participate in, or lead design reviews with peers and stakeholders to decide… more
- Google (Sunnyvale, CA)
- …systems. + Excellent problem-solving skills for monitoring and troubleshooting serving systems. Site Reliability Engineering (SRE) combines software and ... Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of… more
- Google (Sunnyvale, CA)
- …and technologies. + Experience in building large-scale operations capabilities in Site Reliability Engineering. Google Cloud's software engineers ... scaling from small to large deployments. As a Technical Lead, you will define the operations engineering ...Technical Lead, you will define the operations engineering strategy for these solutions, working with engineering… more
- MongoDB (San Francisco, CA)
- …office, we provide hybrid work accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking ... Toyota, trust MongoDB to build next-generation, AI-powered applications. **The Team** Platform Engineering is the department within SRE that is responsible for a… more
- SanDisk (Milpitas, CA)
- …keep our world moving forward. **Job Description** We are seeking a Principal Engineer, Reliability Engineering to join our team in Milpitas, United States. In ... product quality and performance. ESSENTIAL DUTIES AND RESPONSIBILITIES: + Lead and mentor a team of reliability ...Stay current with industry standards and best practices in reliability engineering, and implement them within the… more
- Amazon (Cupertino, CA)
- …develop into a better-rounded professional. Basic Qualifications - Bachelor's degree in Reliability Engineering, Physics, Material Science or related field, or ... solid understanding of computer systems to influence design for reliability. - Lead identifying and validating product/component...equivalent experience - 5+ years of Reliability Engineering work experience with server platforms… more
- Lockheed Martin (Sunnyvale, CA)
- …the security and integrity of the classified system **Basic Qualifications:** * Experience in site reliability engineering, DevOps, or a related field, with ... and handled in accordance with classified system requirements * Lead and participate in incident response and post-incident reviews...DoD environment (RMF, STIG, or NISPOM) * Certification in site reliability engineering, DevOps, or… more
- General Motors (Mountain View, CA)
- …Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability ... and provide inputs in architecture, infrastructure resources, observability to achieve reliability and scalability goals. + Collaborate with engineering teams… more
- Palo Alto Networks (Santa Clara, CA)
- …applications in the Kubernetes cluster with autoscaling enabled + Experience in Production Engineering, DevOps, or Site Reliability + Expertise in the ... is one of the largest GCP customers. As a Site Reliability Engineer, you will be part...SRE and Dev teams in the on-call rotation + Lead root cause analysis of critical business and production… more
- Cornerstone onDemand (Dublin, CA)
- We are seeking a highly skilled Site Reliability Engineer with 3 years of experience to join our dynamic team. The ideal candidate will have a strong background ... on designing, implementing, and managing cloud-based solutions. As a Site Reliability Engineer, you will play a...our cloud infrastructure. **In this role you will:** + Lead the day-to-day technical operations, providing the highest levels… more
- Cornerstone onDemand (Dublin, CA)
- We are seeking a highly skilled **Site Reliability Engineer** with 3 years of experience to join our dynamic team. The ideal candidate will have a strong ... on designing, implementing, and managing cloud-based solutions. As a Site Reliability Engineer, you will play a...our cloud infrastructure. **In this role you will:** + Lead the day-to-day technical operations, providing the highest levels… more
- Palo Alto Networks (Santa Clara, CA)
- …in configuration management with a framework such as Terraform, Helm + Experience in Site Reliability Engineering, Production Engineering, or DevOps + ... This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab… more
- NVIDIA (Santa Clara, CA)
- …equivalent experience. + Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or ... team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our...of the network infrastructure, ensuring its high availability and reliability. + Partnering with architecture and deployment teams to… more
- LiveRamp (San Francisco, CA)
- …setting up production and internal environments** + **Provide 24/7 first line of Engineering support (via follow the sun teams in all regions) for any issues ... operations support.** + **Drive effective resolutions of core product issues with Engineering teams** + **Setup and maintain Infrastructure & Product Reliability… more
- Celonis (Redwood City, CA)
- …resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system ... a highly technical, collaborative, and innovation-driven team that blends Site Reliability Engineering with modern... practices to build resilient and scalable systems. + Lead reliability efforts for a fleet of… more
- MongoDB (San Francisco, CA)
- …a small team of SREs. The InfraSec team collaborates closely with other engineering teams to ensure that our infrastructure adheres to the highest security ... actual implementation. **Responsibilities:** Cloud Security Design and Implementation: + Help lead the design and deployment of security solutions for cloud… more