- Fiserv (Sunnyvale, CA)
- …to make an impact on a global scale, come make a difference at Fiserv. **Job Title** Site Reliability Engineer (SRE) **What does a successful Site ... Reliability Engineer do at Fiserv?** A successful Site Reliability Engineer at Fiserv leverages software engineering and operations discipline to… more
- JPMorgan Chase (Palo Alto, CA)
- …You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer ** at JPMorgan Chase within the **Enterprise ... capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years applied experience. + Ability… more
- Cornerstone onDemand (Dublin, CA)
- We are seeking a highly skilled Site Reliability Engineer with 3 years of experience to join our dynamic team. The ideal candidate will have a strong ... with a focus on designing, implementing, and managing cloud-based solutions. As a Site Reliability Engineer , you will play a key role in ensuring the… more
- NVIDIA (Santa Clara, CA)
- …culture? If so, we have a great opportunity for you! NVIDIA is seeking a Senior Site Reliability Engineer (SRE) for the Data Science & ML Platform(s) team. ... now! What you'll be doing: + Develop software solutions to ensure reliability and operability of large-scale systems supporting machine-critical use cases. + Gain… more
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big ... comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define and document standard methodologies… more
- Palo Alto Networks (Santa Clara, CA)
- …runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer , you will be part of a team supporting the ... cluster with autoscaling enabled + Experience in Production Engineering, DevOps, or Site Reliability + Expertise in the public cloud (GCP or AWS), especially in… more
- NVIDIA (Santa Clara, CA)
- …impact on the world. NVIDIA is looking to hire a deeply technical and creative Site Reliability Engineer to build, support and maintain the next generation ... challenges, automate processes, and iterate for efficiency + Tackle systemic reliability issues with multi-functional teams. + Monitor, optimize, and manage system… more
- ServiceNow, Inc. (Santa Clara, CA)
- …experiences in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the design, development ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
- MongoDB (San Francisco, CA)
- …office, we provide hybrid work accommodation. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) Lead with a strong networking ... background to join the Fabric team. This role is pivotal in building and maintaining the robust infrastructure necessary for secure and efficient communication between our services. As the lead SRE on the Fabric team, you will leverage your expertise in… more
- Palo Alto Networks (Santa Clara, CA)
- …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 4+ years of experience ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...as a DevOps/SRE engineer with a passion for technology and a strong… more
- Palo Alto Networks (Santa Clara, CA)
- …configuration management with a framework such as Terraform, Helm + Experience in Site Reliability Engineering, Production Engineering, or DevOps + Passion for ... large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the… more
- NVIDIA (Santa Clara, CA)
- …experience. + Minimum of 8 years of industry experience in network site reliability engineering, network automation, network operations, or related areas. ... for our network infrastructure. We are looking for an engineer who is passionate about the network and making...of the network infrastructure, ensuring its high availability and reliability . + Partnering with architecture and deployment teams to… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Celonis (Redwood City, CA)
- …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability , ... for that, we need you to join us. **The Team** As a member of our Reliability Engineering team, you will play a critical role in ensuring the health, performance,… more
- Palo Alto Networks (Santa Clara, CA)
- …team to influence the operability of the product and ensure the reliability and availability of our services **Your Experience** + DevOps/SRE Expertise: 5+ ... years of experience as a DevOps/SRE engineer with a passion for technology and a strong...passion for technology and a strong motivation for high reliability at the service level + Observability Tools: High… more
- Rubrik (Palo Alto, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Palo Alto Networks (Santa Clara, CA)
- …automation, architecture, performance, observability, troubleshooting, security, and reliability . Our Infrastructure Platform stack includes Terraform, Kubernetes, ... GitLab CI/CD, GitOps, Prometheus, Grafana, Loki, Docker, GCP, Backstage, MySQL, PagerDuty, FireHydrant, Python, Bash, Java, NodeJS and Go. **Your Impact** + Design, build, and operate reliable, secure Cloud infrastructure across multi-cloud environments +… more
- MongoDB (San Francisco, CA)
- …to build next-generation, AI-powered applications. We are looking for an experienced Staff Engineer for our SRE, InfraSec team, to guide the security of our ... cloud-based infrastructure. As a Staff SRE, you will be very hands-on technically while also mentoring a small team of SREs. The InfraSec team collaborates closely with other engineering teams to ensure that our infrastructure adheres to the highest security… more
- MongoDB (San Francisco, CA)
- …and work location. Salary is one part of MongoDB's total compensation and benefits package. Other benefits for eligible employees may include: equity, participation ... in the employee stock purchase program, flexible paid time off, 20 weeks fully-paid gender-neutral parental leave, fertility and adoption assistance, 401(k) plan, mental health counseling, access to transgender-inclusive health insurance coverage, and health… more