- Discover (Houston, TX)
- …emergency response and capacity planning of their services. As an Application Site Reliability Engineer ( SRE ) you will be part of team of people who are ... the way with a rewarding career. Site Reliability Engineering ( SRE ) is a set of principles and practices that...to enforce best practices. + Lead failure point discussions, chaos testing and family level capacity management. + Responsible… more
- Discover (Riverwoods, IL)
- …products. This is where you come in. We need a Principal Application Reliability Engineer who's seeking an opportunity to make a positive impact. You will partner ... the teams to help build reliability thinking. + Lead failure point discussions, chaos testing and family level capacity management. + Responsible for family level… more
- Discover (Riverwoods, IL)
- …achieve yours along the way with a rewarding career. As a Principal DevOps/Reliability Engineer , you will have an opportunity to make a positive impact across the ... observability gaps, leading problem management, and driving capacity planning. The Engineer uses a vast repertoire of experience delivering high impact engineering… more
- Discover (Houston, TX)
- …emergency response and capacity planning of their services. As an Application Site Reliability Engineer ( SRE ) you will be part of team of people who are ... the way with a rewarding career. Site Reliability Engineering ( SRE ) is a set of principles and practices that...Discover + Partner with Application Development teams to build resiliency into our critical websites and mobile application +… more
- Federal Reserve System (Boston, MA)
- …and should be discussed during the interview process. **Responsibilities** + As a Senior Engineer of the SRE / Production Operations team for FedNow, you will ... Resiliency , DR and BCP (including testing) + The SRE / Production Operations team is part of the...Fault Injection tooling(ie AWS Fault Injection Simulator, Gremlin, ChaosToolkit, Chaos Monkey) + Best practices in chaos … more
- Wells Fargo (Charlotte, NC)
- …) introduce continuous improvement, standardization/automation, capabilities to conduct destructive and resiliency testing + Automate key SRE metrics and IT ... **About the Role** We are looking for an SRE who enjoys and thrives on solving complex...to avoid a major incident. + Design self- healing resiliency patterns to improve the reliability of the environment… more
- Discover (Riverwoods, IL)
- …emergency response and capacity planning of their services. As an Application Site Reliability Engineer ( SRE ) you will be part of team of people who are ... achieve yours along the way with a rewarding career. Site reliability engineering ( SRE ) is a set of principles and practices that incorporates aspects of software… more
- JPMorgan Chase (Columbus, OH)
- …of public cloud platforms and technologies + Organize and run game days, resiliency tests and chaos engineering exercises + Utilize programming languages like ... As a Cloud Solutions Lead Software Engineer at JPMorgan Chase within the Infrastructure Production Management, you will play a crucial role in the Public Cloud… more
- Splunk (CO)
- …with low operational burden by managing and improving the reliability and resiliency of SRE -managed services and infrastructure. You thrive on automation, ... + HA, Business Continuity Planning, disaster recovery, backup/restore, RTO, RPO + Chaos engineering + Application uptime and performance + Capacity management &… more
- Wells Fargo (Charlotte, NC)
- …70 million global customers. Wells Fargo Bank NA seeks a **Lead Software Engineer ** in Charlotte, NC. **Job Role and Responsibility:** Apply technology background in ... software engineering, SRE , multi-Cloud platform management, DevOps, CI/CD, Observability, and Continuous Testing to deliver and introduce new technology capabilities… more