- MongoDB (New York, NY)
- …VictoriaMetrics, Splunk, QuickWit, Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also ... to build next-generation, AI-powered applications. **Team and Role Overview** The SRE Observability team is part of the larger Platform Engineering organization, and… more
- Regions Bank (Hoover, AL)
- …logging into the careers section of the system. **Job Description:** At Regions, the Site Reliability Engineer is responsible for ensuring the dependability ... Solid familiarity with Splunk, Elastic, OpenSearch, Prometheus, Grafana + Implementing Site Reliability Engineering (SRE) principles SLO/SLI + Experience… more
- Cisco (VA)
- …fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, ... applications with low operational burden by handling and improving the reliability and resiliency of SRE-managed services and infrastructure. You thrive on… more
- Cisco (CA)
- …a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers ... Splunk Cloud. **Meet the Products and Technology Team** Want to build security and observability products people love AND work with people as smart (and humble) as… more
- Palo Alto Networks (Santa Clara, CA)
- …engineer with a passion for technology and a strong motivation for high reliability at the service level. + Observability Tools: High proficiency with Thanos, ... including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides,… more
- General Motors (Roswell, GA)
- …respective innovation centers three times per week._ **_The Role:_** The Software Engineering Site Reliability Engineer (SRE) is responsible for ensuring the ... health. + Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and … more
- General Motors (Mountain View, CA)
- …health. + Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements...to analyze and provide inputs in architecture, infrastructure resources, observability to achieve reliability and scalability goals.… more
- UKG (Ultimate Kronos Group) (Weston, FL)
- …united by purpose, inspired by you. About the Team: We are seeking a Principal Observability and Reliability Tooling Engineer to lead cost-effective ... In this role, you will play a crucial part in enhancing our observability framework, ensuring robust monitoring and alerting practices that leverage the pillars of… more
- CVS Health (Hartford, CT)
- …skilled Staff Observability Operations Engineers with a strong background in Site Reliability Engineering (SRE), modern observability practices, and the ... DevOps Institute Observability Foundation + DevOps Institute Site Reliability Engineering Foundation or Practitioner +...Health job opportunities Join CVS Health as a Staff Observability Operations Engineer and contribute to our… more
- UCLA Health (Los Angeles, CA)
- …ensuring their reliability, performance, and security. As an Analytics Observability Engineer, you will design, implement, and maintain observability ... Description UCLA Health IT is seeking an exceptional Analytics Observability Engineer to join the Solutions Architecture and Engineering (SAE) team. The SAE… more
- Eliassen Group (Greenwood Village, CO)
- …be a hybrid on-site/remote schedule in Englewood, CO. The Observability Engineer will contribute significantly to planning, implementing, and maintaining ... **Observability Engineer** **Greenwood Village, CO** **Type:**... dashboards in Splunk, Datadog, and Grafana. + Builds observability artifacts that monitor systems performance, reliability,… more
- JPMorgan Chase (Plano, TX)
- …Angular. + Strong sense of accountability and commitment to problem solving. + Site Reliability Engineering (SRE) Principles. + Demonstrated ability to plan and ... the limits of what's possible. As a Lead Software Engineer at JPMorgan Chase within the Consumer & Community...levels. + Collaborates with others to create and implement observability and reliability designs for complex systems… more
- Amazon (Seattle, WA)
- …an at-scale storage platform, including designing and building automation, tooling, observability and data analytics solutions. You will collaborate with internal ... class storage solutions. We are looking for an experienced Software Development Engineer to build and operate large-scale distributed software systems for monitoring… more
- Amazon (Seattle, WA)
- …language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a ... and operating distributed systems - * 3+ years of experience with telemetry, observability and monitoring of cloud services Amazon is an equal opportunity employer… more
- Bank of America (Charlotte, NC)
- Site Reliability Engineer Lead Charlotte, North Carolina **To proceed with your application, you must be at least 18 years of age.** Acknowledge Refer a ... and technology teams to implement measures prescribed by the Site Reliability Engineer teams it...production systems. This role is critical to ensuring the reliability, observability, and performance of our enterprise-scale… more
- Microsoft Corporation (Atlanta, GA)
- …services and working with some of Microsoft's most critical customers? We're looking for a **Site Reliability Engineer II** with the right mix of software ... and accountability for application architecture, system design, and end-to-end implementation. As a Site Reliability Engineer, you will identify and deliver… more
- Bright Horizons (Newton, MA)
- The Principal Site Reliability Engineer (Principal SRE) plays a pivotal role in ensuring the seamless and reliable operation of an organization's digital ... preventing and mitigating potential issues. This role will report to the Director of Site Reliability Engineering, and will help foster a culture of innovation,… more
- Huntington National Bank (Columbus, OH)
- Description Summary: As a Site Reliability Engineer (SRE) Level II, you will play a key role in maintaining the availability, scalability, and performance of ... related field, or equivalent work experience. + 3 years of experience in site reliability engineering, DevOps, systems administration, or related roles. + Proven… more
- General Motors (Des Moines, IA)
- …for automakers and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General Motors, you'll join a dedicated ... : Develop tools and software to automate operational processes, improve system reliability, and reduce manual intervention. + **Observability and Monitoring**:… more
- UKG (Ultimate Kronos Group) (Lowell, MA)
- …whatever gives you purpose. We're united by purpose, inspired by you. About the Team: Site Reliability Engineers at UKG are critical team members that have a ... planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation. Site Reliability Engineers must be passionate about learning and… more