- Global Payments, Inc. (Atlanta, GA)
- …role oversees a dedicated team of SREs and Support Engineers responsible for monitoring , incident response, and the stability of AI services in production. You will ... of initiatives with evolving business strategies, at a global level. RESPONSIBILITIES Lead the SRE and Support function for AI production systems, including LLM… more
- PCR Staffing (Concord, NC)
- …of Palo Alto and/or FortiGate firewall management and troubleshooting . Expertise in monitoring and observability tools , including Grafana and Datadog . ... a Site Reliability Engineer (SRE) with deep expertise in AWS networking , infrastructure automation , and production system...automation , and production system reliability and ability to lead when needed . This role is NOT an… more
- Eli Lilly and Company (Indianapolis, IN)
- …What You Should Bring: Extensive observability background, OpenTelemetry, AWS knowledge, Kubernetes, DevOps practices, monitoring tools (Splunk, AppDynamics, ... Software Product Engineering team is actively looking for a Lead Site Reliability Engineer (SRE). Are you ready to...Are you passionate about technology with extensive experience in observability , AWS , Kubernetes, and DevOps practices? If… more
- UKG, Inc. (Fort Lauderdale, FL)
- …leveraging observability tools such as Datadog and Grafana for production monitoring . *Experience with modern cloud technology (GCP, AWS , Azure, Kubernetes, ... vision reflect our principles and standards. *Ensure effective adoption of observability tools for proactive alerting of production performance issues, adopt… more
- Wayfair (Boston, MA)
- …a direct impact on Wayfair's operational efficiency and brand loyalty. What You'll Do: Lead and manage a team of 6 engineers focused on improving agent experience ... support. Drive engineering execution including architectural design, development, testing, monitoring , and deployment, with a strong emphasis on reliability and… more
- Xperi (San Jose, CA)
- …technologies for data search, analysis, and visualization. Skills in software monitoring and observability . Practical knowledge of the Software Development ... and object-oriented programming concepts. Experience with cloud platforms such as AWS , Azure, or Google Cloud is a plus. Microservices development experience… more
- Alchemy (New York, NY)
- …and cloud-native tools. Build and maintain automation pipelines for deployments, monitoring , and incident response. Lead incident management, conducting root ... cause analyses and driving continuous reliability enhancements. Develop observability frameworks using Prometheus, Grafana, ELK, and tracing tools like Jaeger or… more
- Palo Alto Networks (Santa Clara, CA)
- …on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability. Our Infrastructure Platform ... and automation frameworks Automate robust deployment of robust services Orchestrate end-to-end monitoring and alerting Participate with SRE and Dev teams in the… more
- Fannie Mae (Washington, DC)
- …as well as coach and mentor team members. *THE IMPACT YOU WILL MAKE* The * Lead AWS Monitoring & Observability Engineer - AWS & APM Tools* role will ... on our team, you will act as the team lead in designing and developing advanced solutions for information...of hands-on experience managing the Monitoring and Observability platform using Splunk/ Dynatrace/ Open Telemetry/ AWS… more
- Fannie Mae (Reston, VA)
- …errors all while operating under limited supervision. *THE IMPACT YOU WILL MAKE* The *Senior Monitoring & Observability Engineer - AWS & APM Tools* role will ... * 1+ years of hands-on experience managing/ using the Monitoring and Observability platform using Splunk/ Dynatrace/ Open Telemetry/ AWS Cloudwatch in a… more
- Truist (Dallas, TX)
- …States of America) **Please review the following job description:** The Head of Observability and Monitoring will lead the strategy, architecture, and ... implementation of observability , monitoring , and telemetry capabilities within a regulated banking environment....metrics, and distributed tracing across the Bank's technology stack. Lead the design and deployment of monitoring … more
- HUB International (Chicago, IL)
- …seeking an experienced and strategic Manager, Observability & Automation to lead the evolution of our monitoring , alerting, and automation capabilities ... scale our rapidly expanding business. **Core Responsibilities for this Role:** + ** Observability Leadership** + Lead the deployment, management, and maintenance… more
- CVS Health (Hartford, CT)
- …technologies within the existing environment. Work with partners to migrate legacy monitoring to modern solutions. Work with the observability engineering team ... existing or developing new solutions. **Platform Management:** Manage and administer observability and event management platforms. Lead system upgrades,… more
- Marriott (Bethesda, MD)
- … lead the global strategy and execution of **Enterprise Observability and Technology Resiliency & Recoverability** across Marriott's global technology landscape. ... visibility, prevent outages, and ensure recoverability * Architect and operationalize observability solutions across AWS , Azure, Alibaba, and hybrid cloud… more
- Prime Therapeutics (Washington, DC)
- … observability and monitoring . We are looking for a senior Dynatrace Observability Monitoring Engineer (Sr. IT Systems Engineer) to build and mature ... Configuring Alerts and High-end business dashboards. + Setting up Observability monitoring Using OTel to capture all...SLI's) + Adding Extensions and performing Integration to all AWS , GCE and Azure services. + Work in Rotational… more
- CVS Health (Providence, RI)
- …to join our technology team and work on a tight-knit team leading enterprise observability governance efforts. Reporting to the Lead Director of Observability ... with business goals and deliver value to stakeholders. + **Program Management:** Lead and manage observability programs, including project planning, execution,… more
- OneMain Financial (Baltimore, MD)
- We are seeking an experienced and dynamic Vice President - Director to lead observability across the Technology organization. The ideal candidate will have a ... troubleshoot, and optimize their systems. This Senior leader will lead the strategy and execution of a technical roadmap...and vision. Develop and articulate the product vision for observability and monitoring products. + Define and… more
- General Motors (Roswell, GA)
- …and performance of software systems. Their job profile includes: + **System Monitoring and Troubleshooting:** Monitoring the performance and availability of ... infrastructure to streamline software deployment, configuration management, and system monitoring . + **Performance Optimization:** Analyzing system performance, identifying bottlenecks,… more
- Amazon (Seattle, WA)
- …- come help us make history! What you will do: You will build and lead a team building large-scale monitoring and real-time telemetry systems for operating an ... Description Amazon Web Services ( AWS ) Hardware Engineering team creates compute and storage...systems - * 3+ years of experience with telemetry, observability and monitoring of cloud services Amazon… more
- General Motors (Roswell, GA)
- …you will develop and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an innovation first team, and we ... You'll Do** + Implement scalable, reliable, secure SRE and Observability platform to monitor health of our production system...5+ years of hands-on SRE experience (software development, systems monitoring ) with at least one of the public cloud… more