• UKG, Inc. (Lowell, MA)
    …About the Team: We are seeking aPrincipal Observability and Reliability Tooling Engineer to lead cost-effective observability initiatives and drive the ... you will play a crucial part in enhancing our observability framework, ensuring robust monitoring and alerting...highly available platforms for metrics, logging, tracing, real user monitoring , and synthetic monitoring . Lead more
    Upward (07/04/25)
    - Save Job - Related Jobs - Block Source
  • Dallas County (Dallas, TX)
    …streaming, and event-based data architectures. Supports data operations through monitoring , logging, and automated incident management. Enforces data governance, ... applications. Collaborates with DevOps teams to implement CI/CD and observability pipelines. Maintains operational readiness through automated recovery and version… more
    Upward (07/03/25)
    - Save Job - Related Jobs - Block Source
  • JPMorgan Chase & Co. (Chicago, IL)
    …significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the CIB - Global ... on the technical and business issues facing them. Take lead and conduct resiliency design reviews, break up complex...one or more technical disciplines Proficiency and experience in observability such as white and black box monitoring more
    Upward (07/10/25)
    - Save Job - Related Jobs - Block Source
  • MongoDB (Palo Alto, CA)
    …next-generation, AI-powered applications. About the Role We're looking for a Staff Engineer to join our team building the inference platform for embedding models ... integrated into Atlas and optimized for developer experience. As a Staff Engineer , you'll be hands-on with design and implementation, while working with engineers… more
    Upward (07/13/25)
    - Save Job - Related Jobs - Block Source
  • Alchemy (New York, NY)
    …and cloud-native tools. Build and maintain automation pipelines for deployments, monitoring , and incident response. Lead incident management, conducting root ... University, Coinbase, and Charles Schwab, among others. About the Role As an engineer in the Infrastructure department at Alchemy, you will collaborate with our… more
    Upward (07/17/25)
    - Save Job - Related Jobs - Block Source
  • Nayya (New York, NY)
    monitoring , and alerting. Familiarity with DataDog or similar observability platform(s) and tooling. Experience with serverless infrastructure particularly within ... About the Role We are looking for a passionate and driven Senior Site Reliability Engineer (SRE) to join our growing engineering team at Nayya. As a Senior SRE at… more
    Upward (07/11/25)
    - Save Job - Related Jobs - Block Source
  • Hearst (Dallas, TX)
    …Blending software engineering with systems operations, PREs focus on automation, observability , incident response, and the continuous reduction of toil across ... the overall reliability of the platform and/or reduce toil. Establish modern observability patterns and implement those patterns. Monitor the overall platform health… more
    Upward (06/27/25)
    - Save Job - Related Jobs - Block Source
  • Celonis (Redwood City, CA)
    …visa sponsorship, now or in the future. Nice to Have Experience with observability and monitoring tools (eg, Datadog, etc.). Experience in developing and ... modern Software Engineering practices to build resilient and scalable systems. Lead reliability efforts for a fleet of 80+ FedRAMP-compliant microservices running… more
    Upward (07/06/25)
    - Save Job - Related Jobs - Block Source
  • American Federation of State, County and Municipal Employees (Washington, DC)
    …analysts to troubleshoot and resolve issues across environments. Develop and manage observability tools for monitoring and logging using AWS-native services such ... implementing, and managing the continuous integration and deployment infrastructure, monitoring solutions, and cloud-based architecture that supports development and… more
    Upward (07/04/25)
    - Save Job - Related Jobs - Block Source
  • Palo Alto Networks (Reston, VA)
    …robust and performant. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability. Our Infrastructure Platform ... tools and automation frameworks, championing Infrastructure as Code (IaC) and Monitoring as Code (MaC) principles Automate robust deployments and orchestrate… more
    Upward (07/07/25)
    - Save Job - Related Jobs - Block Source
  • Dallas County (Dallas, TX)
    …Collaborates with DevOps and infrastructure teams to ensure CI/CD, observability , and resilience. Manages vendor development teams, third-party integrations, and ... prevention as core quality practices. Defines metrics and dashboards for monitoring code quality, platform stability, and technical debt. Leads retrospectives… more
    Upward (07/03/25)
    - Save Job - Related Jobs - Block Source
  • Spring Health (Seattle, WA)
    …ML infrastructure (eg, model registry, CI/CD, feature stores) to LLM orchestration and observability . Champion AI Trust & Safety: Work in close partnership with our ... our MLOps and LLMOps capabilities. You will help establish robust, automated monitoring for model performance, latency, and cost; define SLOs for platform… more
    Upward (07/21/25)
    - Save Job - Related Jobs - Block Source
  • Lead AWS Monitoring

    Fannie Mae (Reston, VA)
    …as well as coach and mentor team members. *THE IMPACT YOU WILL MAKE* The * Lead Monitoring & Observability Engineer - AWS & APM Tools *role will offer you ... on our team, you will act as the team lead in designing and developing advanced solutions for information...years * 4 years of hands-on experience managing the Monitoring and Observability platform using Splunk/ Dynatrace/… more
    Fannie Mae (05/02/25)
    - Save Job - Related Jobs - Block Source
  • Senior Monitoring & Observability

    Fannie Mae (Reston, VA)
    …errors all while operating under limited supervision. *THE IMPACT YOU WILL MAKE* The *Senior Monitoring & Observability Engineer - AWS & APM Tools* role will ... * 2 years * 1+ years of hands-on experience managing/ using the Monitoring and Observability platform using Splunk/ Dynatrace/ Open Telemetry/ AWS Cloudwatch… more
    Fannie Mae (05/02/25)
    - Save Job - Related Jobs - Block Source
  • Lead Infrastructure Engineer

    Truist (Atlanta, GA)
    …the following job description:** We are seeking a highly skilled and forward-thinking lead observability engineer to architect, implement, and evolve ... champion a shift from reactive monitoring to proactive, intelligence-driven observability . You'll lead efforts to standardize telemetry pipelines, embed … more
    Truist (07/15/25)
    - Save Job - Related Jobs - Block Source
  • Principal Observability Engineer

    Medtronic (Mounds View, MN)
    …walk, creating an inclusive culture where you can thrive. We are seeking a Principal Observability Engineer to lead the design, automation, and support of ... in a more connected, compassionate world. **A Day in the Life** Principal Observability Engineer Careers That Change Lives Transforming Patient Management with… more
    Medtronic (07/24/25)
    - Save Job - Related Jobs - Block Source
  • Staff Observability Operations…

    CVS Health (Hartford, CT)
    …technologies within the existing environment. Work with partners to migrate legacy monitoring to modern solutions. Work with the observability engineering team ... existing or developing new solutions. **Platform Management:** Manage and administer observability and event management platforms. Lead system upgrades,… more
    CVS Health (07/04/25)
    - Save Job - Related Jobs - Block Source
  • Sr. IT Systems Engineer

    Prime Therapeutics (Washington, DC)
    …on observability and monitoring . We are looking for a senior Dynatrace Observability Monitoring Engineer (Sr. IT Systems Engineer ) to build and ... decision we make. **Job Posting Title** Sr. IT Systems Engineer - Observability Team - Remote **Job...Configuring Alerts and High-end business dashboards. + Setting up Observability monitoring Using OTel to capture all… more
    Prime Therapeutics (06/13/25)
    - Save Job - Related Jobs - Block Source
  • Lead Reliability, Maintainability…

    The Boeing Company (Hazelwood, MO)
    …(BDS) is seeking a ** Lead ** **Reliability, Maintainability and System** **Health Engineer for Supportable Low Observability ** (Level 4) to support the ... system architecture. + Supports the planning, organization, implementation and monitoring of requirements management processes, tools, risk, issues, opportunity… more
    The Boeing Company (07/18/25)
    - Save Job - Related Jobs - Block Source
  • Staff Software Engineer - Site Reliability…

    General Motors (Roswell, GA)
    …times per week._ **_The Role:_** The Software Engineering Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance ... of software systems. Their job profile includes: + **System Monitoring and Troubleshooting:** Monitoring the performance and availability of software systems,… more
    General Motors (06/24/25)
    - Save Job - Related Jobs - Block Source