  • Infrastructure Observability

    CACI International (Washington, DC)
    Infrastructure Observability and Monitoring Lead Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: TS/SCI ... is looking for an experienced, innovative, and motivated **Infrastructure Observability and Monitoring Lead** to support the mission objectives and needs…
    CACI International (03/15/25)
  • Senior Director - Resiliency, Observability

    Marriott (Bethesda, MD)
    …**Observability:** * Create and implement a comprehensive strategy for infrastructure observability, encompassing monitoring, logging, and tracing across ... in the hospitality industry. These areas include: 1. Maturing application & Infrastructure observability platform capabilities 2. Strategy and migration of hotel…
    Marriott (03/07/25)
  • VP/Director Observability Engineering

    OneMain Financial (Baltimore, MD)
    We are seeking an experienced and dynamic Vice President - Director to lead observability across the Technology organization. The ideal candidate will have a ... troubleshoot, and optimize their systems. This Senior leader will lead the strategy and execution of a technical roadmap ... and vision. Develop and articulate the product vision for observability and monitoring products. + Define and…
    OneMain Financial (04/25/25)
  • Manager, Monitoring Tooling and Engineering

    OneMain Financial (Baltimore, MD)
    …, **BigPanda**, **AppDynamics**. + Strong understanding of building end-to-end observability, monitoring, and alerting infrastructure, as well as ... We are seeking a skilled and dynamic **Manager, Monitoring Tooling and Engineering** to lead ... development and optimization of tools and instrumentation associated with observability, monitoring, and alerting. You will collaborate…
    OneMain Financial (04/05/25)
  • Lead Site Reliability Engineer (Remote…

    Cognizant (Annapolis, MD)
    …to visualize and track system performance metrics. + Implement SRE practices for monitoring and observability, ensuring system reliability and uptime. + Develop ... **Lead Site Reliability Engineer (Remote - CST)** **About Cognizant** ... automation + Triage and RCA of production incidents + Observability and monitoring with APM tools and…
    Cognizant (05/17/25)
  • SRE Technical Product Manager

    Ford Motor Company (Annapolis, MD)
    …for transforming our monitoring practices from traditional application and infrastructure monitoring to comprehensive full-stack observability solutions. ... team and lead the development, enhancement, and extension of our observability platform. This individual will play a pivotal role in shaping product roadmaps,…
    Ford Motor Company (05/07/25)
  • Lead Data Engineer - AWS/Full Stack

    The Hartford (Hunt Valley, MD)
    …Informatica Data Integration and/or Talend. + AWS certification. + Experience with Observability monitoring tools such as Dynatrace, Splunk. Candidates must be ... valuable, discoverable, reusable asset. As a Data Governance Tech Lead, you will be an expert in our procedures ... Data Platform leadership to design and implement processes and observability tooling that can be used to monitor data…
    The Hartford (04/09/25)
  • Principal Engineer Data Engineering - US Remote

    Anywhere Real Estate (Baltimore, MD)
    …+ 5+ years' experience managing production data platforms. + 5+ years' experience building observability (Monitoring & Alerting) using tools such as Datadog and ... you'll be responsible for designing, implementing, and managing the data infrastructure. You will work closely with data scientists, software engineers, and…
    Anywhere Real Estate (04/02/25)
  • Senior Staff Engineer - Global Commercial Services

    American Express (Annapolis, MD)
    …and can drive technical excellence at scale. **Key responsibilities** + Lead the technical direction for complex projects and systems, ensuring scalability, ... to key system components across the stack (front end, back end, and infrastructure) based on priority and team need. + Uphold and champion Center's engineering…
    American Express (05/21/25)
  • Automation Evangelist

    Arrow Electronics (Washington, DC)
    …on Technology Business Management Solutions (including FinOps, IT Financial Management, and Observability), and IT Automation. This person will lead our ... the use of automation technologies (e.g., FinOps, IT Financial Management, and Observability) across the organization, fostering a culture of innovation and continuous…
    Arrow Electronics (05/03/25)
  • VP / Director - DevOps Principal Engineer

    OneMain Financial (Baltimore, MD)
    …tools exposed as interfaces and advanced but easy-to-manage logging, alerting, and monitoring strategies for full stack observability and quick problem ... Along with the head of DevOps, the Principal Engineer for DevOps will lead OneMain's delivery organization toward these objectives. The leader will pursue a…
    OneMain Financial (04/25/25)
  • System Development Engineer II, Systems…

    Amazon (Columbia, MD)
    …(KDS/KDF/KDA/MSK) services in Amazon Dedicated Cloud (ADC) and GovCloud regions. Amazon Monitoring and Observability (AMO) AMO provides AWS CloudWatch services ... (AWS) is the world leader in providing a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in…
    Amazon (02/23/25)
  • Software Development Engineer II, Software…

    Amazon (Columbia, MD)
    …(KDS/KDF/KDA/MSK) services in Amazon Dedicated Cloud (ADC) and GovCloud regions. Amazon Monitoring and Observability (AMO) AMO provides AWS CloudWatch services ... (AWS) is the world leader in providing a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers hundreds of thousands of businesses in…
    Amazon (02/23/25)
  • Sr Principal DevOps Platform Engineer

    Northrop Grumman (Linthicum Heights, MD)
    …and testing workflows across various environments + Deploy and maintain robust monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, ELK) to ... is seeking a **Senior Principal DevOps Platform Engineer** with demonstrated ability to lead development of new technologies to support our innovative MDA team in…
    Northrop Grumman (04/08/25)
  • FLEX Software Engineer - Salesforce Commerce…

    Marriott (Bethesda, MD)
    …(e.g., Slack, Jira, Confluence) + Experience with Dynatrace and/or Splunk for observability and monitoring, including setting up and configuring dashboards, alerts, ... repetitive tasks related to system maintenance and operational processes. + Lead incident management and problem management processes to ensure timely resolution…
    Marriott (05/14/25)
  • Staff Site Reliability Engineer-TS/SCI Clearance…

    Northrop Grumman (Linthicum Heights, MD)
    …and test workflows across various environments + Deploy and maintain robust monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, ELK) to ... reliability, and security (i.e., K8s, OpenShift) + Develop and manage infrastructure-as-code (IaC) solutions to automate and standardize platform tool deployments (e.g.…
    Northrop Grumman (05/01/25)
  • Principal/ Senior Principal Site Reliability…

    Northrop Grumman (Linthicum Heights, MD)
    …and test workflows across various environments + Deploy and maintain robust monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, ELK) to ... reliability, and security (i.e., K8s, OpenShift) + Develop and manage infrastructure-as-code (IaC) solutions to automate and standardize platform tool deployments (e.g.…
    Northrop Grumman (05/01/25)