• Observability and Resiliency

    Vanguard (Malvern, PA)
    …are systems that reside in a technically complex and constantly evolving resiliency landscape. Passionate, technically skilled engineers are at the center of our ... resiliency operations, and we are looking to grow our...to grow our team. We are seeking an experienced engineer with broad, end-to-end software development experience, including operating… more
    Vanguard (11/18/25)
    - Save Job - Related Jobs - Block Source
  • Sr Engineer - Observability

    Target (Brooklyn Park, MN)
    …like OpenTelemetry, Grafana, and ClickHouse, reflecting our commitment to innovation in the observability space. As a Senior Software Engineer on this team, you ... culture. Learn more about Target here (https://corporate.target.com/about) . Join our Observability team, a dynamic group that provides the vital telemetry pipeline… more
    Target (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Staff Software Engineer - Network…

    LinkedIn (Mountain View, CA)
    …This role will be based in Mountain View, CA. The Network Infrastructure Observability team is responsible for delivering the platforms, tools, and insights that ... all data center and backbone network environments. As a Senior Staff Software Engineer , you will serve as a technical leader driving the architecture, innovation,… more
    LinkedIn (11/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior Staff Engineer , Server Networking…

    MongoDB (New York, NY)
    …and resiliency of MongoDB Server + Design and implement observability improvements that enable MongoDB engineers and customers to quickly and accurately ... The Networking & Observability Team builds infrastructure for low-overhead observability and communication between MongoDB Server nodes, clients, and other… more
    MongoDB (10/08/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    The Hartford (Chicago, IL)
    Engineer with expertise in Splunk , Dynatrace, CDN, and other industry observability tools . The Senior Reliability Engineer will be responsible for ensuring ... Staff Reliability Engineer - IE07KE We're determined to make a...as we help shape the future. The Hartford's RE&A Observability team is seeking a highly motivated and experienced… more
    The Hartford (11/14/25)
    - Save Job - Related Jobs - Block Source
  • Resiliency Automation Engineer

    Carnegie Mellon University (Pittsburgh, PA)
    We are seeking a highly skilled Resiliency Automation Engineer to join our team supporting embedded systems development in a regulated environment. This role ... mission-critical environments. (eg, aerospace, defense, embedded systems) + Familiarity with observability , logging, and monitoring tools as part of the software… more
    Carnegie Mellon University (10/29/25)
    - Save Job - Related Jobs - Block Source
  • Resiliency Automation Engineer

    Carnegie Mellon University (Pittsburgh, PA)
    We are seeking a highly skilled Resiliency Automation Engineer to join our team supporting embedded systems development in a regulated environment. This role ... mission-critical environments. (eg, aerospace, defense, embedded systems) + Familiarity with observability , logging, and monitoring tools as part of the software… more
    Carnegie Mellon University (10/29/25)
    - Save Job - Related Jobs - Block Source
  • Grafana Stack DevOps Engineer - Assistant…

    Citigroup (Tampa, FL)
    **Overview** We are seeking a highly skilled and motivated Grafana Stack DevOps Engineer to join our team as an Assistant Vice President (AVP) in Tampa. The ideal ... building, maintaining, and ensuring the stability and resilience of our Prime Observability Platform. This role requires a strong understanding of the Grafana… more
    Citigroup (08/26/25)
    - Save Job - Related Jobs - Block Source
  • GenAI Cloud Engineer

    Leidos (Gaithersburg, MD)
    …The Civil Group at Leidos has an opening for an early career GenAI Cloud Engineer to help design, secure, and automate an AWS contact center platform that uses ... while gaining hands on experience with cloud networking, encryption, identity, observability , and GenAI safety controls in a mission environment. **Primary… more
    Leidos (11/07/25)
    - Save Job - Related Jobs - Block Source
  • Staff Engineer , Site Reliability Lead

    CVS Health (Scottsdale, AZ)
    …do it all with heart, each and every day. **Position Summary** The Staff Engineer , Site Reliability Engineer (SRE) will support CVS Health PCW Digital ... and coach teams across SRE and Front-Line Ops on best practices of Observability , Monitoring and SOPs. We are looking for skilled candidates who are enthusiastic… more
    CVS Health (11/07/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Software Engineer

    LinkedIn (Mountain View, CA)
    …as a senior technical leader driving the long-term reliability and observability strategy across LinkedIn's infrastructure + Re-architect LinkedIn's backend systems ... incident response + Define and build frameworks to improve monitoring, alerting, and observability across hundreds of services and systems + Define and own the… more
    LinkedIn (09/24/25)
    - Save Job - Related Jobs - Block Source
  • (USA) Principal, Software Engineer

    Walmart (Sunnyvale, CA)
    **Position Summary ** We are seeking a highly skilled Principal Engineer (Ceph Storage) with 10years+ of deep technical experience in distributed storage systems. ... problem solving, and continuous learning, while driving adoption of automation, observability , and next-generation storage technologies. As part of this team, you… more
    Walmart (11/20/25)
    - Save Job - Related Jobs - Block Source
  • Lead Software Engineer - SRE

    JPMorgan Chase (Columbus, OH)
    …where you can push the limits of what's possible. As a Lead Software Engineer at JPMorgan Chase within the Home Lending division, you will have the opportunity ... development with a focus on system reliability, automation, and observability , ensuring our Home Lending platforms run smoothly in...smoothly in production. We are seeking a Lead Software Engineer and thought leader who is passionate in building… more
    JPMorgan Chase (09/05/25)
    - Save Job - Related Jobs - Block Source
  • Principal Reliability Engineer

    The Hartford (Hartford, CT)
    …+ Support enterprise needs with improvements in Performance, Scalability, Resiliency , Reliability, Stability, Observability , Security, etc.. continuously ... Principal Security Engineer - IS06BE We're determined to make a...enable infrastructure provisioning, application availability, testing, quality, application deployment, resiliency , recovery, and efficiency of IT applications.​ Additionally they… more
    The Hartford (10/23/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer - Generative AI…

    Insight Global (Whitpain, PA)
    Job Description Our Healthcare Insurance Client is looking to hire a Principal Software Engineer - Generative AI. This position is remote work from home and will ... conversion to FTE will be $215-260K. We are seeking a Principal Software Engineer to lead the design and delivery of enterprise-scale Generative AI solutions that… more
    Insight Global (10/07/25)
    - Save Job - Related Jobs - Block Source
  • Storage & Backup Platform Operations…

    Lilly (Indianapolis, IN)
    …The Cloud and Connectivity organization is actively looking for a Storage Platform Operations Engineer to join them. Do you like to solve challenges and have an ... and reliability through repeatable patterns, new architectural designs, improvements in observability to prevent outages to help increase value across the… more
    Lilly (11/11/25)
    - Save Job - Related Jobs - Block Source
  • Software Engineer III - SRE

    JPMorgan Chase (Columbus, OH)
    …you to take your software engineering career to the next level. As a Software Engineer III at JPMorgan Chase within the Home Lending division, you will have the ... be part of a team that focuses on system reliability, automation, and observability , ensuring our Home Lending platforms run smoothly in production. This role offers… more
    JPMorgan Chase (09/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer /SRE - Electronic…

    Bloomberg (New York, NY)
    …compliant software at high efficiency. We are looking for a Senior Software Engineer who is passionate about improving operational resiliency and developer ... Senior Software Engineer /SRE - Electronic Trading Location New York Business...You'll Do + Build automation and frameworks that improve resiliency , observability , and recovery + Partner with… more
    Bloomberg (11/15/25)
    - Save Job - Related Jobs - Block Source
  • Senior AI Infrastructure Software Engineer

    NVIDIA (Santa Clara, CA)
    …on building the AI/ML platform for improving productivity, optimizing efficiency and resiliency of AI workloads, as well as developing scalable AI infrastructure ... services globally. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental in...and optimize tools to improve AI/ML workload efficiency and resiliency . + Root cause and analyze and triage failures… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source
  • Senior DGX Cloud AI Infrastructure Software…

    NVIDIA (Santa Clara, CA)
    …powers our innovative AI research. This team focuses on optimizing efficiency and resiliency of AI workloads, as well as developing scalable AI and Data ... foster innovation. We are seeking an AI infrastructure software engineer to join our team. You'll be instrumental in...Develop and optimize tools to improve infrastructure efficiency and resiliency . + Root cause and analyze and triage failures… more
    NVIDIA (11/01/25)
    - Save Job - Related Jobs - Block Source