• Infrastructure Monitoring

    Google (New York, NY)
    …everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers ... to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running,… more
    Google (05/08/25)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer,…

    MongoDB (New York, NY)
    …Engineer for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure . As a Staff SRE, you will be very hands-on technically while also ... team collaborates closely with other engineering teams to ensure that our infrastructure adheres to the highest security standards. They build essential security … more
    MongoDB (05/07/25)
    - Save Job - Related Jobs - Block Source
  • Lead, Site Reliability Engineering,…

    MongoDB (New York, NY)
    …Lead for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure . As a Lead SRE, you will be very hands-on technically while also directly ... team collaborates closely with other engineering teams to ensure that our infrastructure adheres to the highest security standards. They build essential security … more
    MongoDB (04/10/25)
    - Save Job - Related Jobs - Block Source
  • Manager, Site Reliability Engineer…

    NBC Universal (Englewood Cliffs, NJ)
    …spin-off is expected to be completed during 2025. NBC Universal's Enterprise Technology Site Reliability team improves system reliability by anticipating and ... learn and adapt to new processes, solutions, and platforms relevant to Site Reliability Engineering, including Automation and Event Management platforms,… more
    NBC Universal (05/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    MongoDB (New York, NY)
    …any infrastructure , and our newest offering, Atlas Data Lake. The Cloud Site Reliability Engineering Team designs and builds the global infrastructure ... requirements. The SRE Team's mission is to build this increasingly complex infrastructure , while continually lowering the operational burden associated with it, and… more
    MongoDB (04/10/25)
    - Save Job - Related Jobs - Block Source
  • Principal TPM, Ops Tech Solutions…

    Amazon (New York, NY)
    …cross-organizational initiatives to develop and implement automated availability metrics and monitoring systems - Drive adoption of Operational Excellence (OE) best ... https://www.aboutamazon.com/workplace/employee-benefits . This position will remain posted until filled. Applicants should apply via our internal or external career site more
    Amazon (05/13/25)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    MongoDB (New York, NY)
    …We also own related services, including our telemetry pipeline, and our monitoring and alerting infrastructure . Our stack includes VictoriaMetrics, Splunk, ... spans the globe - including several cloud providers + Build for reliability , making services and infrastructure available, resilient, fault tolerant and… more
    MongoDB (05/20/25)
    - Save Job - Related Jobs - Block Source
  • Senior Reliability Engineer

    Celonis (New York, NY)
    …and resilience of our platform. The team applies advanced software engineering and Site Reliability Engineering (SRE) principles to drive system reliability , ... + Join a highly technical, collaborative, and innovation-driven team that blends Site Reliability Engineering with modern Software Engineering practices to build… more
    Celonis (04/26/25)
    - Save Job - Related Jobs - Block Source
  • DevOps Engineer

    Paramount (New York, NY)
    …services. **Basic Qualifications:** 2+ years of experience in DevOps, Site Reliability Engineering (SRE), or Cloud Infrastructure Engineering as well as ... Debug and resolve production issues related to latency, scaling, and reliability . **Key Projects:** Build and optimize Kubernetes-based infrastructure for… more
    Paramount (03/20/25)
    - Save Job - Related Jobs - Block Source
  • AI, Analytics & RPA Support SRE Lead

    Mizuho Corporate Bank (New York, NY)
    …and optimization. + Experience in implementing and managing Site Reliability Engineering practices, including automation, monitoring , and proactive issue ... on the Azure cloud platform, ensuring optimal performance and scalability. + Site Reliability Engineering (SRE): Implement SRE practices to enhance system… more
    Mizuho Corporate Bank (05/21/25)
    - Save Job - Related Jobs - Block Source
  • Devops Engineer

    Insight Global (Englewood Cliffs, NJ)
    …to quantify the scope of reported issues . Create new metrics and identify monitoring deliverables to improve site reliability . Administer monitoring ... monitoring /alerting tools for on-air environments . Interact with automated monitoring infrastructure to ensure healthy environments . Create system… more
    Insight Global (05/16/25)
    - Save Job - Related Jobs - Block Source
  • Distinguished Full Stack Architect (US) - Capital…

    TD Bank (New York, NY)
    …orchestrated workloads such as Kubernetes/Docker. + Experience developing DevOps, Site Reliability , and Solutions Architecture methodologies and practices. ... reliability , scalability, and the development of the architectural infrastructure ; including highly complex and scalable systems​ + Develops observability… more
    TD Bank (05/15/25)
    - Save Job - Related Jobs - Block Source
  • Sr. Middleware Engineer Kafka (Hybrid)

    Broadridge Financial Solutions (New York, NY)
    …in CI/CD pipelines (eg, Jenkins) and version control (Git) for distributed systems. Monitoring & Reliability + Proven ability to monitor and troubleshoot ... means youll be assigned to a Broadridge office and will work both on- site and remote. Responsibilities: Architecture & Design + Architect, design, and implement… more
    Broadridge Financial Solutions (05/02/25)
    - Save Job - Related Jobs - Block Source
  • Observability Engineer

    Capgemini (New York, NY)
    …professional with over 5 years of experience in IT operations, observability, or Site Reliability Engineering (SRE) roles. The ideal candidate will have hands-on ... experience with Dynatrace and expertise in monitoring AWS workloads and databases. Proficiency in Golden Signal metrics, Kubernetes monitoring , distributed… more
    Capgemini (05/03/25)
    - Save Job - Related Jobs - Block Source
  • Cloud Systems Engineer (New York, NY)

    Chobani (New York, NY)
    …The Cloud Systems Engineer is an integral part of the Chobani IT Infrastructure team, supporting both cloud and on-premise environments for the business. The role ... This position will focus on ensuring system performance, security, and reliability , while collaborating with cross-functional teams to align systems with business… more
    Chobani (03/29/25)
    - Save Job - Related Jobs - Block Source
  • Director, Production Design Engineering

    NBC Universal (New York, NY)
    …party and senior subject matter expert for technology, ensuring excellence and reliability . Additional Requirements: + Required On- Site : This position is ... Report on project deliverables and timelines, and establish proper reporting standards Monitoring and Security + Collaborate closely with the NBCU cyber security… more
    NBC Universal (05/20/25)
    - Save Job - Related Jobs - Block Source
  • Lead Software Engineer - Platform

    JPMorgan Chase (New York, NY)
    …and accessibility for team members. + Lead efforts to improve system reliability and performance through proactive monitoring and incident response strategies. ... optimize and maintain high availability and scalability. + Develop and maintain infrastructure as code using tools like Terraform, ensuring consistent and repeatable… more
    JPMorgan Chase (05/19/25)
    - Save Job - Related Jobs - Block Source
  • Chief Information Officer

    Charles B. Wang Community Health Center (Manhattan, NY)
    …the CEO and is responsible for overseeing the management of IT infrastructure , information security, and reporting operations to ensure delivery of high-quality, ... patient-centered care. This position is primarily on- site in New York City, with some remote flexibility as needed. As part of the executive team, the CIO will work… more
    Charles B. Wang Community Health Center (05/14/25)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer

    Disney Entertainment (New York, NY)
    …(Python, Powershell, Bash) + Designing APIs and building RESTful services + Modern infrastructure and site reliability engineering practices, including ... engineering behind personalization, commerce, lifecycle, and identity. The **Core Infrastructure Automation Team** is looking for a Principal Software Engineer… more
    Disney Entertainment (05/01/25)
    - Save Job - Related Jobs - Block Source
  • Electronic Alarms Specialist 2

    New York State Civil Service (Orangeburg, NY)
    …required to inspect, maintain, troubleshoot, repair and install firealarm detection, monitoring and transmitting systems (to include aircraft firefighting foam and ... to understand and carry out written and verbal instructions.\* Demonstrated reliability and trustworthiness* Will complete and attend training as required.* Periodic… more
    New York State Civil Service (03/20/25)
    - Save Job - Related Jobs - Block Source