  • Senior Engineer - AI and HPC…

    NVIDIA (Santa Clara, CA)
    Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation ... observability platform for large-scale AI workloads, GPU clusters, and high-performance computing environments. This...The Crowd: + Proven experience designing and scaling full-stack observability platforms for large-scale AI, GPU, or…
    NVIDIA (01/10/26)
  • Principal Software Engineer, AIOps…

    NVIDIA (Santa Clara, CA)
    …to do their best work. We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability platforms at NVIDIA. The ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...of engineers, product managers, and partners to define the observability strategy, roadmap, and standard methodologies for NVIDIA. You…
    NVIDIA (01/10/26)
  • Senior Software Engineer

    Airtable (San Francisco, CA)
    …by nearly every engineering team at Airtable. We also work on LLM observability for AI-powered features. We provide visibility into prompts, model calls, ... keep Airtable's monitoring capabilities at the cutting edge Extend observability to LLM and AI features +...Us? + High Impact Lead the modernization of Airtable's observability stack, influencing how every engineer monitors…
    Airtable (01/09/26)
  • Lead Software Engineer - Enterprise…

    Humana (Helena, MT)
    …is the team for you. **About the Role** We're looking for a Lead Software Engineer with deep expertise in logging and observability engineering. You should be ... Networking, Platform Engineering, and Data Science teams to evolve our observability strategy using AI/ML and cloud-native technologies. **Key Responsibilities**…
    Humana (01/10/26)
  • Senior Software Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. This platform ... powers our rapidly expanding AI, Data, and Observability ecosystem, operating at an immense scale: trillions of metrics, hundreds of terabytes of logs, and…
    NVIDIA (01/10/26)
  • Software Engineer II - Observability

    Microsoft Corporation (Redmond, WA)
    …potential of AI to create intelligent, adaptive, and transformative software. The Observability group is seeking a **Software Engineer II - Observability ... **Overview** Core AI is at the forefront of Microsoft's mission...performance. We are seeking a passionate and skilled software engineer to join the Observability platform team.…
    Microsoft Corporation (11/26/25)
  • Site Reliability Engineer (Senior…

    MongoDB (New York, NY)
    …VictoriaMetrics, Splunk, QuickWit, Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also ... **Team and Role Overview** The SRE Observability team is part of the larger Platform...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt…
    MongoDB (11/26/25)
  • Senior Full Stack Engineer - Cloud-Native…

    Cisco (Milpitas, CA)
    Senior Full Stack Engineer - Cloud-Native Observability Platform We are the Catalyst Center Platforms and Capabilities team, responsible for delivering scalable, ... innovation. One of our key initiatives is a cloud-native observability platform purpose-built for Cisco Catalyst Center deployments-bridging on-premises network…
    Cisco (11/12/25)
  • Senior Staff Engineer, Server Networking…

    MongoDB (New York, NY)
    The Networking & Observability Team builds infrastructure for low-overhead observability and communication between MongoDB Server nodes, clients, and other ... core components for data processing systems + Familiarity with observability ecosystem and best practice + Excellent verbal and...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt…
    MongoDB (01/06/26)
  • Distinguished Engineer

    NVIDIA (Santa Clara, CA)
    …Intelligence: Real world experience applying model development, RAG, MCP, and Agentic AI technical solutions to the problem of observability data analytics, ... at NVIDIA, you will own the development of DGX Cloud strategy for observability, monitoring, and remediation across all layers of infrastructure, IaaS, platforms and…
    NVIDIA (11/24/25)
  • Observability Engineer

    Capgemini (New York, NY)
    Observability Engineer Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be ... + Should have 7+ Hands on experience working with Observability tools◦ Cortex/Mimir + Grafana - Building dashboards,...engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined…
    Capgemini (01/07/26)
  • Distinguished, Software Engineer

    Walmart (Sunnyvale, CA)
    **Position Summary** **What you'll do** As an observability Distinguished Engineer, you will be a key researcher and technical lead expert in the architecture ... and development of cloud native observability designs, managed services, and real-time telemetry software systems. You will use your depth of engineering and…
    Walmart (10/21/25)
  • Full-Stack Software Engineer - Backend,…

    HP Inc. (Fort Collins, CO)
    **Description** * Our ideal candidate is a versatile Full-Stack Software Engineer who thrives in a fast-paced, start-up culture where innovation, ownership, and ... Python and Go, building intuitive UIs, and enhancing system observability with Prometheus and telemetry tools. You'll work with...for data management and have the opportunity to explore AI integration as a bonus area of growth. **Key…
    HP Inc. (01/11/26)
  • Staff Software Engineer, Networking…

    MongoDB (Seattle, WA)
    Join and be a part of leading the MongoDB Networking Observability team, helping build the core of a distributed database! Our team focuses on creating and enhancing ... make these processes, and their communication, easily observable. Networking Observability's responsibilities include improving MongoDB networking, improving the efficiency…
    MongoDB (12/26/25)
  • Observability And Monitoring…

    TEKsystems (West Des Moines, IA)
    …seeking a contractor with deep expertise in monitoring tools and system observability. This person will evaluate the current environment, recommend improvements, and ... help integrate tools into a cohesive ecosystem. Experience with AI-driven monitoring solutions is a plus. Responsibilities + Integrate existing tools (Dynatrace,…
    TEKsystems (01/07/26)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, ... Production + 8+ years experience delivering foundational infrastructure and observability platforms. + Experience in one or more of...This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed…
    NVIDIA (12/19/25)
  • Principal Firmware Engineer - Server…

    NVIDIA (Santa Clara, CA)
    …NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to ... the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company...This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed…
    NVIDIA (11/12/25)
  • Reliability and Observability Lead

    Vanguard (Malvern, PA)
    …experience operate within a complex and rapidly evolving resiliency landscape. As an Application Engineer within the ChAI (Chat & AI) team, you will contribute ... build, and support application-level capabilities that improve reliability, performance, and observability for AI and Generative AI workloads. You will also…
    Vanguard (12/17/25)
  • Senior Engineering Manager, ML Optimization Tools…

    Google (Sunnyvale, CA)
    Senior Engineering Manager, ML Optimization Tools and Observability, Google, Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... of the following: ML performance, debugging, optimization, profiling, or observability. + 5 years of experience in a people...Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have…
    Google (12/11/25)
  • AI Operations Engineer - C3

    Cisco (San Jose, CA)
    …distributed tracing initiatives across an organization + Experience with using AI Agents to continually refine observability outcomes + Understanding ... the Team** The DevOps team within Cisco's newly formed AI Software and Platform group designs and operates the...Assistants. Within the DevOps team, our Cloud Platform and Observability group provides all the necessary insights to power…
    Cisco (01/10/26)