- NVIDIA (Santa Clara, CA)
- …Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the next-generation ... observability platform for large-scale AI workloads, GPU clusters, and high-performance computing environments. This...The Crowd: + Proven experience designing and scaling full-stack observability platforms for large-scale AI , GPU, or… more
- NVIDIA (Santa Clara, CA)
- …to do their best work. We are looking for a highly skilled Principal Software Engineer to design and develop AIOps & Observability platforms at NVIDIA. The ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...of engineers, product managers, and partners to define the observability strategy, roadmap, and standard methodologies for NVIDIA. You… more
- Airtable (San Francisco, CA)
- …by nearly every engineering team at Airtable. We also work on LLM observability for AI -powered features. We provide visibility into prompts, model calls, ... keep Airtable's monitoring capabilities at the cutting edge Extend observability to LLM and AI features +...Us? + High Impact Lead the modernization of Airtable's observability stack, influencing how every engineer monitors… more
- Humana (Helena, MT)
- …is the team for you. **About the Role** We're looking for a Lead Software Engineer with deep expertise in logging and observability engineering. You should be ... Networking, Platform Engineering, and Data Science teams to evolve our observability strategy using AI /ML and cloud-native technologies. **Key Responsibilities**… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's Observability team is seeking a Senior/Staff Engineer to compose and build the next-generation, multi-region observability platform. This platform ... powers our rapidly expanding AI , Data, and Observability ecosystem, operating at an immense scale: trillions of metrics, hundreds of terabytes of logs, and… more
- Microsoft Corporation (Redmond, WA)
- …potential of AI to create intelligent, adaptive, and transformative software. The Observability group is seeking a **Software Engineer II - Observability ... **Overview** Core AI is at the forefront of Microsoft's mission...performance. We are seeking a passionate and skilled software engineer to join the Observability platform team.… more
- MongoDB (New York, NY)
- …VictoriaMetrics, Splunk, QuickWit, Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also ... **Team and Role Overview** The SRE Observability team is part of the larger Platform...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt… more
- Cisco (Milpitas, CA)
- Senior Full Stack Engineer - Cloud-Native Observability Platform We are the Catalyst Center Platforms and Capabilities team, responsible for delivering scalable, ... innovation. One of our key initiatives is a cloud-native observability platform purpose-built for Cisco Catalyst Center deployments-bridging on-premises network… more
- MongoDB (New York, NY)
- The Networking & Observability Team builds infrastructure for low-overhead observability and communication between MongoDB Server nodes, clients, and other ... core components for data processing systems + Familiarity with observability ecosystem and best practice + Excellent verbal and...the market. We have redefined the database for the AI era, enabling innovators to create, transform, and disrupt… more
- NVIDIA (Santa Clara, CA)
- …Intelligence: Real world experience applying model development, RAG, MCP, and Agentic AI technical solutions to the problem of observability data analytics, ... at NVIDIA, you will own the development of DGX Cloud strategy for observability , monitoring, and remediation across all layers of infrastructure, IaaS, platforms and… more
- Capgemini (New York, NY)
- Observability Engineer Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be ... + Should have 7+ Hands on experience working with Observability tools◦ Cortex/Mimir + Grafana - Building dashboards ,...engineering, all fueled by its market leading capabilities in AI , generative AI , cloud and data, combined… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** As an observability Distinguished Engineer , you will be a key researcher and technical lead expert in the architecture ... and development of cloud native observability designs, managed services, and real-time telemetry software systems. You will use your depth of engineering and… more
- HP Inc. (Fort Collins, CO)
- **Description** * Our ideal candidate is a versatile Full-Stack Software Engineer who thrives in a fast-paced, start-up culture where innovation, ownership, and ... Python and Go, building intuitive UIs, and enhancing system observability with Prometheus and telemetry tools. You'll work with...for data management and have the opportunity to explore AI integration as a bonus area of growth. **Key… more
- MongoDB (Seattle, WA)
- Join and be a part of leading the MongoDB Networking Observability team, helping build the core of a distributed database! Our team focuses on creating and enhancing ... make these processes, and their communication, easily observable. Networking Observability 's responsibilities include improving MongoDB networking, improving the efficiency… more
- TEKsystems (West Des Moines, IA)
- …seeking a contractor with deep expertise in monitoring tools and system observability . This person will evaluate the current environment recommend improvements, and ... help integrate tools into a cohesive ecosystem. Experience with AI -driven monitoring solutions is a plus. Responsibilities + Integrate existing tools (Dynatrace,… more
- NVIDIA (Santa Clara, CA)
- …Design, implement and support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, ... Production + 8+ years experience delivering foundational infrastructure and observability platforms. + Experience in one or more of...This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed… more
- NVIDIA (Santa Clara, CA)
- …NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to ... the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company...This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed… more
- Vanguard (Malvern, PA)
- …experience operate within a complex and rapidly evolving resiliency landscape. As an Application Engineer within the ChAI (Chat & AI ) team, you will contribute ... build, and support application-level capabilities that improve reliability, performance, and observability for AI and Generative AI workloads. You will also… more
- Google (Sunnyvale, CA)
- Senior Engineering Manager, ML Optimization Tools and Observability _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... of the following: ML performance, debugging, optimization, profiling, or observability . + 5 years of experience in a people...Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have… more
- Cisco (San Jose, CA)
- …distributed tracing initiatives across an organization + Experience with using AI Agents to continually refine observability outcomes + Understanding ... the Team** The DevOps team within Cisco's newly formed AI Software and Platform group designs and operates the...Assistants. Within the DevOps team, our Cloud Platform and Observability group provides all the necessary insights to power… more