- NVIDIA (Santa Clara, CA)
- …at NVIDIA, you will own the development of DGX Cloud strategy for observability , monitoring , and remediation across all layers of infrastructure, IaaS, platforms ... define and drive the technical implementation for DGX Cloud offerings in the observability , monitoring , and remediation practice. + Collaborate on Cross Domain… more
- LinkedIn (Mountain View, CA)
- …driving systemic improvements in availability and performance + Previous experience in a Distinguished Engineer or equivalent role at a high-growth or web-scale ... and incident response + Define and build frameworks to improve monitoring , alerting, and observability across hundreds of services and systems + Define and own… more