- Epsilon, Inc (Phoenix, AZ)
- **Enterprise Monitoring Engineer I** **Epsilon is now part of AMERICAN SYSTEMS!** We are pleased to announce Epsilon, Inc. has joined AMERICAN SYSTEMS. Together, ... Solution Architecture, and Project Management. **An average day:** As Enterprise Monitoring Engineer I, you will ensure the performance, availability,… more
   
- Home Depot (Atlanta, GA)
- …Pinecone, or Vertex AI). + Strong background in platform reliability and observability, with practical experience defining and monitoring service level ... **Position Purpose:** The Sr. Software Engineer is responsible for leading the deployment, integration,...tuning (managing data freshness and latency SLOs), and production monitoring. This role also involves hardening RAG ingestion pipelines,… more
   
- DataRobot (Olympia, WA)
- …(e.g., Python, Java, Go, C++, or equivalent). + Strong understanding of observability and monitoring: metrics, tracing, logging; and instrumentation of services. ... their business - today and in the future. As a Staff Software Engineer focused on Application Scalability & Performance, you will lead the design, implementation,… more
   
- Signature Aviation (Orlando, FL)
- …agents capable of reasoning, planning, and executing multi-step tasks. + **Engineer LLM Applications:** Develop fine-tuned and optimized large language model ... **Establish AI Engineering Practices:** Define standards for AI model deployment, observability, safety, and compliance. + **Collaborate Across Teams:** Partner with… more
   
- Home Depot (Atlanta, GA)
- **Position Purpose:** The Staff Software Engineer is responsible for setting the technical strategy and leading the architecture for The Home Depot's Gemini ... at enterprise scale while enabling governed AI agent workflows. As a Staff Software Engineer, you will be part of a dynamic team with engineers of all experience… more
   
- Zscaler (San Jose, CA)
- …world's cloud security leader. We're looking for an experienced Staff-level Applied AI Engineer to join our IT AI/ML Engineering team. This role offers flexibility ... the organization + Designing AI lifecycle management strategies with automated observability, CI/CD workflows, and robust testing frameworks + Collaborating with… more
   
- Zscaler (San Jose, CA)
- …operations, DevOps, or site reliability roles + Demonstrated expertise in system observability, including log analysis and metrics monitoring using tools like ... strategy. We are seeking an experienced Senior Staff Infrastructure Operations Engineer to join our team. This critical role involves designing, implementing,… more
   
- Lyric (Newtown Square, PA)
- …testing tools (k6, Gatling, JMeter, Locust) + Strong experience with monitoring/observability (Grafana, Datadog, New Relic) + Experience testing distributed ... integration of quality gates into CI/CD pipelines using GitLab + Drive observability practices across the platform with Datadog and OpenTelemetry for performance… more
   
- Home Depot (Atlanta, GA)
- …quickly + Experience with cloud platforms such as Google Cloud Platform (GCP), and observability and monitoring tools such as Grafana, New Relic, etc. + ... **Position Purpose:** As a Software Engineer Manager on the Cart & Checkout team,...+ Writes custom code or scripts to automate infrastructure, monitoring services, and test cases + Works with vendors… more
   
- Wipfli LLP (Milwaukee, WI)
- …Summary Under the direction of the IT Director, the Senior DevOps Engineer is responsible for the administration, governance, and strategic enablement of the Azure ... and that the platform is maintained with high reliability and performance. The engineer will lead efforts to define best practices, enforce standards, and mentor… more
   
- Carnegie Mellon University (Pittsburgh, PA)
- …or mission-critical environments. (e.g., aerospace, defense, embedded systems) + Familiarity with observability, logging, and monitoring tools as part of the ... We are seeking a highly skilled Resiliency Automation Engineer to join our team supporting embedded systems development in a regulated environment. This role will… more
   
- McAfee, Inc. (Frisco, TX)
- **_Role Overview:_** As the SRE engineer, you will be accountable & responsible to maintain the appropriate service levels (availability, latency, and reliability) ... will ensure applications on-boarded to SRE are instrumented for full-stack observability and continuous testing, introduce continuous improvement, integrate into IT… more
   
- NetApp (San Jose, CA)
- …the Cloud Architecture, Containerization (Kubernetes, Docker) o Extensive knowledge in observability platforms such as Application performance monitoring such ... NetApp is seeking a highly skilled and experienced Senior Engineer to join our Active IQ Engineering team. As a Senior Engineer, you will be responsible for designing, optimizing, and… more
   
- Quantum-Si (Branford, CT)
- We are looking for an experienced and ambitious instrument control software engineer to join our instrument software team. We're developing a next-generation system ... capabilities for cloud connectivity, secure software updates, and performance monitoring. _Note: While some responsibilities involve supporting our current product… more
   
- Zscaler (San Jose, CA)
- …agility with a cloud-first strategy. We're looking for an experienced Principal Software Engineer to join our Workflow Automation team. This is a hybrid role ... engineers and fostering team growth through knowledge sharing and guidance + Monitoring system health, troubleshooting issues, and optimizing services with a DevOps… more
   
- Zscaler (Short Hills, NJ)
- …Exceptional remote candidates will also be considered. As a Principal Site Reliability Engineer - ML Platform, you will: + Architect, build, and maintain large-scale ... (SRE) for AI-driven applications deployed on AWS, ensuring performance, availability, observability , and scalability + Collaborate with the engineering team to… more
   
- Tyto Athene (Reston, VA)
- …(Kubernetes / Docker) with a focus on resilience and scale + Implement observability, monitoring, alerting, and automated remediations + Lead incident response, ... **Description** Tyto Athene is hiring a **Senior DevSecOps & Automation Engineer** with deep expertise in Kubernetes orchestration and strong hands-on experience… more
   
- Zscaler (San Jose, CA)
- …and agility with a cloud-first strategy. We're looking for an experienced Sr. Software Engineer to join our Digital Experience team. This role is hybrid and based in ... microservices, data/feature pipelines, and low-latency inference paths with solid observability + Operate with excellence: add tests, monitors, tracing, dashboards;… more
   
- Zscaler (San Jose, CA)
- …agility with a cloud-first strategy. We're looking for an experienced Senior Staff Development Engineer to join our team. This role is hybrid and based in our San ... and overall architecture + Experience with graph databases such as Neo4j, alongside observability tools like Prometheus, Grafana, and logging systems such as the ELK… more
   