• Cleared Jobs (Arlington, VA)
    …and observability tools, such as ELK Stack, Prometheus, or Grafana , to ensure system reliability and performance.CompTIA Security+ or similar DoD 8570 ... future.Overview of Opportunity Two Six Technologies is seeking a Senior DevOps Engineer to join our team, focusing on developing and maintaining infrastructure that… more
    Talent (07/15/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (Seattle, WA)
    …and observability , with experience in tools such as Prometheus, Grafana , or Datadog.Strong problem-solving skills, with the ability to troubleshoot complex ... (IaC) solutions using Terraform to automate provisioning and management.Enhance observability and monitoring for infrastructure health and performance using tools… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (Dearborn, MI)
    …to cloud infrastructure and distributed systems. Experience with monitoring and observability tools (eg, Prometheus, Grafana , Datadog). Excellent communication ... As a Software Engineer on the Cloud Engineering team, you will...for security, IAM, encryption, and compliance to safeguard cloud environments. Observability & Monitoring - Develop and maintain monitoring, logging,… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (Santa Clara, CA)
    …large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the ... running on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability.Our Infrastructure Platform stack includes… more
    Talent (07/14/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (Atlanta, GA)
    …driving improvements in infrastructure & system reliability, performance, high availability, observability , and overall stability of the platform by leveraging the ... and automate routine tasks to improve operational efficiency. * Incorporates Observability as part of day-to-day operations. * Guides reliability practices through… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (San Diego, CA)
    …best practices (EKS)2+ years of experience with monitoring and observability of applications (Prometheus, Grafana , Alertmanager, Datadog)Proficiency in ... our backgrounds or responsibilities, we are one team.About the RoleThe Sr. Infrastructure Engineer helps design and maintain reliable systems at scale in the cloud.… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (San Francisco, CA)
    …pipelines, automated testing, and rollout strategies for infra changes.Develop an observability stack (Prometheus, Grafana , OpenTelemetry, eBPF) plus GPU ... telemetry with NVIDIA DCGM.Optimize high‑performance networking (InfiniBand/RDMA) and debug perf bottlenecks.Run and continuously improve the 24x7 on‑call rotation; lead post‑incident reviews.Partner with researchers and engineers, communicate crisply, and… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • grabjobs (New York, NY)
    …and other container orchestration tools.- RDBMS knowledge (preferably MSSQL/PostgreSQL)- Observability stack (Prometheus, Loki, Jaeger, Grafana )- Kafka- A ... very strong communicator with the ability to interface directly with clients and analysts to ensure technical requirements and delivery align with expectations- A strong understanding of Agile/Scrum and ability to deliver solutions under this methodology-… more
    Talent (07/11/25)
    - Save Job - Related Jobs - Block Source
  • Experis (Plano, TX)
    Position Title: Monitoring Automation Engineer Contract duration: 12 months with a possibility of extension Targeted Start date: May/June Desired Core Location(s): ... Management: Knowledge of Artifactory for package management. Monitoring & Observability : Experience configuring and managing Splunk, Dynatrace, and OpenTelemetry… more
    Upward (06/29/25)
    - Save Job - Related Jobs - Block Source
  • Xperi (San Jose, CA)
    …experiences. We can't wait to show you what's next. Staff Software Engineer Key Qualifications: 3-5 years of experience in object-oriented programming with Java ... strong debugging skills, and crash log analysis. Experience with Splunk, Grafana , or similar technologies for data search, analysis, and visualization. Skills… more
    Upward (06/21/25)
    - Save Job - Related Jobs - Block Source
  • Acubed (Sunnyvale, CA)
    …TensorFlow, ONNX, etc.) and distributed data loading. Experience with monitoring and observability tools (eg, Grafana ). Passion for aviation and autonomous ... The Opportunity/Role Description As a Senior Machine Learning Operations and Infrastructure Engineer , you will develop, operate and maintain our AI software and… more
    Upward (07/13/25)
    - Save Job - Related Jobs - Block Source
  • Alchemy (New York, NY)
    …conducting root cause analyses and driving continuous reliability enhancements. Develop observability frameworks using Prometheus, Grafana , ELK, and tracing ... driving company-wide reliability efforts and engineering initiatives. Strong familiarity with observability best practices and tooling like Prometheus, Grafana ,… more
    Upward (07/17/25)
    - Save Job - Related Jobs - Block Source
  • PCR Staffing (Concord, NC)
    …, guiding teams toward higher automation and reliability. Build and manage observability dashboards using Grafana , CloudWatch and Datadog to track application ... Site Reliability Engineer We are seeking a Site Reliability ...firewall management and troubleshooting . Expertise in monitoring and observability tools , including Grafana and Datadog… more
    Upward (07/13/25)
    - Save Job - Related Jobs - Block Source
  • Palo Alto Networks (Santa Clara, CA)
    …large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the ... running on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability. Our Infrastructure Platform stack… more
    Upward (07/13/25)
    - Save Job - Related Jobs - Block Source
  • Booz Allen Hamilton, Inc. (Melbourne, FL)
    …reliability engineering or DevOps roles Experience with hands-on monitoring or observability tools such as Dynatrace, Grafana , Kibana, Elasticsearch, CloudWatch, ... Site Reliability Engineer The Opportunity: As a Site Reliability Engineer (SRE) on our team, you will play a critical part in ensuring our systems and services'… more
    Upward (07/20/25)
    - Save Job - Related Jobs - Block Source
  • HCL Global Systems, Inc. (Roanoke, TX)
    …including CI/CD pipelines Hands on experience with one or more observability tools (Prometheus, Grafana , ELK/OpenSearch, OpenTelemetry, Datadog, etc.) Use ... Datadog, Catchpoint, Splunk & Grafana for Application Observability and monitoring of app & infrastructure Experienced in Instrumentation with systems skills on… more
    Upward (07/09/25)
    - Save Job - Related Jobs - Block Source
  • Mindlance (Austin, TX)
    Duration:0-8 month(s) Description/Comment:SRE_ Automation Engineer Job Description: We are looking for a highly skilled Automation Engineer with a strong systems ... work on automating infrastructure, integrating tools via APIs, improving observability , and implementing AIOps-driven solutions. If you?re passionate about… more
    Upward (06/28/25)
    - Save Job - Related Jobs - Block Source
  • Omni Inclusive (Columbus, OH)
    …Skill PCF (Pivotal Cloud Foundry) and Mongo DB Exposure to at least 1 Observability Tool such as AppDynamics, Splunk, Grafana Change Mgmt using CI/CD pipeline. ... Harness or equivalent tools Secondary Skill SSL Certificate management Vulnerability Management more
    Upward (07/01/25)
    - Save Job - Related Jobs - Block Source
  • Brown & Brown, Inc. (Daytona Beach, FL)
    …of Linux systems, networking, and performance tuning. Experience with monitoring and observability tools such as Azure Monitor, Zabbix, Grafana , Datadog, ... doing what is best for our customers. Brown & Brown is seeking a Site Reliability Engineer to join our growing team in Daytona Beach, FL and Dallas, TX! The Site… more
    Upward (07/14/25)
    - Save Job - Related Jobs - Block Source
  • Staff Observability Operations…

    CVS Health (Hartford, CT)
    …dashboards to deliver technical and business process insights leveraging standard observability /BI platforms (eg, AppDynamics, Grafana , Tableau, PowerBI). + ... to apply for CVS Health job opportunities Join CVS Health as a Staff Observability Operations Engineer and contribute to our mission of driving health care… more
    CVS Health (07/04/25)
    - Save Job - Related Jobs - Block Source