- Cleared Jobs (Arlington, VA)
- …and observability tools, such as ELK Stack, Prometheus, or Grafana , to ensure system reliability and performance.CompTIA Security+ or similar DoD 8570 ... future.Overview of Opportunity Two Six Technologies is seeking a Senior DevOps Engineer to join our team, focusing on developing and maintaining infrastructure that… more
- grabjobs (Seattle, WA)
- …and observability , with experience in tools such as Prometheus, Grafana , or Datadog.Strong problem-solving skills, with the ability to troubleshoot complex ... (IaC) solutions using Terraform to automate provisioning and management.Enhance observability and monitoring for infrastructure health and performance using tools… more
- grabjobs (Dearborn, MI)
- …to cloud infrastructure and distributed systems. Experience with monitoring and observability tools (eg, Prometheus, Grafana , Datadog). Excellent communication ... As a Software Engineer on the Cloud Engineering team, you will...for security, IAM, encryption, and compliance to safeguard cloud environments. Observability & Monitoring - Develop and maintain monitoring, logging,… more
- grabjobs (Santa Clara, CA)
- …large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the ... running on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability.Our Infrastructure Platform stack includes… more
- grabjobs (Atlanta, GA)
- …driving improvements in infrastructure & system reliability, performance, high availability, observability , and overall stability of the platform by leveraging the ... and automate routine tasks to improve operational efficiency. * Incorporates Observability as part of day-to-day operations. * Guides reliability practices through… more
- grabjobs (San Diego, CA)
- …best practices (EKS)2+ years of experience with monitoring and observability of applications (Prometheus, Grafana , Alertmanager, Datadog)Proficiency in ... our backgrounds or responsibilities, we are one team.About the RoleThe Sr. Infrastructure Engineer helps design and maintain reliable systems at scale in the cloud.… more
- grabjobs (San Francisco, CA)
- …pipelines, automated testing, and rollout strategies for infra changes.Develop an observability stack (Prometheus, Grafana , OpenTelemetry, eBPF) plus GPU ... telemetry with NVIDIA DCGM.Optimize high‑performance networking (InfiniBand/RDMA) and debug perf bottlenecks.Run and continuously improve the 24x7 on‑call rotation; lead post‑incident reviews.Partner with researchers and engineers, communicate crisply, and… more
- grabjobs (New York, NY)
- …and other container orchestration tools.- RDBMS knowledge (preferably MSSQL/PostgreSQL)- Observability stack (Prometheus, Loki, Jaeger, Grafana )- Kafka- A ... very strong communicator with the ability to interface directly with clients and analysts to ensure technical requirements and delivery align with expectations- A strong understanding of Agile/Scrum and ability to deliver solutions under this methodology-… more
- Experis (Plano, TX)
- Position Title: Monitoring Automation Engineer Contract duration: 12 months with a possibility of extension Targeted Start date: May/June Desired Core Location(s): ... Management: Knowledge of Artifactory for package management. Monitoring & Observability : Experience configuring and managing Splunk, Dynatrace, and OpenTelemetry… more
- Xperi (San Jose, CA)
- …experiences. We can't wait to show you what's next. Staff Software Engineer Key Qualifications: 3-5 years of experience in object-oriented programming with Java ... strong debugging skills, and crash log analysis. Experience with Splunk, Grafana , or similar technologies for data search, analysis, and visualization. Skills… more
- Acubed (Sunnyvale, CA)
- …TensorFlow, ONNX, etc.) and distributed data loading. Experience with monitoring and observability tools (eg, Grafana ). Passion for aviation and autonomous ... The Opportunity/Role Description As a Senior Machine Learning Operations and Infrastructure Engineer , you will develop, operate and maintain our AI software and… more
- Alchemy (New York, NY)
- …conducting root cause analyses and driving continuous reliability enhancements. Develop observability frameworks using Prometheus, Grafana , ELK, and tracing ... driving company-wide reliability efforts and engineering initiatives. Strong familiarity with observability best practices and tooling like Prometheus, Grafana ,… more
- PCR Staffing (Concord, NC)
- …, guiding teams toward higher automation and reliability. Build and manage observability dashboards using Grafana , CloudWatch and Datadog to track application ... Site Reliability Engineer We are seeking a Site Reliability ...firewall management and troubleshooting . Expertise in monitoring and observability tools , including Grafana and Datadog… more
- Palo Alto Networks (Santa Clara, CA)
- …large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of a team supporting the ... running on this infrastructure. This includes automation, architecture, performance, observability , troubleshooting, security, and reliability. Our Infrastructure Platform stack… more
- Booz Allen Hamilton, Inc. (Melbourne, FL)
- …reliability engineering or DevOps roles Experience with hands-on monitoring or observability tools such as Dynatrace, Grafana , Kibana, Elasticsearch, CloudWatch, ... Site Reliability Engineer The Opportunity: As a Site Reliability Engineer (SRE) on our team, you will play a critical part in ensuring our systems and services'… more
- HCL Global Systems, Inc. (Roanoke, TX)
- …including CI/CD pipelines Hands on experience with one or more observability tools (Prometheus, Grafana , ELK/OpenSearch, OpenTelemetry, Datadog, etc.) Use ... Datadog, Catchpoint, Splunk & Grafana for Application Observability and monitoring of app & infrastructure Experienced in Instrumentation with systems skills on… more
- Mindlance (Austin, TX)
- Duration:0-8 month(s) Description/Comment:SRE_ Automation Engineer Job Description: We are looking for a highly skilled Automation Engineer with a strong systems ... work on automating infrastructure, integrating tools via APIs, improving observability , and implementing AIOps-driven solutions. If you?re passionate about… more
- Omni Inclusive (Columbus, OH)
- …Skill PCF (Pivotal Cloud Foundry) and Mongo DB Exposure to at least 1 Observability Tool such as AppDynamics, Splunk, Grafana Change Mgmt using CI/CD pipeline. ... Harness or equivalent tools Secondary Skill SSL Certificate management Vulnerability Management more
- Brown & Brown, Inc. (Daytona Beach, FL)
- …of Linux systems, networking, and performance tuning. Experience with monitoring and observability tools such as Azure Monitor, Zabbix, Grafana , Datadog, ... doing what is best for our customers. Brown & Brown is seeking a Site Reliability Engineer to join our growing team in Daytona Beach, FL and Dallas, TX! The Site… more
- CVS Health (Hartford, CT)
- …dashboards to deliver technical and business process insights leveraging standard observability /BI platforms (eg, AppDynamics, Grafana , Tableau, PowerBI). + ... to apply for CVS Health job opportunities Join CVS Health as a Staff Observability Operations Engineer and contribute to our mission of driving health care… more