- Truist (Atlanta, GA)
- …the following job description:** We are seeking a highly skilled and forward-thinking lead observability engineer to architect, implement, and evolve ... you'll champion a shift from reactive monitoring to proactive, intelligence-driven observability . You'll lead efforts to standardize telemetry pipelines, embed… more
- ServiceNow, Inc. (Atlanta, GA)
- …strategy for observability and data systems across the organization. You'll lead the architecture and design of telemetry, monitoring, and data platforms that ... and telemetry frameworks. + Establish SLAs, SLOs, and data contracts that connect observability to system and business outcomes. + Lead architectural design… more
- Truist (Atlanta, GA)
- …1st shift (United States of America) **Please review the following job description:** This Lead Infrastructure Engineer partners with a wide cross-section of ... systems, cloud-native architectures, and K8s, with the ability to identify observability gaps across service meshes, APIs, and event-driven platforms. **OTHER JOB… more
- Oracle (Atlanta, GA)
- …Description** Oracle Cloud Infrastructure (OCI) is looking for a Principal Software Engineer to lead the development of scalable, resilient, and secure ... within the Host Provisioning Services (HoPS) team, which owns the critical infrastructure responsible for automating the full server lifecycle from rack integration… more
- Insight Global (Alpharetta, GA)
- Job Description Job Description: As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, ... you will be responsible for operating infrastructure solutions, following the principles and practices pioneered by Google's SRE model. Your work will ensure our… more
- Teradata (Atlanta, GA)
- …you will be part of Teradata's Product Engineering leadership team and will lead the strategy and execution for infrastructure , test automation, and release ... Trino, Spark), and event streaming platforms (eg, Kafka, Pulsar). + Champion observability , monitoring, and resilience in infrastructure design, leveraging tools… more
- Oracle (Atlanta, GA)
- **Job Description** OCI (Oracle Cloud) AI Infrastructure Innovation team is inventing the next generation of storage technologies. You will lead architecture and ... the opportunity to advance the state of the art. Responsibilities Lead end-to-end architecture, system design, and implementation for distributed storage platforms.… more
- Confluent (Atlanta, GA)
- …a strong focus on scalability, security, and developer experience. + Lead **operational design** for reliability: build comprehensive observability , monitoring, ... **About the Role:** We are seeking a Senior Software Engineer II to architect, build, and operate services that...operate services that are core to the company's cloud infrastructure and product platforms. This is a hybrid engineering… more
- Oracle (Atlanta, GA)
- **Job Description** OCI (Oracle Cloud) AI Infrastructure Innovation team is pioneering the creation of next-generation AI/HPC networking for GPU superclusters at ... high performance for AI training and inference. You will define architecture, lead complex system design, and implement innovative networking software that advances… more
- Chick-fil-A (Atlanta, GA)
- …hone SRE principles, establish reliability goals, and develop tooling for operational observability . We are a small team working through many different patterns to ... bring observability to everyone. SREs at Chick-fil-A collaborate across teams...Own solution architecture decisions for the team's product + Lead delivery and operations of the team's product, including… more
- EPAM Systems (Atlanta, GA)
- …not just building software - we're engineering excellence. We're looking for a ** Lead Site Reliability Engineer (SRE)** with a passion for performance, ... Troubleshoot mission-critical systems and implement preventative problem management solutions + Lead on promoting observability , scalability, and resiliency best… more
- JPMorgan Chase (Atlanta, GA)
- …provide an adventure where you can push the limits of what's possible. As a Lead Software Engineer at JPMorganChase within the Consumer & Community Banking, you ... Groovy to streamline build and release processes + Designs, provisions, and operates cloud infrastructure in AWS (EC2, EKS, VPC, IAM, S3, RDS, etc.) + Author and… more
- CoStar Realty Information, Inc. (Atlanta, GA)
- …. CoStar Real Estate Manager are looking for a **Site Reliability Engineer (SRE)** to join our high-impact SaaS infrastructure team. In ... CoStar Real Estate Manager - Site Reliability Engineer Job Description CoStar Group (NASDAQ: CSGP) is...cloud platform. You will be instrumental in driving automation, observability , and operational excellence across our hybrid infrastructure… more
- Waystar (Atlanta, GA)
- …architectures that can automatically recover from failures. + Optimize infrastructure monitoring and observability using Prometheus, Grafana, Loki, ... **ABOUT THIS POSITION** SRE Principal Engineer We are seeking a highly skilled SRE...design, build, scale and optimize our cloud platform and infrastructure . This role demands deep hands-on experience with AWS… more
- Waystar (Atlanta, GA)
- …up. + Collaborate with software engineers to optimize deployment pipelines and infrastructure . + **Monitoring & Tooling** + Enhance observability through ... platforms (AWS, GCP, or Azure), container orchestration (Kubernetes), and infrastructure -as-code (Terraform, CloudFormation). + Strong proficiency in observability… more
- General Motors (Atlanta, GA)
- …of the business, our users, and the growth of our engineers. This engineer will start delivering impact through observability frameworks and will evolve ... times a week, at minimum._ **About Us** The AI Cloud and Developer Infrastructure organization is responsible for delivering and maintaining the tools and services… more
- Oracle (Atlanta, GA)
- …new analytics applications using OCI technology. As a Senior Member of Technical Staff engineer , you will be responsible and lead efforts in designing and ... Oracle has formed a new organization - Oracle Health Applications & Infrastructure (OHAI). This team focuses on product development and product strategy for… more
- General Motors (Atlanta, GA)
- …operational processes, improve system reliability, and reduce manual intervention. + ** Observability and Monitoring** : Lead , Implement and improve monitoring ... specialists in reliability and production engineering, with a focus on automation, observability , and shared responsibility. We are looking for individuals who are… more
- CVS Health (Atlanta, GA)
- …a good understand with working within Cloud providers with a focus on efficient infrastructure . The engineer will have laser focus on customer experience and ... Qualifications** + Experience with full stack development. + Experience with observability , Telemetry for infrastructure services. + Experience training and… more
- Oracle (Atlanta, GA)
- …WA * Optional: Redwood City, CA **Job Description** As a Senior Principal Engineer in Oracle Cloud Infrastructure , you will provide key technical guidance ... and function as a lead developer in the development, delivery and operation of...experience and deep domain knowledge in media production pipeline infrastructure services used in film, animation and game development… more