- Apple Inc. (Seattle, WA)
- …The ideal candidate will have a strong background in software development and experience with distributed systems. Join a team dedicated to delivering high-quality ... A leading technology company in Seattle is seeking a passionate Site Reliability Engineer to enhance their services like iCloud and Siri.… more
- Apple Inc. (Seattle, WA)
- …that will help millions of customers, then this is the place for you! The Cloud Monitoring SRE organization is specifically tasked with enabling other teams ... highest quality Apple Services experience. Our services have to scale globally, stay highly available, and "just work." If...more challenging. As a Site Reliability Engineer on the Cloud Monitoring Team at Apple you will… more
- Apple Inc. (Seattle, WA)
- …States Software and Services Apple Cloud infrastructure is vast, and the storage SRE teams of Apple Cloud are building and running the next generation ... new and existing services, platforms, and application stacks. Experienced in SRE principles, such as monitoring , alerting, error budgets, fault analysis,… more
- Apple Inc. (Seattle, WA)
- …Services Engineering team as a site reliability engineer to help support and scale cloud services for thousands of development and operations engineers. This ... hands‑on role is to maintain and enhance SRE practices for a private cloud service...be responsible for providing the platform for mission critical cloud systems to maintain constant uptime, scale … more
- Rubrik, Inc. (Seattle, WA)
- …solutions that make it possible for organizations to operate production‑grade AI agents at scale . As the Engineering Manager for the Cloud Platform team , you'll ... and operating the multi‑ cloud infrastructure foundation that powers Rubrik Agent Cloud . You'll shape how we securely deploy, scale , and manage services… more
- Coupang (Seattle, WA)
- … web‑based Java architectures, and JVM configuration. Professional certifications in cloud platforms, monitoring tools, or related technologies. Previous ... production incidents, maintain SLI/SLA bars, and influence design with SRE principles and best practices. If you take pride...experience working on a large‑ scale GPU/ Cloud Infrastructure platform. SLO/SLA management and… more
- Apple Inc. (Seattle, WA)
- … Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud services for millions of Apple users.We are building and supporting ... Redis, etc, alongside internally developed services. Description The Apple Services Engineering Cloud Services SRE organization is looking for a strong,… more
- Docker, Inc. (Seattle, WA)
- …create the platform that enables teams across Docker to rapidly prototype, deploy, and scale their own AI developer tools. Your team will own two critical mandates ... scratch, and create an environment where innovation thrives. Responsibilities Build and Scale the AI Developer Tools Team: Hire, onboard, and develop a… more
- Engineering (Seattle, WA)
- …applications (we manage clusters in our on-prem locations as well as in the " cloud ") Experience with monitoring tools and technologies (we use a combination of ... is the unstructured data platform to store and manage exabyte- scale data anywhere - at the edge, in the...edge, in the core data center and in the cloud . With unstructured data growing in more locations faster… more
- Apple Inc. (Seattle, WA)
- …team. Our team's mission is simple: help people work together-and with Apple devices-at scale , whether that means a school district, a small business, or a Fortune ... work will help redefine how Apple engineers deliver and scale software-combining robust infrastructure with the creativity and capability...Collaborate with service owners, Dev & QA engineers, and SRE teams to ensure software is built and deployed… more
- Rokt (Seattle, WA)
- …accountable not only for the health, stability, and reliability of Rokt's critical cloud infrastructure, but also for the growth and performance of the engineers who ... while balancing cost and speed. Champion operational excellence by refining monitoring , alerting, and CI/CD pipelines, and by running high‑quality post‑incident… more
- MongoDB (Seattle, WA)
- …with ensuring our uptime guarantees to our Atlas customer base + Help scale the worldwide Cloud Operations Engineering team with the strategic implementation ... be responsible for day-to-day duties such as creating and monitoring system's alert dashboards, reviewing critical events and system...**Requirements** + Experience with being an on call DevOps, SRE , or Cloud Operations engineer (at least… more
- Google (Kirkland, WA)
- …in production operations for isolated systems and expertise in distributed cloud /on-premise. + Experience building or operating large- scale infrastructure ... to the availability and operability of GDC's products. + Lead, develop, and scale the mission-critical SRE and Production Operations team, including managing… more
- Microsoft Corporation (Redmond, WA)
- …+ 2-4+ Yrs of experience in roles cloud operations, incident response, SRE or large- scale system engineering preferably in platforms like Azure, AWS, or ... certifications (eg, AWS Certified DevOps Engineer, Azure Solutions Architect, GCP Professional Cloud Architect). + Certifications in ITIL, SRE , or other relevant… more
- Microsoft Corporation (Redmond, WA)
- …controls + OR equivalent experience. + 1+ year(s) technical experience working with large- scale cloud or distributed systems. + 3+ Years of demonstrated ... industrial controls; + OR equivalent hands-on experience. + Proven experience in cloud operations, incident & crisis management, or large- scale systems… more
- Oracle (Seattle, WA)
- …databases on Exadata platform + Proven experience in designing and managing large- scale cloud infrastructure operations in environments like OCI, AWS, Azure, ... and process improvements. **Education and Experience** + 10+ years of experience in cloud infrastructure operations, SRE , or similar roles. + Bachelor's degree… more
- Amazon (Seattle, WA)
- …consulting partners. Together they provide our customers with the expertise and scale needed to build innovative solutions for their most complex challenges. Today, ... AWS's observability services are critical for customers running modern applications at scale . The insights provided by AWS' full stack observability solutions help… more
- Microsoft Corporation (Redmond, WA)
- …and operability of one or more platforms, systems, or products operating at scale . + Leverages technical expertise in cloud technologies and specific products, ... **Overview** Leverages end-to-end technical expertise in large scale distributed systems' infrastructure, code, inter- and intra-service dependencies, and operations… more
- Amazon (Seattle, WA)
- …reliability, and efficiency at the intersection of hardware, software, networking, and cloud services. This role involves creating tools, monitoring systems, ... that enhance quality while reducing execution time. You will decompose large- scale system challenges in testability, reliability, and diagnostics into actionable… more
- Oracle (Seattle, WA)
- **Job Description** At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We ... act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world. Values are OCI's foundation… more