- Ford Motor Company (Sacramento, CA)
- …lives, this is the opportunity for you. Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, ... ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE...+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for…
- NVIDIA (CA)
- …NVIDIA infrastructure. Work with NVIDIA's DGX Cloud team as a Senior Site Reliability Engineer to maintain high-performance DGX Cloud ... most impactful fields of our generation: Cloud Engineering, Cloud Infrastructure, and Site Reliability Engineering. If you're a creative engineer who…
- Walmart (Sunnyvale, CA)
- …through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with L2, Other ... and high impact problems. This role is part of Cloud Powered Checkout team and will build the next...example, probability of failure, frequency of failure) to measure site reliability. Monitors site…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... SRE at NVIDIA ensures that our internal and external facing GPU cloud services run with maximum reliability and uptime as promised to the users and at the same…
- SpaceX (Hawthorne, CA)
- Site Reliability Engineer (Special Programs). SpaceX was founded under the belief that a future where humanity is out exploring the stars ... the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (SPECIAL PROGRAMS)...data analysis team is building highly reliable on-premises and cloud compute clusters to host machine learning inference-heavy models. …
- TP-Link North America, Inc. (Irvine, CA)
- …simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play a ... crucial role in ensuring our cloud platform's security, reliability, scalability, and operational...related field. + 5+ years of experience as a Site Reliability Engineer. + Proficiency…
- TP-Link North America, Inc. (Irvine, CA)
- …with simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a ... crucial role in ensuring our cloud platform's security, reliability, scalability, and operational...related field. + 1-3 years of experience as a Site Reliability Engineer or in…
- Zscaler (San Jose, CA)
- …speed and agility with a cloud-first strategy. We're looking for an experienced Staff Site Reliability Engineer to join our Government Cloud team, ... founded in 2007 with a mission to make the cloud a safe place to do business and a...(Minimum Qualifications)** + 5+ years of experience as a Site Reliability Engineer with expertise…
- iCIMS (Sacramento, CA)
- **Job Overview** We are seeking a skilled Engineer, Site Reliability (SRE) to contribute to the reliability, scalability, and performance of our multi- ... work environment where everyone belongs. **Responsibilities** + **System Monitoring & Reliability:** + Monitor multi-cloud infrastructure (AWS, Azure, GCP) using…
- NVIDIA (Santa Clara, CA)
- …automated, and secure production environments. We are seeking a deeply skilled Staff Site Reliability Engineer (SRE) to advance our enterprise security ... outcomes by implementing, integrating, and scaling innovative technologies across cloud-native and hybrid infrastructures. This position requires a strong software…
- SpaceX (Hawthorne, CA)
- Site Reliability Engineer (Starshield)...and manage compute resources both on-premises and in the cloud + Deploy and manage core infrastructure such as ... possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's…
- Amazon (Culver City, CA)
- Description As a Site Reliability Engineer, you'll have end-to-end ownership of the product, user experience, design, and technology required to deliver ... executives at all levels. Our Infrastructure Engineering team is looking for Site Reliability Engineers to build, deploy, operate, and sustain our critical…
- General Motors (Sunnyvale, CA)
- …+ Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability ... an OEM - comprehensive control over both in-vehicle and cloud software - to deliver seamless solutions to our...future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements…
- Amazon (Culver City, CA)
- …executives at all levels. Our Infrastructure Engineering team is looking for Sr Site Reliability Engineers to build, deploy, operate, and sustain our critical ... within existing frameworks, tools and processes to continuously improve systems. Site Reliability Engineers focus on automating infrastructure at scale…
- Leidos (Vista, CA)
- **Description** This position will require up to 75% travel. Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for ... critical infrastructure on-prem and remote + Manage on-premises and private/public cloud environments via infrastructure-as-code (IaC) and hands-on/client site…
- NVIDIA (Santa Clara, CA)
- …impact on the world. NVIDIA is looking to hire a deeply technical and creative Site Reliability Engineer to build, support and maintain the next generation ... role will give an opportunity to collaborate with the Cloud and AI/ML workforce in a dynamic and agile...automate processes, and iterate for efficiency + Tackle systemic reliability issues with multi-functional teams. + Monitor, optimize, and…
- MongoDB (Los Angeles, CA)
- …or remotely in the United States region. **Role Overview** We are seeking a talented Site Reliability Engineer (SRE) with a strong networking background to ... data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build…
- PennyMac (Westlake Village, CA)
- …quickly and accurately, is critical to the success of anyone in this role. The Engineer III, Site Reliability Operations will: + Monitoring - Oversee 24/7 ... journey. A Typical Day As a member of the Site Reliability Operations (SRO) team, you will...Pennymac is now almost completely migrated into the AWS cloud. Individuals in this role should be comfortable working…
- JPMorgan Chase (Palo Alto, CA)
- …You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer** at JPMorgan Chase within the **Enterprise ... qualifications, capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years applied experience. …
- Palo Alto Networks (Santa Clara, CA)
- …As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: Utilize your expertise in monitoring cloud platforms, particularly GCP, to ... optimize our infrastructure, leveraging cloud-native technologies + Monitoring Expertise: Improve monitoring processes, alerts, and metrics. Work with development…