- MalaceHR (Thousand Oaks, CA)
- MalaceHR is seeking a Senior Reliability Engineer to support strategic asset management and reliability engineering initiatives across a diverse ... management, and maintenance effectiveness, while identifying opportunities to enhance system reliability and minimize operational risks. This position serves as a… more
- SpaceX (Hawthorne, CA)
- Sr . Site Reliability Engineer , Data (Application Software) Hawthorne, CA Apply SpaceX was founded under the belief that a future where humanity is out ... the ultimate goal of enabling human life on Mars. SR . SITE RELIABILITY ENGINEER...weekends when needed COMPENSATION AND BENEFITS: Pay Range: Software Engineer / Senior : $160,000.00 - $220,000.00/per year Your actual… more
- SpaceX (Hawthorne, CA)
- Sr . Site Reliability Engineer (Starshield) - Top Secret Clearance Hawthorne, CA Apply SpaceX was founded under the belief that a future where humanity is ... the ultimate goal of enabling human life on Mars. SR . SITE RELIABILITY ENGINEER...and alcohol testing COMPENSATION AND BENEFITS: Pay Range: Software Engineer / Senior : $160,000.00 - $220,000.00/per year Your actual… more
- NVIDIA (Santa Clara, CA)
- …secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance our enterprise security ... position requires a strong software engineering background, but focuses on reliability , scalability, and operational excellence. A strong candidate excels in… more
- TP-Link North America, Inc. (Irvine, CA)
- …with simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team ... and tools + Help to mentor and train less senior members of the team + Ability to be...related field. + 5+ years of experience as a Site Reliability Engineer . + Proficiency… more
- NVIDIA (CA)
- …using high-performance NVIDIA infrastructure. Work with NVIDIA's DGX Cloud team as a Senior Site Reliability Engineer to maintain high-performance ... fields of our generation: Cloud Engineering, Cloud Infrastructure, and Site Reliability Engineering. If you're a creative engineer who enjoys autonomy and… more
- TP-Link North America, Inc. (Irvine, CA)
- …with simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a ... and tools. + Participate in mentoring and training less senior members of the team. + Be part of...related field. + 1-3 years of experience as a Site Reliability Engineer or in… more
- The Walt Disney Company (Anaheim, CA)
- …closely with the Disneyland Resort, Disney Cruise Line and Walt Disney World partners. The Senior Site Reliability Engineer will report to the Manager, ... Technology. **About The Role & Team** This Engineer will be expected to play multiple critical roles...Engineer will work with business partners to perform site walks, work on installation recommendations, hardware layout, sustainment… more
- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Senior Site Reliability Engineer to work in IPP (Infrastructure, Planning and Process). IPP is a global organization within ... NVIDIA. This group works with various other groups within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure needs. These cloud services provide almost… more
- NVIDIA (Santa Clara, CA)
- …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play ... a crucial role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build… more
- Tarana Wireless (Milpitas, CA)
- …internet speeds worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer , you will help us ... manage software that runs on the cloud and remotely manages millions of radio devices. You will work on a team and be a main point of contact during off shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage… more
- Google (Sunnyvale, CA)
- Senior Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Durham, NC, USA; Raleigh, NC, USA; +3 more; +2 more **Mid** ... meet some of our SREs. + Read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a software engineer … more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer , Site Reliability Engineering _corporate_fare_ Google _place_ Sunnyvale, CA, USA **Advanced** Experience owning outcomes ... Reliability Engineering (https://landing.google.com/sre/book.html) or read acareer profile (https://careers.google.com/stories/ site - reliability -engineering-profile-google/) about why a Software Engineer… more
- Amazon (Culver City, CA)
- …and studio executives at all levels. Our Infrastructure Engineering team is looking for Sr Site Reliability Engineers to build, deploy, operate, and sustain ... within existing frameworks, tools and processes to continuously improve systems. Site Reliability Engineers focus on automating infrastructure at scale… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Coinbase (Sacramento, CA)
- …Q3 2023. *What you'll be doing (ie. job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and ... service disruptions and automate incident response * Proactively find and analyze reliability problems across our business units and stack, then design and implement… more
- Palo Alto Networks (Santa Clara, CA)
- …actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: ... influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +...DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more
- Rubrik (Sacramento, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- NVIDIA (Santa Clara, CA)
- …NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity ... efficiency of services and drive efficiency with software and hardware optimizations ( SR -IOV/ DPU) + Experience with Technologies like eBPF and XDP for Observability… more