- NVIDIA (Santa Clara, CA)
- Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer . At NVIDIA, you'll be part of the team shaping the future of computing and ... guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: + Own the solutions you build, collaborating… more
- Lockheed Martin (Sunnyvale, CA)
- **Description:** As a Site Reliability Engineer , you will: * Design, implement, and maintain highly available and scalable systems and infrastructure to ... and integrity of the classified system **Basic Qualifications:** * Experience in site reliability engineering, DevOps, or a related field, with a focus on… more
- NVIDIA (Santa Clara, CA)
- …and drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big ... and operating large scale compute infrastructure + Proven experience in site reliability engineering for high-performance computing environments with operational… more
- Palo Alto Networks (Santa Clara, CA)
- …and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering + DevOps/SRE Expertise - 4+ years of experience ... our systems' performance and health. **Your Impact** As a Senior SRE with the Cortex Cloud Security Posture Management...influence the operability of the product and ensure the reliability and availability of our services **Your Experience** +… more
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big ... comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Develop, define and document standard methodologies… more
- NVIDIA (Santa Clara, CA)
- …large-scale systems supporting critical use cases for AI Infrastructure, driving reliability , operability, and scalability across global public and private clouds. + ... + Build tools and frameworks to improve observability, define actionable reliability metrics, and enable fast issue resolution, driving continuous improvement in… more
- Rubrik (Palo Alto, CA)
- …and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and ... visibility * Perform Production Readiness Assessments of new services to identify reliability needs and surface potential gaps * Develop and maintain documentation… more
- Palo Alto Networks (Santa Clara, CA)
- …actionable insights into our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise: ... influence the operability of the product and ensure the reliability and availability of our services. **Your Experience** +...DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong… more
- ServiceNow, Inc. (Santa Clara, CA)
- …in the future. **As a Senior Staff Machine Learning Engineer - Site Reliability Engineer you will:** + Contribute to the design, development and ... It all started in sunny San Diego, California in 2004 when a visionary engineer , Fred Luddy, saw the potential to transform how we work. Fast forward to today -… more
- General Motors (Mountain View, CA)
- …+ Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability ... future for generations to come. In this SRE SW Engineer role, you will develop and maintain key elements...and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an… more
- Palo Alto Networks (Santa Clara, CA)
- …configuration management with a framework such as Terraform, Helm + Experience in Site Reliability Engineering, Production Engineering, or DevOps + Passion for ... Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the App Services team, you will be part of… more
- Rubrik (Palo Alto, CA)
- …public cloud technologies + Minimum 1-3 years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage + Strong ... win, we want to talk to you! **About The Role:** Sr. Site Reliability Engineers at Rubrik are systems/software engineers who ensure that Rubrik's infrastructure… more
- SanDisk (Milpitas, CA)
- …to keep our world moving forward. **Job Description** We are seeking a Principal Engineer , Reliability Engineering to join our team in Milpitas, United States. ... deploy and maintain test infrastructure such as thermal chambers + Present reliability findings and recommendations to senior management and stakeholders +… more
- LinkedIn (Mountain View, CA)
- …role at a high-growth or web-scale technology companySuggested Skills:- Site Reliability Engineering (SRE)-Leadership-Large scale infrastructureLinkedIn is ... based in Sunnyvale, CA or San Francisco, CA.Key ResponsibilitiesServe as a senior technical leader driving the long-term reliability and observability strategy… more
- Silicon Valley Power (Santa Clara, CA)
- ** Senior Electric Utility Engineer ** Print (https://www.governmentjobs.com/careers/cityofsantaclaraca/jobs/newprint/4679157) ** Senior Electric Utility ... Services (AWS), and NVIDIA. **The Positions** SVP is seeking dynamic and innovative Senior Electric Utility Engineer candidates to fill three (3) vacancies in… more
- Walmart (Sunnyvale, CA)
- …of orders daily through our high-performance checkout services running in Edge and Cloud. As a Site Reliability Engineer in the CPC Team, you will work with ... criteria (for example, probability of failure, frequency of failure) to measure site reliability . Monitors site reliability conditions and new … more
- BD (Becton, Dickinson and Company) (Milpitas, CA)
- **Job Description Summary** Seeking an experienced Senior Staff Firmware Engineer to lead the development and evolution of high-quality instrument control ... and document firmware that meets product requirements with high reliability and robustness. You will provide significant technical leadership...to join our Milpitas, CA R&D team. As a Senior Staff Firmware Engineer , you will be… more
- Amazon (Palo Alto, CA)
- Description Come build the future as a Senior Software Development Engineer at Amazon, where you will be inspired working along best-in-class inventors and ... millions of people around the world. As an Amazon Senior Software Development Engineer , you will solve...Seller experience for billions around the globe. Whether building site wide features such as reviews and recommendations, category… more
- Microsoft Corporation (San Jose, CA)
- …on DPU based nodes to provide unmatched performance at the lowest cost. We are hiring Senior Software Engineer - Azure Storage to join us in the mission of ... developing and deploying DPU based storage. As a Senior Software Engineer in Azure Storage, you will drive and lead the design, implementation, and optimizations… more
- Amazon (Tracy, CA)
- …with the Amazon smile. Come join us on our journey! About the Role: As a Senior Automation Engineer , you will play a crucial role in maximizing equipment ... of thousands of products to hundreds of countries worldwide, every day. The Reliability & Maintenance Engineering (RME) team are the business partners that work… more