- NVIDIA (Santa Clara, CA)
- …Administrator/DevOps engineers to design, develop and implement a global, dynamic, innovative Service Reliability Operations Center, to provide extraordinary ... you will partner with other key members of our organization including Site Reliability Engineering, Security Operations Center, DevOps teams, and other partners… more
- ServiceNow, Inc. (San Diego, CA)
- …people at the problem-we **engineer it away** with software. You'll join Network Reliability & Resiliency (NR2): a diverse crew of network, software, hardware, and ... operations pros who reduce mean time to mitigate and...is expanding in scale and complexity. We need a senior leader / builder who can **own design through… more
- Coinbase (Sacramento, CA)
- …company-wide system's reliability and less customer impact. As a *Senior Software Engineer* you will help to promote reliability culture across Coinbase. ... and alignment. Attendance is expected and fully supported. *Core Reliability team* is a vital part of Infrastructure (Platform)...to scale the system by 10-50x and help secure service configurations & secrets by building/enhancing world class … more
- NVIDIA (Santa Clara, CA)
- …reduce network disruptions and decrease Mean Time to Recovery (MTTR), improving overall service reliability and user satisfaction. + Work closely with Security ... GeForce Now is looking for a Manager, Network Site Reliability Engineer (SRE) to enhance our network infrastructure and...position focuses on managing Network SRE to streamline network operations, minimize manual tasks, and achieve service … more
- Coinbase (Sacramento, CA)
- …Whether the customer needs are trading, staking, governance, custody, web3 operations, or API integrations, Coinbase Institutional has them covered. The ... Q3 2023. *What you'll be doing (i.e., job duties):* * Improve observability, reliability and availability by defining and measuring key metrics * Build automation and… more
- Palo Alto Networks (Santa Clara, CA)
- …DevOps/SRE engineer with a passion for technology and a strong motivation for high reliability at the service level + Observability Tools: High proficiency with ... our systems' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you...monitoring and alerting tasks by building tools for cloud operations, such as automated remediation of known issues and… more
- General Motors (Sunnyvale, CA)
- …exciting journey toward a better future. From engineering to product management and operations, GM is looking for people who can combine a passion for technology ... you will develop and maintain key elements of the infrastructure health and reliability monitoring for GM's commercial fleet. We are an innovation first team, and… more
- Rubrik (Sacramento, CA)
- …FedRAMP requirements * Develop and automate Security tasks that span from Security Operations to Infrastructure as Code in support of InfoSec initiatives * Manage ... and services with the objective of achieving and exceeding availability and reliability goals * Manage and streamline monitoring systems to enhance observability and… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site Reliability Engineer focused on HPC storage and play a crucial role ... deploying distributed storage solutions, build automation tools, and ensuring the efficient operations of our growing IT ecosystem. You will collaborate closely with… more
- Tarana Wireless (Milpitas, CA)
- …speeds worldwide, bridging the digital divide in ways previously thought impossible. As a Senior Site Reliability Engineer, you will help us manage software that ... shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage...ngFWA technology has been embraced by more than 300 service providers in 24 countries. Tarana is headquartered in… more
- C&W Services (Hesperia, CA)
- …Corrective Action activities at the site + **Process Improvement:** Engage Amazon Senior Operations personnel to identify process challenges and coordinate ... **Job Title** Reliability Engineer **Job Description Summary** **Job Description** **Our...+ **Team Collaboration:** Collaborate with various teams, including Fulfillment Operations, Operations Engineering, and Safety, to optimize… more
- Amazon (Cupertino, CA)
- …cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna...with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws… more
- Amazon (Cupertino, CA)
- …across cross-geographical ODMs and CMs. As part of the Manufacturing, Quality and Reliability Team in AWS Annapurna Labs focused on Machine Learning products that ... - from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to...performance at low cost. The Trainium Manufacturing, Quality and Reliability Team is part of AWS Annapurna Labs focused… more
- Amazon (Culver City, CA)
- …takes you! Basic Qualifications - Minimum of 10 years of hands-on systems reliability engineering and providing senior level technical direction on enterprise ... Video's Studios Technology Services team is searching for a Manager, Site Reliability Engineering. The Studios Technology Services team supports our Media Supply… more
- Palo Alto Networks (Santa Clara, CA)
- …+ Experience in navigating the complexities of business requirements and ensuring high service reliability in a dynamic environment. + US citizenship for FedRAMP ... Career** We are actively seeking a highly motivated DevOps/SRE Senior Manager to lead our Global InfoSec SRE team,...The InfoSec SRE group is fundamental to ensuring the reliability and availability of the production environment that hosts… more
- Red Hat (Sacramento, CA)
- **About the Job** The Red Hat ROSA OpenShift Site Reliability Engineering team is seeking a Site Reliability Engineering (SRE) Manager to join our team. ... is Enterprise Kubernetes and SRE-P delivers Red Hat OpenShift Service on AWS (ROSA), Azure Red Hat OpenShift (ARO),...a team of SREs in both the development and operations of our managed OpenShift services. You will interact… more
- Southern California Edison (Thousand Oaks, CA)
- Join the Clean Energy Revolution Become a Senior Supervisor, Meter Operations at Southern California Edison (SCE) and build a better tomorrow. In this job, ... of Field Service Representatives and Supervising Field Service Representatives within the Metering Operations (MO)...Metering Operations (MO) group of T&D's Distribution Operations. The Senior Supervisor will be held… more
- PagerDuty (San Francisco, CA)
- PagerDuty, Inc. (NYSE:PD) is a global leader in digital operations management. Trusted by nearly half of both the Fortune 500 and the Forbes AI 50, as well as ... is growing, and we are looking for an experienced Senior Director of Corporate IT (End User Services/AI, Systems...Systems Engineering, and Enterprise Security) to lead our IT Operations at scale. In this role, you will architect,… more
- The Boeing Company (Long Beach, CA)
- …airplane program meetings and project reviews. + Support customers (ETOPS) and in-service reliability tracking (ETOPS). + Familiar with FAA regulations for ... performing Extended Operations Performance Standards (ETOPS) Engineer (Mid-Level, Lead or Senior) to join this talented and fast-paced team, reporting to the BGS… more
- Southern California Edison (Pomona, CA)
- Join the Clean Energy Revolution Become an Inspection Operations Sr. Advisor at Southern California Edison (SCE) and build a better tomorrow. In this job, you'll ... field inspections through cross-functional collaboration and industry benchmarking. As an Inspection Operations Sr. Advisor, your work will help power our planet,… more