- Headway Technologies, Inc. (Milpitas, CA)
- …corrective action, and proposes long-term improvement plan if neededTracks installation, site acceptance testing, and mass production readiness of new equipment. ... to identify and resolve repetitive equipment issues, improve equipment reliability , review procedures, preventative maintenance (PM) frequency, and minimize… more
- TEKsystems (Palo Alto, CA)
- Description: Role: Site Reliability Engineer (SRE for Cloud) Location: Remote Project - MUST live in Pacific coast time zone Duration: 1 year with possible ... extension Number of positions: 1 We urgently looking for 1 Site Reliability Engineer (SRE for Cloud), mid level, who are available asap with the following… more
- Rubrik (Palo Alto, CA)
- …infrastructure services run smoothly and have the capacity for future growth. As a Senior Site Reliability Engineer , you will be responsible for: + Ensure we ... technologies + Minimum 3-5 years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage + Strong Documentation… more
- Microsoft Corporation (Mountain View, CA)
- Microsoft is looking for a Senior Site Reliability Engineer to support and expand Viva Engage. Viva Engage (formerly Yammer) is the industry-defining social ... as we scale and modernize our tech stack. We are seeking a Senior Site Reliability Engineer who knows how to manage the conflicting priorities of keeping… more
- Rubrik (Palo Alto, CA)
- …to make an impact on product stability and success. **What you'll do:** As a Senior Site Reliability Engineer , you will be responsible for: + Manage and run ... technologies + Minimum 5 years of experience as a Development, DevOps or Site Reliability Engineer + Willing to provide 24/7 coverage + Strong Documentation… more
- GRAIL (Menlo Park, CA)
- …software development lifecycle. Participate in system design reviews and provide valuable Site Reliability Engineer (SRE) insights during launch reviews, ... information, please visit www.grail.com . GRAIL is seeking a Staff Software Engineer in our Site Reliability Engineering (SRE) team to help us improve… more
- NVIDIA (Santa Clara, CA)
- …on the world. NVIDIA is looking to hire a deeply technical, creative, and Staff Site Reliability Engineer to build, support and maintain the next generation ... architects, and business teams to ensure optimal operation and reliability of applications. + Define and lead technical roadmap...are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want… more
- TEKsystems (Santa Clara, CA)
- Description: As a Senior Site Reliability Engineer , you will have the responsibility for provisioning and operating our high-availability systems that ... data center environments. * Proven track record of technical leadership in site reliability . * Excellent problem-solving skills and an ability to thrive under… more
- EPAM Systems (San Jose, CA)
- EPAM is hiring a **Remote Lead Site Reliability Engineer ** . If you are looking for a high-impact, exciting role with a company that leads the globe in the ... the value of SRE, mentor and train other engineers around proactive reliability decision making and planning + Review code instrumentation with development teams… more
- Insight Global (San Jose, CA)
- Job Description A large networking and software company is looking for a Remote Site Reliability Engineer /Infrastructure Engineer focused in AWS and ... Terraform who has design and implementation experience. This person will create and manage AWS Infrastructure based on Infrastructure as a code principles leveraging Terraform and TerraGrunt. This person will execute stories in the kanban model, analyze and… more
- Walmart (Sunnyvale, CA)
- Position Summary What you'll do **Principal Site Reliability Engineer :** This position is responsible for the operation of a department. An individual in ... align with site environment changes. Integrates the business goals of site reliability engineering and site safety engineering. Trains team members on… more
- General Motors (Palo Alto, CA)
- …this exciting journey toward a better future **.** **Responsibilities:** + Lead Site Reliability engineering effort to improve anomaly detection, platform ... + Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability… more
- Splunk (San Jose, CA)
- …journey! **Role:** _Splunk_ 's _Cloud_ Services group is looking for aS _ite Reliability_ Engineer to help lead, design and build the next generation of our _large ... scale cloud_ offering. You will be working on core services and applications that form the primitives for our current and future cloud service offerings. _Site Reliability_ Engineers in this role will be engaging with multiple service owners across the… more
- Netflix (Los Gatos, CA)
- …the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About ... a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting… more
- Zoom (San Jose, CA)
- …clusters within different infrastructures. You will also design and implement reliability best practices to accomplish a highly available service (99.99%). ... Additionally, you will identify and fix problems in Kubernetes operators, submitting code fixes to OSS if needed. Contributing to capacity planning, and anticipating performance bottlenecks are critical to success. You will also troubleshoot production issues… more
- Palo Alto Networks (Santa Clara, CA)
- …with the DevOps and the RND team to develop new features and maintain high reliability for our SAAS Products (XDR, XSIAM, XSOAR and XSPANSE) + Work with the US ... and Israeli DevOps teams to provide follow-the-sun operational coverage in the production of our SaaS product + Build automated tools for cloud operations such as automated remediation of known issues, auto-scaling, etc. + Collaborate with the US SRE team to… more
- Zoom (San Jose, CA)
- …& experience. We also have a location based compensation structure; there may be a different range for candidates in this and other locations. Ways of WorkingOur ... structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting. BenefitsAs part of our award-winning workplace culture and commitment… more
- Lacework (Mountain View, CA)
- …best practices alongside engineering/operations teams to improve the scalability and reliability of internal processes. + Participate in an on-call rotation. Your ... Professional Profile: + 6+ years DevOps experience + Strong development and automation skills. + Extensive experience with CI/CD pipelines and Infrastructure as Code (Terraform, CloudFormation, etc). + Extensive experience with a variety of AWS services (eg… more
- Meta (Sunnyvale, CA)
- …backgrounds, from new grads to industry experts. Relevant industry experience is important ( Site Reliability Engineer (SRE), Systems Engineer , Software ... Engineer , DevOps Engineer , Network Engineer , Systems Administrator, Linux Administrator, Database Administrator or similar role), but ultimately less so than… more
- Google (Sunnyvale, CA)
- …years of experience designing, analyzing, and troubleshooting large-scale distributed systems. Site Reliability Engineering (SRE) combines software and systems ... Google Cloud's services-both our internally critical and our externally-visible systems-have reliability , uptime appropriate to customer's needs and a fast rate of… more