- IBM (San Jose, CA)
- …Responsibilities Location preference the Silicon Valley area We're looking for an experienced Site Reliability Engineer to join our team. At IBM, the Software ... colo, and AWS/multi-cloud) + Admin-level Linux skills Required Technical and Professional Expertise + 3+ years of hands-on experience creating SaaS applications… more
- General Motors (Palo Alto, CA)
- …on this exciting journey toward a better future. **Responsibilities:** + Lead Site Reliability engineering effort to improve anomaly detection, platform ... Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability… more
- Palo Alto Networks (Santa Clara, CA)
- …+ Strong Linux administration, internals, and network troubleshooting + Experience in DevOps, Site Reliability, or infrastructure engineering + Expertise in ... moving towards the future where cloud-based applications are increasingly common. As a Site Reliability Engineer, you will develop the frameworks and pathways to… more
- NVIDIA (Santa Clara, CA)
- …on the world. NVIDIA is looking to hire a deeply technical, creative, and Staff Site Reliability Engineer to build, support and maintain the next generation AI ... powered enterprise products that improve engineering efficiency, data security, and power our product development....architects, and business teams to ensure optimal operation and reliability of applications. + Define and lead technical roadmap… more
- GRAIL (Menlo Park, CA)
- …please visit www.grail.com. GRAIL is seeking a Staff Software Engineer in our Site Reliability Engineering (SRE) team to help us improve security ... development lifecycle. Participate in system design reviews and provide valuable Site Reliability Engineer (SRE) insights during launch reviews, influencing… more
- Netflix (Los Gatos, CA)
- …partners. + Preferred - BS in Computer Science, Electrical or Computer Engineering (or equivalent professional experience). Our compensation structure consists ... the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About… more
- LinkedIn (Mountain View, CA)
- …and support career growth. Join us to challenge yourself with work that matters. The Site Reliability TPM team at LinkedIn is seeking a Staff Technical Program ... Science or related technical field, or equivalent practical experience * 5+ years professional experience in an engineering or technical team, managing technical… more
- IBM (San Jose, CA)
- …want to grow their career. We are seeking a skilled SRE to join our Platform Engineering team for Data and AI organization within IBM Software. As part of our team, ... managers, and other stakeholders to understand requirements and ensure the reliability of the platform. Continuous Improvement: Participate in post-incident reviews,… more
- Lacework (Mountain View, CA)
- …are a part of every conversation. + Develop best practices alongside engineering/operations teams to improve the scalability and reliability of internal ... we build and support observability tooling and work with engineering to continually build more telemetry and observability into...processes. + Participate in an on-call rotation. Your Professional Profile: + 6+ years DevOps experience + Strong… more
- Amazon (Cupertino, CA)
- …tempo and quality. - 7+ years or more in software development, systems development, SRE (Site Reliability Engineering), or Resilience Engineering - 7+ ... join us - we are looking for builders like you. The AWS Hardware Engineering team creates server designs for Amazon's innovative web services. Our designs are… more
- Amazon (Cupertino, CA)
- …Preferred Qualifications - 7+ years or more in software development, systems development, SRE (Site Reliability Engineering), or Resilience Engineering - ... join us - we are looking for builders like you. The AWS Hardware Engineering team creates server designs for Amazon's innovative web services. Our designs are… more
- NVIDIA (Santa Clara, CA)
- As a Sr Manager in Site Reliability Engineering (SRE), you will lead a team dedicated to the design, construction, and maintenance of expansive production ... What We Need To See: + Extensive experience in a senior-level role within Site Reliability Engineering, particularly in managing storage infrastructure. +… more
- Amazon (Santa Clara, CA)
- …centers. The Engineering Technician will help ensure overall availability and reliability to meet or exceed defined service levels of data center operations. The ... must maintain better than 99.999% uptime. The Data Center Engineering Technician will continue to maintain high reliability...facility and rack level events - Ensure all personnel on-site follow safety protocols - Work on-call and a… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- …incidents in order to prevent recurrence. **Certifications and Licenses:** + Professional Engineering licensure highly preferred **SLAC Manager Competencies:** + ... fabrication, office and conference room spaces, on a 426-acre site in Menlo Park, CA. Under the general direction...Other duties may also be assigned. + Manage Systems Engineering & Reliability Assessments. + Lead and/or… more
- Amazon (San Jose, CA)
- …on concurrent projects, sometimes in multiple geographical regions. - Initiate and lead engineering site audits within Amazon's owned or colo data centers. - ... able to showcase your in-depth understanding of data center design, engineering, and operations of infrastructure common to data centers, telecommunications… more
- Amazon (Sunnyvale, CA)
- …business and customer problems. Systems Development Engineers perform traditional Systems Engineering and write or develop scripts, applications, or mechanisms to ... GUIs to implement infrastructure as code and even server-less systems. Enterprise Engineering: Enterprise Engineering owns the key products, services, and tools… more
- Amazon (Cupertino, CA)
- …mentorship and other career-advancing resources here to help you develop into a better-rounded professional. We are open to hiring candidates to work out of one of ... Seattle, WA, USA Basic Qualifications - 4+ years of engineering team management experience - 4+ years of working...- 4+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience… more
- Amazon (Cupertino, CA)
- Description The Hardware Engineering - Security Monitoring Team is part of AWS Engineering, which is one of the world's largest infrastructure as a service ... resources here to help you develop into a better-rounded professional. We are open to hiring candidates to work...2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience… more
- Amazon (Cupertino, CA)
- …locations: Cupertino, CA, USA Basic Qualifications - 5+ years of non-internship professional software development experience - 5+ years of programming with at least ... 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience...Experience as a mentor, tech lead or leading an engineering team - Experience developing embedded systems - Experience… more
- Amazon (East Palo Alto, CA)
- …such as Java, C++, or C# including object-oriented design. - Knowledge of professional software engineering & best practices for full software development life ... tools and technologies, setting up dashboards, and ensuring the scalability and reliability of the observability infrastructure. * Develop and integrate tools for… more