- Walmart (Sunnyvale, CA)
- **Position Summary** **What you'll do** **Role summary:** The (USA) **Staff, Site Reliability Engineer** will design and implement scalable, secure, and ... messaging systems including MySQL, Redis, and Kafka, and deploy AI/ML models to address complex challenges. Collaborating across functions, ... or related area. SRE certification (for example, IBM Cloud Site Reliability Engineer). We value…
- Palo Alto Networks (Santa Clara, CA)
- …we all win with precision. **Your Career** We are looking for a proactive and innovative Site Reliability Engineer (SRE) to join our growing team. In this ... You will have the unique opportunity to leverage cutting-edge AI tools to redefine our operational practices and build ... secure infrastructure. **Your Experience** + Proven experience as a Site Reliability Engineer, DevOps…
- LinkedIn (Mountain View, CA)
- …on", every engineer to benefit from a more insightful and proactive site-wide reliability ecosystem, and every business and product owner to be well-informed ... Join us to transform the way the world works. Site Health Platform sits at the core of LinkedIn's Reliability Infrastructure organization, with a primary focus on the…
- McAfee, Inc. (San Jose, CA)
- **_Job Title:_** Site Reliability Engineer **_Role Overview:_** We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team, ... This role is ideal for someone passionate about toolchain reliability, automation, and the integration of AI-driven ... and incidents. **About You:** + 3+ years' experience in site reliability engineering, DevOps, or a related…
- Palo Alto Networks (Santa Clara, CA)
- …a direct and fulfilling impact on the future of AI Security. A Principal Site Reliability Engineer in Prisma AIRS embodies integrity, creativity, and a ... Alto Networks Prisma AIRS leads the industry in advanced AI Security Capabilities, including Runtime Security, Model Scanning and Red Teaming. As a Site Reliability Engineer, you will…
- Palo Alto Networks (Santa Clara, CA)
- …runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the ... This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability. Central Infrastructure & Platform Engineering Team | Santa…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... engineering. What we need to see: + 10+ years of experience in Site Reliability Engineering, Platform Engineering, or Cloud Architect roles. + BS degree…
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve engineer's productivity. As a Site Reliability Engineer, you are responsible for the big ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An ... troubleshooting from bare metal to application level, ensuring system reliability and efficiency. + Collaborate with specialist teams to…
- NVIDIA (Santa Clara, CA)
- …We take great pride in providing excellent, comprehensive support to our customers! Sr Site Reliability Engineer in this role will significantly impact and ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An ... or related field. + 8+ years of experience in site reliability engineering and/or software development roles. …
- IBM (San Jose, CA)
- …of IBM, where growth and innovation thrive. **Your role and responsibilities** As a Site Reliability Engineer, you will work in an agile, collaborative ... curious, we are a team dedicated to creating the world's leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless…
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and ... This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed…
- CBRE (San Jose, CA)
- Reliability Engineer Job ID 255063 Posted 09-Jan-2026 Service line GWS Segment Role type Full-time Areas of Interest Engineering/Maintenance Location(s) Fremont ... States of America **ABOUT THE ROLE** As a CBRE Reliability Engineer, you will monitor, analyze, and ... asset's maintenance plan. + Provide technical support to the site for mechanical equipment and fixed assets. + Work…
- NVIDIA (Santa Clara, CA)
- …by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts ... as the brains of computers, generative AI, robots, and self-driving cars that can understand the ... DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability,…
- Cisco (San Jose, CA)
- …takes these cutting-edge ASICs to production and we are looking for an experienced Reliability Engineer to join Quality and Reliability organization. Join an ... Cisco's Silicon Components business. **Your Impact** As a Product Reliability Engineer, you will play a critical ... data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly…
- Amazon (Cupertino, CA)
- …AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff to ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that designs cutting…
- Zscaler (San Jose, CA)
- …a cloud-first strategy. We're seeking a highly skilled and experienced SRE Platform Engineer to join our SRE Cloud Platform Engineering Team. Reporting to the ... databases (e.g., Clickhouse, Redis) + Knowledge of MLOps and Generative AI applications within SRE environments \#LI-Hybrid \#LI-CM3 Zscaler's salary ranges are…
- Cisco (San Jose, CA)
- …spear in interacting with our customers. Our CRE team adapts the best practices of Site Reliability Engineering (SRE) and applies them to our customers. As part ... a proactive approach vs a reactive approach to customer reliability and you will use existing data to help ... data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly…
- Cadence Design Systems, Inc. (San Jose, CA)
- …an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + ... Proven track record as a Principal or Senior Staff Engineer. + Expert-level knowledge of NVIDIA GPU architecture and…
- Amazon (Cupertino, CA)
- …and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a ... particular focus on large-scale generative AI applications. Key job responsibilities * Architect and lead ... * Drive technical excellence in performance optimization and system reliability across the Neuron ecosystem * Design and implement…
- Amazon (Cupertino, CA)
- Description Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the world's most advanced cloud for AI training and ... deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you're passionate about pushing the limits…