- NVIDIA (Santa Clara, CA)
- …demands robust, automated, and secure production environments. We are seeking a deeply skilled Senior Staff Site Reliability Engineer (SRE) to advance ... position requires a strong software engineering background, but focuses on reliability, scalability, and operational excellence. A strong candidate excels in… more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer, Site Reliability Engineering | Google | Sunnyvale, CA, USA; New York, NY, USA **Advanced** ... + Master's degree in Computer Science or Engineering. **About the job** Site Reliability Engineering (SRE) combines software and systems engineering to… more
- NVIDIA (Santa Clara, CA)
- …is inspired to do their best work. We are seeking a highly skilled Principal Staff SRE to join our dynamic team. Our company is at the forefront of technological ... NTP/PTP, DHCP, and LDAP. This includes building for performance and reliability at global scale, covering automation, monitoring, high availability, capacity… more
- Amazon (Cupertino, CA)
- …cutting AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary ... Description The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna...staff to conceive and design infrastructure technologies. You will… more
- Genentech (South San Francisco, CA)
- **The Position** We are seeking a strategic and visionary Senior Manager, Facilities Data & Analytics to build and lead the data foundation for our Facilities & ... leader will transform how we leverage data to enhance infrastructure reliability, optimize capital planning, and drive operational excellence across our building… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …high-performance computing or AI infrastructure. Proven track record as a Principal or Senior Staff Engineer. + Expert-level knowledge of NVIDIA GPU architecture ... Engineer to join our team. This is a hands-on, senior individual contributor role that will be pivotal in...solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI… more
- Zscaler (San Jose, CA)
- …speed and agility with a cloud-first strategy. We are seeking an experienced Senior Staff Infrastructure Operations Engineer to join our team. This critical ... + 8+ years of experience working in infrastructure operations, DevOps, or site reliability roles + Demonstrated expertise in system observability, including… more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer, One Platform | Google | Kirkland, WA, USA; Sunnyvale, CA, USA **Advanced** Experience owning outcomes and ... building partnerships to drive design alignment and cross-functional success with Site Reliability Engineering (SRE), Engineering Productivity (Engprod), and… more
- Celestica (San Jose, CA)
- …Region: Americas Country: USA State/Province: California City: San Jose **Summary** The Senior Staff Engineer, Software develops, debugs, tests, deploys and ... etc.) and complies with the product life cycle development (phase/gate deliverables). The Senior Staff Engineer, Software works in cross functional teams with… more
- Amazon (San Jose, CA)
- …prioritize issues for whole program. You use data to drive manufacturability, reliability, and quality objectives in design and development teams. Key job ... and develop new manufacturing technology and methodologies to enhance PCBA quality, reliability, throughput, and cost. - Drive closure of all operational issues… more
- Amazon (Newark, CA)
- …of thousands of products to hundreds of countries worldwide, every day. The Reliability & Maintenance Engineering (RME) team are the business partners that work ... us on our journey! About the Role: As the Senior Regional Automation Engineer, you will engage on all...code/parameter sets, performance metrics, and feedback mechanisms to ensure reliability and operational efficiency of our equipment. What Do… more
- Amazon (Sunnyvale, CA)
- Description Amazon's AGI Information is seeking an exceptional Senior Software Development Engineer to drive advancements in the Amazon Knowledge Graph (AKG) ... Establish monitoring frameworks and define alerting strategies that ensure 24/7 reliability. Engage directly with customers to understand their needs and architect… more
- Amazon (Cupertino, CA)
- …is looking for a senior leader to deliver development, implementation, and reliability for our accelerated GPU offerings in AWS. In this role, you will lead ... role, you will define implementation strategy and report status frequently to senior leadership. Your work will directly enable Amazon's customers' ability to… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior High Performance Computing Engineer Job ID 6383 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **About SLAC:** The SLAC National ... the nature of this position, SLAC is open to on-site and hybrid work options.** **Position Overview:** As a Senior High Performance Computing Engineer in the Scientific Computing… more
- Amazon (East Palo Alto, CA)
- …we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... language experience - 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a… more
- Amazon (East Palo Alto, CA)
- Description As an Amazon Web Services (AWS) Senior Solutions Architect within the Strategic Accounts segment, you are responsible for partnering with our most ... AWS. Influence leaders on topics such as AI/ML, app modernization, reliability, and operational efficiency, security, cost, and performance. Working backwards from… more
- Amazon (Cupertino, CA)
- …we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... experience - 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Fundamentals of… more
- Amazon (Cupertino, CA)
- …we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... experience - 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Fundamentals of… more
- Amazon (Cupertino, CA)
- …we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... experience - 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Fundamentals of… more
- Amazon (Sunnyvale, CA)
- …peering with other public and private networks. A day in the life As a senior software engineer you will be responsible for leading the design of embedded software ... language experience - 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a… more