- TikTok (San Jose, CA)
- …with industry trends, best practices, and emerging technologies related to site reliability and infrastructure engineering. Qualifications Minimum Qualifications ... this job Share this listing: Responsibilities Team Introduction Our Compute Platform SRE team supports all Big Data services...the team's future together. We are responsible for the reliability of all the company's major data warehouse products,… more
- Amazon (Cupertino, CA)
- Description The AWS Mainstream Compute team drives system innovation in the servers used across all major Amazon Web Services - EC2, S3, DynamoDB etc. Our engineers ... We are looking for a seasoned Senior System Development Engineer engineer to build and own the...as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations… more
- Amazon (Cupertino, CA)
- Description The AWS Mainstream Compute team drives system innovation in the servers used across all major Amazon Web Services - EC2, S3, DynamoDB etc. Our engineers ... We are looking for a seasoned Senior System Development Engineer engineer to build and own the...as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations… more
- Amazon (Cupertino, CA)
- Description Serverless Compute (https://aws.amazon.com/serverless/?nc2=type\_a) is changing the way we think about computing in the cloud. Serverless computing ... services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to… more
- Amazon (Santa Clara, CA)
- …scale. EC2 Nitro drives the planet's largest, fastest growing and most feature-rich compute cloud. Nitro is AWS's ground-up design for virtualization at global scale ... experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience… more
- Amazon (Sunnyvale, CA)
- Description We are looking for a Sr Embedded Software Development Engineer to help design, develop, and integrate our next generation devices. In this role you will ... Amazon Lab126 is an inventive research and development company that designs and engineer 's high-profile consumer electronics. Lab126 began in 2004 as a subsidiary of… more
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve engineer 's productivity. As a Site Reliability Engineer , you are responsible for the big ... you will provide leadership in the design and implementation of ground breaking compute clusters that powers all silicon development across NVIDIA. We seek an expert… more
- Amazon (Cupertino, CA)
- …AI platforms for the world's largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff to ... Engineering, Physics or related field, or equivalent experience - 7+ years of Reliability Engineering work experience with server compute platforms or on… more
- Google (Sunnyvale, CA)
- …and technologies. + Experience in building large-scale operations capabilities in Site Reliability Engineering. Google Cloud's software engineers develop the ... developing infrastructure, distributed systems, or networks, or experience with compute technologies, storage, or hardware architecture. + 5 years...on and is growing every day. As a software engineer , you will work on a specific project critical… more
- Amazon (Sunnyvale, CA)
- …startups through the Fortune 500. We are looking for an experienced kernel software engineer to drive development for new EC2 compute platforms. In this role, ... as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations...day in the life As a Nitro Operating Systems engineer , your day centers around developing and optimizing the… more
- Amazon (Sunnyvale, CA)
- …startups through the Fortune 500. We are looking for an experienced kernel software engineer to drive development for new EC2 compute platforms. In this role, ... as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations...day in the life As a Nitro Operating Systems engineer , your day centers around developing and optimizing the… more
- Amazon (Cupertino, CA)
- …welcomes bold ideas and empowers you to own them to completion. Server Hardware Engineer (aka Lead Engineer (LE)). Amazon Web Services (AWS) Hardware Engineering ... team creates compute , storage, accelerator, and enterprise servers for Amazon's innovative...key factors such as total cost of ownership, quality, reliability , performance, and serviceability. You will be an end-to-end… more
- Amazon (Cupertino, CA)
- …services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to ... UC organization, you'll support the development and management of Compute , Database, Storage, Internet of Things (IoT), Platform, and...empowers you to own them to completion. Server Hardware Engineer (aka Lead Power Design Engineer ). Amazon… more
- Amazon (East Palo Alto, CA)
- …business challenges into technological breakthroughs? Join Amazon as a Software Development Engineer (SDE) and help shape the future of global commerce. At Amazon, ... services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to… more
- Amazon (Cupertino, CA)
- …services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to ... UC organization, you'll support the development and management of Compute , Database, Storage, Internet of Things (IoT), Platform, and...use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team… more
- Amazon (Cupertino, CA)
- …drivers. - 5+ years or more in software development, systems development, SRE ( Site Reliability Engineering), or Resilience Engineering - 5+ years of server ... AWS server platform teams, eg AI/ML servers, storage servers, compute servers, etc. Given the sheer number of programs...- 2+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience… more
- Microsoft Corporation (San Jose, CA)
- …services through their full lifecycle: design → development → testing → deployment → site reliability and live- site response. + Mentor engineers, influence ... Model (LLM) inference and training. As a Senior Software Engineer - Azure Storage, you will design and build...low-latency data paths and distributed storage protocols to performance-tuned compute and metadata services. Your focus will be on… more
- Amazon (Cupertino, CA)
- …will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power for ML in the cloud. This is all enabled by cutting edge ... in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will...as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations… more
- Amazon (Sunnyvale, CA)
- …(SDN) networking in one of the world's biggest public clouds? The Amazon Elastic Compute Cloud (EC2) VPC Dataplane team owns the packet pipeline that runs right ... knowledge. Key job responsibilities Your responsibilities will include: * Being an engineer on a small team, mentoring junior engineers, ensuring the right… more
- Amazon (Sunnyvale, CA)
- …solutions that enable distributed training of trillion-parameter models across thousands of compute nodes on AWS infrastructure. Our team is responsible for creating ... communication optimization, and ultra-high-bandwidth inter-rack connectivity. As a senior engineer , you'll drive technical architecture decisions and lead the… more