- Meta (Menlo Park, CA)
- …and from software development to hardware / systems debug. **Required Skills:** Production Systems Engineer , Tooling Responsibilities: 1. Full-stack ... **Summary:** Meta is seeking an experienced Production Systems Engineer to...cycle of servers in production . Production Systems Engineers in this role build tooling … more
- Meta (Menlo Park, CA)
- …next-gen Artificial Intelligence platforms, which is a company-wide priority. **Required Skills:** Production Systems Engineer , Tooling Responsibilities: ... throughout the whole lifecycle of our programs (from early Hack to Mass Production ). 4. Enhance existing tooling and validation frameworks to support large-scale… more
- NVIDIA (Santa Clara, CA)
- …be applying strong programming skills and a deep understanding of the distributed systems design for crafting and building production -grade software. + Focus on ... a small and fast moving team, and we own production excellence of everything we develop, on all layers...+ Responsible for the big picture of how our systems relate to each other and utilizing a breadth… more
- Meta (Menlo Park, CA)
- …validation, supporting customer deployment, production issue triage. **Required Skills:** Production Systems Engineer , Cooling & Power Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...3. Drive power and cooling system integration with datacenter tooling . 4. Contribute to hacks to enable AI platform… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced candidate to join the Foundation Labs team as an Production Systems Engineer . The Production Systems ... accessible across multiple geographical regions along with offering standardized tooling , enabling the end users (engineers) to focus on...to Data Center labs through production . The Production Systems Engineer is a… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... health and life cycle of servers in production . **Required Skills:** Production Systems Engineer , Sustaining Responsibilities: 1. Develop robust,… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are the foundation ... lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI...suites for various architectures. 2. Proactively create experiments and tooling to detect and diagnose hardware, firmware, and software… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI...a Tech Lead, owning and proactive creating experiments and tooling to detect and diagnose hardware/firmware/software health issues, in… more
- Meta (Menlo Park, CA)
- …platforms, all the way to mass production and deployment. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: 1. Support ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...network interface system validation all the way to mass production 2. Proactively create experiments and tooling … more
- Meta (Menlo Park, CA)
- … production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: 1. ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...(eg NICs) interface integration. 2. Proactively create experiments and tooling to detect and diagnose hardware/firmware/software health issues. 3.… more
- Meta (Menlo Park, CA)
- …opportunities through all the hack activities the team supports. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... out (eg NICs) interface integration. 2. Proactively create experiments and tooling to detect and diagnose hardware/firmware/software health issues. 3. Develop… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Production Engineer with in-depth understanding of networking, systems , automation, and tooling to join the PE Network ... new network products that enable networking for AI training and Inference.A Network Production Engineer in this role would support leading Meta's server fleet… more
- Meta (Menlo Park, CA)
- …is looking for a Network Production Engineer with experience in networking, systems , tooling , and automation to join the Network Infra team. This team is ... of people using our applications globally! **Required Skills:** Network Production Engineer , Infrastructure Responsibilities: 1. Conceptualize, build, and… more
- Meta (Fremont, CA)
- …has either developed robust hardware system test plans, or validation planning as a systems engineer supporting AI/ML, compute, storage & network hardware in a ... **Summary:** The Systems Integration Engineer (SIE) is responsible...understand infrastructure deployment and operational requirements. 9. Collaborate with Production Engineers, Hardware Validation Engineers, Tooling Automation… more
- Meta (Menlo Park, CA)
- …"Apply to Job" online on this web page. **Required Skills:** Network Production Engineer Responsibilities: 1. Research, architect, develop and deploy scalable ... datacenter network architectures and related tooling . 2. Work closely with our hardware, software and...and troubleshoot our networks. 4. Develop automated network monitoring systems to mitigate and remediate network events. 5. Analyze… more
- Meta (Menlo Park, CA)
- …operate our worldwide Data Center network. **Required Skills:** Network Production Engineer , Datacenter Infrastructure Responsibilities: 1. Architect, design, ... devices.For this role, we are looking for a network engineer with software engineering skills who design, build, and...with our software teams to develop and implement the tooling required to provision, manage, monitor, and troubleshoot our… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Hardware Systems Engineer to join our Release to Production (RTP) team working on new NPI hardware. Our servers and data ... the value, enable go/no-go decisions and optimize these systems in production . **Required Skills:** Hardware Systems Engineer , NPI Responsibilities: 1.… more
- Meta (Menlo Park, CA)
- …hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer , Systems ML - Frameworks / Compilers / Kernels ... DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL… more
- Vitesse Systems (Newark, CA)
- SUMMARY: The Manufacturing Engineer III is responsible for working to improve the manufacturing process of furnace brazed cold plates and radio waveguides, along ... with developing supporting documentation for long run and high-volume production . We are looking for a precision-oriented professional with developed leadership… more
- NVIDIA (Santa Clara, CA)
- … systems design developing tools for running large scale private or public cloud systems in production . + Experience in one or more of the following: Python, ... strong background in computer science fundamentals who are interested in building tooling , reporting, automation, and AI to support the operational flywheel across a… more