- Meta (Menlo Park, CA)
- …platforms, all the way to mass production and deployment. **Required Skills:** Production Systems Engineer, AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...Inference Accelerator (MTIA) program as a part of the AI/ML initiatives supporting large scale AI Training…
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production. **Required Skills:** Production Systems Engineer, Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...systems issues. 15. 2+ years of experience supporting AI or HPC systems and/or related …
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production. **Required Skills:** Production Systems Engineer, Fleet AI Systems Lead Responsibilities: 1. Lead ... **Summary:** Meta is seeking an experienced Production Systems Engineer to...systems issues. 18. 4+ years of experience supporting AI or HPC systems and/or related …
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI/ML initiatives supporting large scale AI ... services, and data center operations teams to enable new systems that will be deployed in our production...Silicon hyperscaler bring up and validation. **Required Skills:** Hardware Systems Engineer, NPI AI Responsibilities:…
- Meta (Menlo Park, CA)
- … based approach to the new product introduction (NPI) phase. **Required Skills:** Hardware Systems Engineer, AI NPI Responsibilities: 1. Drive and execute ... services, and data center operations teams to enable new systems that will be deployed in our production...strategy (hardware and software), with a focus on various AI/HPC hardware systems in datacenter applications. 2.…
- Google (Goleta, CA)
- …of experimental data. + Experience with Linux, Python, and SQL. As a Cryogenic Systems Engineer, you will be responsible for improving the operations and ... the reliability, flexibility, and capacity of cryostats that Quantum AI uses for both research and production...maintained. + Manage test strategies for both prototype and production cryostat systems and components. + Plan…
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network…
- Meta (Sacramento, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network…
- Capital One (San Francisco, CA)
- Lead AI Engineer At Capital One, we are...- scalability, cost, latency, throughput - of large scale production AI systems. + Contribute to ... latest AI research and AI systems, and judiciously apply novel techniques in production...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700…
- NVIDIA (Santa Clara, CA)
- …expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable at all times. What You'll Be ... We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA,...+ Hands-On Coding & Optimization: Contribute to large-scale distributed systems with high-quality, production-level C++ and Python…
- Charles Schwab (San Francisco, CA)
- … Incubation and Enablement team is looking for a talented, technical, hands-on Senior Engineer to drive the development of innovative AI solutions. This position ... iterative software development using Large Language Models. The Senior Engineer on the AI Incubation and Enablement...building complex products from scratch and running them in production. + 3+ years of experience building applications…
- Abbott (San Diego, CA)
- …& Systems organization. The right candidate for this position will be an experienced AI Engineer who has demonstrated success in leading AI projects and ... female executives, and scientists. **The Opportunity** The **Senior Staff AI Machine Learning Engineer** is within our...and image processing. + Proven track record of developing AI solutions and deploying to production environment.…
- Walmart (Sunnyvale, CA)
- …**RAG frameworks** to lead the design, development, and deployment of advanced AI systems. This role involves architecting scalable solutions, integrating ... redefine customer experiences. We are seeking a **Principal, Software Engineer** with deep expertise in **Generative AI**.... 2. **Architecture & Scalability:** + Architect scalable, distributed AI systems with a focus on performance,…
- NVIDIA (Santa Clara, CA)
- …and blameless postmortems + Be part of an on-call rotation to support production systems + Write and review code, develop documentation and capacity plans, ... automation to improve researchers' productivity. As a Site Reliability Engineer, you are responsible for the big picture of...Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC…
- Skyworks (Irvine, CA)
- Senior AI Engineer Date: May 9, 2025 Location: Irvine, CA, US Company: Skyworks If you are looking for a challenging and exciting career in the world ... the way the world communicates. Requisition ID: 74855 Position Summary The Enterprise AI Engineer position will be responsible for designing and implementing…
- Cisco (San Jose, CA)
- …(LLM). + Experience developing large-scale, complex models and deploying them in production systems. + Experience with large-scale data processing and parallel ... our journey! Role: As the Senior Principal Machine Learning Engineer in the Artificial Intelligence group, you will be...roadmap for the team, as we develop the core AI/ML capabilities to power the entire Splunk product portfolio…
- Highmark Health (Sacramento, CA)
- …applications for our enterprise stakeholders! We are seeking an experienced Machine Learning Engineer to join our AI Platforms and Services team. In this ... Anywhere Role! Are you passionate about building intelligent systems that solve real-world problems? Do you thrive in...technology? If so, we invite you to join our AI Platforms and Services team as a Machine Learning…
- General Motors (Sacramento, CA)
- …end to end development lifecycle for artificial intelligence & machine learning. As the AI Safety Principal Engineer, you will need to stay current on industry ... and customers to define safety strategies and targets for AI/ML based autonomous driving systems, understand their...new machine learning solutions, assess the safety of existing production models and ML Ops in a cloud environment,…
- NVIDIA (Santa Clara, CA)
- …and develop roadmaps for production-level tools + Enable development of integrated systems - AI Blueprints that provide a unified, turnkey experience. + Help ... workflows. The NeMo Retriever team is looking for an AI Engineer to join our team, focusing...Delivery pipeline with the goal of moving changes to production faster and safer while ensuring key operational standards.…
- NVIDIA (Santa Clara, CA)
- …can connect to enterprise data sources and power search, chatbots and other gen AI applications + Develop platform and systems enabling unified experience across ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...and products that improve business efficiency and productivity. This engineer is expected to be familiar with concepts of…