We interpreted Mountain View, CA as Mountain View, CA. Other options include: Mountain View (Contra Costa County), CA
- Meta (Menlo Park, CA)
- … production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI … more
- Meta (Menlo Park, CA)
- …platforms, all the way to mass production and deployment. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...Inference Accelerator (MTIA) program as a part of the AI /ML initiatives supporting large scale AI Training… more
- Meta (Menlo Park, CA)
- …path exploration opportunities through all the hack activities the team supports. **Required Skills:** Production Systems Engineer , AI Systems ... You'll be working on some of the most important AI System Platforms, being the gatekeeper for the server...implement systemic solutions to hardware health issues. 7. Leverage production experience to drive external and internal teams to… more
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...systems issues. 15. 2+ years of experience supporting AI or HPC systems and/or related … more
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Lead Responsibilities: 1. Lead ... **Summary:** Meta is seeking an experienced Production Systems Engineer to...systems issues. 18. 4+ years of experience supporting AI or HPC systems and/or related … more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI ... services, and data center operations teams to enable new systems that will be deployed in our production...to hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Lead… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI ... services, and data center operations teams to enable new systems that will be deployed in our production...Silicon hyperscalar bring up and validation. **Required Skills:** Hardware Systems Engineer , NPI AI Responsibilities:… more
- Meta (Menlo Park, CA)
- … based approach to the new product introduction (NPI) phase. **Required Skills:** Hardware Systems Engineer , AI NPI Responsibilities: 1. Drive and execute ... services, and data center operations teams to enable new systems that will be deployed in our production...strategy (hardware and software), with a focus on various AI /HPC hardware systems in datacenter applications. 2.… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
- Capital One (San Jose, CA)
- AI Engineer **Overview:** At Capital One, we...- scalability, cost, latency, throughput - of large scale production AI systems . + Contribute to ... are creating responsible and reliable AI systems , changing banking for good. For...be regularly worked. Cambridge, MA: $133,000 - $151,800 for AI Engineer McLean, VA: $133,000 - $151,800… more
- NVIDIA (Santa Clara, CA)
- …expertise will be crucial in driving down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable at all times. What You'll Be ... We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA,...+ Hands-On Coding & Optimization: Contribute to large-scale distributed systems with high-quality, production -level C++ and Python… more
- Walmart (Sunnyvale, CA)
- …**RAG frameworks** to lead the design, development, and deployment of advanced AI systems . This role involves architecting scalable solutions, integrating ... redefine customer experiences. We are seeking a **Principal, Software Engineer ** with deep expertise in **Generative AI **.... 2. **Architecture ; Scalability:** + Architect scalable, distributed AI systems with a focus on performance,… more
- NVIDIA (Santa Clara, CA)
- …and blameless postmortems + Be part of an on call rotation to support production systems + Write and review code, develop documentation and capacity plans, ... automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of...Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI /HPC… more
- Cisco (San Jose, CA)
- …(LLM). + Experience developing large-scale, complex models and deploying them in production systems . + Experience large-scale data processing and parallel ... our journey! Role: As the Senior Principal Machine Learning Engineer in the Artificial Intelligence group, you will be...roadmap for the team, as we develop the core AI /ML capabilities to power the entire Splunk product portfolio… more
- LinkedIn (Sunnyvale, CA)
- …Machine Learning and Artificial Intelligence Preferred Qualifications Experience in bringing large scale AI systems to production . PhD in Computer Science, ... within FAIT and across the company to realize these AI innovations. As a Principal Staff Engineer ...natural language processing, optimization Experience in building large scale AI models and systems Experience in large… more
- NVIDIA (Santa Clara, CA)
- …can connect to enterprise data sources and power search, chatbots and other gen AI applications + Develop platform and systems enabling unified experience across ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...and products that improve business efficiency and productivity. This engineer is expected to be familiar with concepts of… more
- Cisco (San Jose, CA)
- …learning technologies. The ideal candidate will help build and maintain scalable AI systems while ensuring robust deployment and operational excellence. ... part of our journey! **Role** As the Machine Learning Engineer , AI Platform in the Splunk ...Engineers and Applied Scientists to build efficient model serving systems + Monitor system performance and implement improvements for… more
- Intuit (Mountain View, CA)
- …into cloud environments. + Skilled in evaluating and monitoring the performance of AI technology in production and making necessary adjustments to ensure optimal ... Business Solutions Group in Intuit as a Staff Software Engineer . You will be working on building delightful and...software engineering and executing with high velocity + Launch AI integrations in production and evaluate their… more
- Rubrik (Palo Alto, CA)
- …with enterprise AI infrastructure. **What You'll Do:** + Build advanced generative AI systems : + Design systems leveraging state-of-the-art language and ... solutions that make it simple for organizations to build production -grade AI applications. As a member of...in fine-tuning and optimizing models + Familiarity with multimodal AI systems + Knowledge of modern … more