- NVIDIA (Santa Clara, CA)
- …you can make a lasting impact on the world. We are currently hiring an AI / ML Infrastructure Software Engineer at NVIDIA to join our Hardware Infrastructure ... with customers to identify and resolve infrastructure gaps, enabling innovative AI and ML research on GPU Clusters. Together, we can create powerful, efficient,… more
- Amazon (Cupertino, CA)
- …cost in the cloud, and it's all being enabled by AWS Neuron. Neuron is a Software that include ML compiler and native integration into popular ML frameworks. ... define and drive product strategy for Neuron Runtime and ML Infrastructure integration. You will be part of the...Product Management team, driving innovation in machine learning acceleration software . AWS Neuron is the software stack… more
- LinkedIn (Mountain View, CA)
- …large language models, to computer vision models. We optimize performance across algorithms, AI frameworks, data infra , compute software , and hardware to ... billions of parameters models and large scale feature engineering infra for all AI use cases from...of model parameters), agility (experiment with hundreds of new ML models per quarter using thousands of features), and… more
- Meta (Menlo Park, CA)
- …operate in a multi-organization landscape. **Required Skills:** Technical Program Manager, AI Network Infra Responsibilities: 1. Lead technical program ... management of next-generation Artificial Intelligence/Machine Learning ( AI / ML ) platform(s) for Meta's Network Infrastructure in a matrix organization covering a… more
- Meta (Menlo Park, CA)
- …on rack and power capabilities needed to support emerging next-generation high powered AI / ML servers, and would require engagement with external vendors as well ... new generation power and rack platforms. These platforms are the foundation of our AI Training and Inference systems, and are a key enabler to supporting the… more
- Google (Sunnyvale, CA)
- …that enable orchestration of Google-scale services. Come build things that matter. The ML , Systems, & Cloud AI (MSCA) organization at Google designs, implements, ... the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI... software and hardware, including Google Cloud's Vertex AI , the leading AI platform for bringing… more
- Google (Mountain View, CA)
- …practical experience. + 10 years of experience in product management working on AI / ML products. Preferred qualifications: + Master's degree in a technology or ... field. + 7 years of experience working cross-functionally with AI / ML engineering, UX/UI, sales finance, and other...such as improving freshness and personalization recall. + Strengthen ML modeling infra to enhance relevance and… more
- Meta (Sunnyvale, CA)
- **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI ... web. We are hiring in multiple locations. **Required Skills:** Software Engineer, Systems ML - SW/HW Co-design...approaches 6. Apply in depth knowledge of how the ML infra interacts with the other systems… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI ... web. We are hiring in multiple locations. **Required Skills:** Software Engineer, Systems ML - SW/HW Co-design...approaches 6. Apply in depth knowledge of how the ML infra interacts with the other systems… more
- LinkedIn (Mountain View, CA)
- …large language models, to computer vision models. We optimize performance across algorithms, AI frameworks, data infra , compute software , and hardware. In ... team. Join us to push the boundaries of scaling AI . The AI Infra team...experience 2+ years of experience in hardware acceleration of AI / ML models Experience in deep learning frameworks… more
- Meta (Sunnyvale, CA)
- …and development of State of the Art Ads Recommendation technologies. You will work with Infra and ML Ranking teams of talented ML engineers, product ... Program Manager, ML Responsibilities: 1. Develop and manage end-to-end technical AI / ML product solutions and ensure on-time delivery. 2. Manage and own… more
- Meta (Menlo Park, CA)
- **Summary:** In this role, you will be a member of the AI Networking Software team and part of the bigger DC networking organization. The team develops and owns ... on the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Enabling… more
- Meta (Menlo Park, CA)
- **Summary:** In this role, you will be a member of the AI Networking Software team and part of the bigger DC networking organization. The team develops and owns ... on the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Tech-leading… more
- Google (Mountain View, CA)
- …in Computer Science, Statistics, Mathematics, or a related field. + Familiarity with ML production tools and lifecycle. Google Cloud's software engineers develop ... for the ML platforms at Google. The ML , Systems, & Cloud AI (MSCA) organization...the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI… more
- Meta (Menlo Park, CA)
- …The MTIA (Meta Training & Inference Accelerator) Software team is part of AI Infra PyTorch org. The team's mission is to explore, develop and help ... productize high-performance software and hardware technologies for AI at...AI at datacenter scale. Team has been developing AI frameworks to accelerate Meta's DL/ ML workloads… more
- Google (Sunnyvale, CA)
- …team's product is used by 20+ teams to power the rest of Cloud, powering the AI / ML infra , GCE infrastructure business. In this role, you will directly ... service" benefiting your long term career path under today's AI boosting environment. The ML , Systems, &...the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI… more
- Google (Sunnyvale, CA)
- …new problems across the full-stack as we continue to push technology forward. The ML , Systems, & Cloud AI (MSCA) organization at Google designs, implements, and ... Media Futures Group (MFG) stages, rack design, power, cooling infra (air and liquid). + Experience with the TPU...the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud's Vertex AI… more
- NVIDIA (Santa Clara, CA)
- …data analytics problems. Experience with Object Storage, Metadata Management, Data lake tools, AI / ML Infra + Computer science background with Distributed ... and storage infrastructure to address the most challenging problems faced by AI practitioners. These challenges include a) efficiently storing petabytes of data on… more
- Amazon (Sunnyvale, CA)
- Description Our Machine Learning training infrastructure ( ML Infra ) team is responsible for designing, implementing, and optimizing large-scale computing ... - Demonstrate significant innovation, creativity, and judgement when solving challenging AI / ML infrastructure problems. Identify future skills needed across your… more
- MongoDB (Palo Alto, CA)
- …pipelines, latency-aware routing, and model health monitoring + Collaborate with peers across ML , infra , and product teams to define architectural patterns and ... transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes...world's most popular developer data platform + Collaborate with ML experts from Voyage. ai to bring cutting-edge… more