- General Motors (Mountain View, CA)
- …another relevant field; or equivalent real-world experience + Experience building PB scale ML data management systems (datalake, data warehouse) from the ground ... if more than 3 days]. **The Role** As an engineer on this team, you will be responsible for...in the cloud and providing powerful foundations for GM ML Data Platform tools, frameworks, and services. Responsibilities include… more
- LinkedIn (Mountain View, CA)
- …in LinkedIn's Sunnyvale, CA campus. About the Role We are seeking a Senior Staff Engineer to design, build, and maintain our large-scale GPU infrastructure for ... machine learning (ML) and AI workloads. In this role, you will...design, and integrate scalable storage solutions (e.g., parallel file systems, object storage, NVMe over Fabric) to meet performance… more
- Snap Inc. (Palo Alto, CA)
- …ranking and recommendation systems more efficient and impactful. We're looking for a Staff Software Engineer, ML Inference to join Snap Inc! What you'll ... privacy at the forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations… more
- General Motors (Mountain View, CA)
- …Onboard Embodied AI team is at the forefront of developing groundbreaking onboard ML systems powering fully autonomous vehicles. We leverage modern end-to-end ... field. + 8-10+ years of extensive experience developing and deploying advanced ML systems, particularly in end-to-end real-time onboard applications. + Proven… more
- Google (Sunnyvale, CA)
- …experience working in a complex, matrixed organization. + Experience with machine learning systems (e.g., background theory, TensorFlow, or other ML tools). + ... on and is growing every day. As a software engineer, you will work on a specific project critical...full-stack as we continue to push technology forward. The ML, Systems, & Cloud AI (MSCA) organization… more
- Google (Sunnyvale, CA)
- …team. Our stack is mostly C++ with some Go used for cluster services. The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages ... Go. Preferred qualifications: + Experience with system software, distributed systems, and complex multi-component software systems. +...on and is growing every day. As a software engineer, you will work on a specific project critical… more
- IBM (San Jose, CA)
- …most challenging problems? If so, let's talk. **Your role and responsibilities** As a Staff AI/MLOps Development Engineer at Apptio, you will work closely with ... in hybrid cloud environments. You will help design and engineer efficient and resilient MLOps platforms and software products...for a sizable team, in terms of project impact, ML system design, and ML excellence. *… more
- Google (Mountain View, CA)
- …distributed team of engineers. + Lead the design and implementation of recommendation systems, optimize ML infrastructure, and guide the development of model ... development. + 5 years of experience building and deploying recommendation systems models (retrieval, prediction, ranking, embedding) in production and experience… more
- Amazon (Cupertino, CA)
- …services. We seek candidates with strong programming skills, eagerness to learn complex systems, and basic ML knowledge. This role offers growth opportunities in ... Learning accelerators. This role is for a Machine Learning Engineer on one of our AWS Neuron teams: -...ML infrastructure, bridging the gap between frameworks, distributed systems, and hardware acceleration. About the team Annapurna Labs… more
- Snap Inc. (Palo Alto, CA)
- …precision, and always execute with privacy at the forefront. We're looking for a Staff Software Engineer, Machine Learning Infrastructure to join the AI Training ... Google Kubernetes Engine (GKE) and Sagemaker + Build comprehensive data management systems for scalable data collection, labeling, processing, and evaluation + Work… more
- Amazon (Cupertino, CA)
- …Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distributed Training team for AWS Neuron, responsible for development, ... enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLM) such as GPT and Llama, as well as… more
- Amazon (Cupertino, CA)
- …Key job responsibilities AWS Networking is looking for a Sr. Software Development Engineer to drive innovation in Machine Learning (ML) Network Performance. You ... and performance benchmarking systems to deeply understand the performance of the ML network and you will develop and deliver innovative solutions that drive ever… more
- Amazon (Cupertino, CA)
- …support the development and scaling of a compiler to enable the world's largest ML workloads to run performantly on these custom Annapurna systems. The Product: ... advancing AI capabilities. The Inferentia/Trainium chips specifically offer unparalleled ML inference and training performances. They are enabled through… more
- Amazon (Cupertino, CA)
- …team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance ... kernels for ML functions, ensuring every FLOP counts in delivering optimal...at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture,… more
- Amazon (Cupertino, CA)
- …used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the ... best-in-class ML training performance with the most teraflops (TFLOPS) of...in performance. You: As a Sr. Machine Learning Compiler Engineer III on the AWS Neuron team, you will… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the ... kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or...solving hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful solutions at… more
- Amazon (Cupertino, CA)
- …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This ... Web Services (AWS) is looking for a Software Development Engineer II to build, deliver, and maintain complex products...customers and raise our performance bar. You'll design fault-tolerant systems that run at massive scale as we continue… more
- Amazon (Santa Clara, CA)
- …the platform. If you have prior experience in building & scaling large scale systems, ML pipelines, and enjoy extracting maximum performance at every layer of ... changes in the industry. * Create solutions to run predictions on distributed systems with exposure to innovative technologies at incredible scale and speed. * Build… more
- Amazon (Cupertino, CA)
- …the Trn1 and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This ... enablement and performance tuning of a wide variety of ML model families, including massive scale large language models...(design patterns, reliability and scaling) of new and existing systems experience - - 5+ years of full software… more