- Amazon (Cupertino, CA)
- …The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's ... ML accelerators. Working across the stack from PyTorch till the hardware- software boundary, our engineers build systematic infrastructure, innovate new methods and… more
- Amazon (Cupertino, CA)
- …lifecycles along with work experience on some optimizations for improving the model execution. - Software development experience in C++, Python (experience ... at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to… more
- Amazon (Cupertino, CA)
- …Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a particular focus on large-scale generative AI ... Description AWS Neuron is the software stack powering AWS Inferentia and Trainium machine...resilient AI infrastructure at AWS. We focus on developing model -agnostic inference innovations, including disaggregated serving , distributed… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** As a **Senior Software Engineer - Machine Learning** , you are a technical leader working at the intersection of ... technologies and innovative solutions. **What you'll do:** + Design and implement scalable model serving platforms for both batch and real-time inference + Build… more
- LinkedIn (Mountain View, CA)
- …Online Learning and Serving performance optimizations across billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra ... optimizing the process for model training and serving . As an engineer in the team,...and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand opportunities… more
- LinkedIn (Mountain View, CA)
- …Online Learning and Serving performance optimizations across billions of user queries Model Training Infrastructure: As an engineer on the AI Training Infra ... optimizing the process for model training and serving . As an engineer in the team,...models and deliver the best performance possible. As a Software Engineer , you will have first-hand opportunities… more
- LinkedIn (Mountain View, CA)
- …Online Learning and Serving performance optimizations across billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra ... GNNs, Flash Attention. PyTorch Lightning and more and more. Model Serving Infrastructure: this team builds low...enabling GPU inference at scale. As a Sr. Staff Software Engineer , you will have first-hand opportunities… more
- Google (Mountain View, CA)
- …the core of the Ads Ecosystems and interact closely with bidding, auctions, targeting and serving . As a Senior Software Engineer , you will possess excellent ... Senior Staff Software Engineer , Ads Budgeting _corporate_fare_ Google...strategy, ML design, and working with ML infrastructure (eg, model deployment, model evaluation, data processing, debugging,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... optimize the GPU-accelerated software that powers today's most sophisticated AI applications. Our...frameworks, which are at the forefront of efficient large-scale model serving and inference. You will play… more
- MongoDB (Palo Alto, CA)
- …distributed systems, and multi-tenant service design + Familiar with concepts in ML model serving and inference runtimes, even if not directly deploying models ... **About the Role** We're looking for a Senior Engineer to help build the next-generation inference platform...focus on building core systems and services that power model inference at scale. You'll own key components of… more
- Microsoft Corporation (Mountain View, CA)
- …preferred, remote locations considered for very strong candidates. **Responsibilities** As a Principal Software Engineer on the team the common tasks of the job ... Artificial Intelligence Cloud Inference team at Microsoft develops AI software that enables running AI models everywhere, from world's...on the models hosted on the Azure OpenAI service serving some of the largest workloads on the planet… more
- Microsoft Corporation (Mountain View, CA)
- …Microsoft products, including Office, Windows, Bing, SQL Server, and Dynamics. As a Senior Software Engineer on the team, you will have the opportunity to work ... on the models hosted on the Azure OpenAI service serving some of the largest workloads on the planet...can thrive at work and beyond. **Responsibilities** As a Software Engineer on the team the common… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... optimize the GPU-accelerated software that powers today's most sophisticated AI applications. Our...vLLM, which are at the forefront of efficient large-scale model serving and inference. You will play… more
- Snap Inc. (Palo Alto, CA)
- …ranking and recommendation systems more efficient and impactful. We're looking for a Software Engineer , ML Infrastructure to join Snap Inc! What you'll do: ... recommendations + Develop high-performance inference systems to ensure fast and efficient AI model serving + Build infrastructure to perform scalable ML model… more
- Deloitte (San Jose, CA)
- Role Overview: As a Full-stack Software Engineer , you will actively engage in your engineering craft, taking a hands-on approach to multiple high-visibility ... modernized software and product delivery, creating a scalable, cost-effective model that focuses on value/outcomes that leverages a progressive and responsive… more
- LinkedIn (Mountain View, CA)
- …determined by the business needs of the team. We are seeking a Senior Staff Software Engineer to define and lead the technical strategy for Search and ... deliver impact by driving innovation while building and shipping software at scale. + You will work closely with...and business partners + You will be a role model and professional coach for engineers with a strong… more
- NVIDIA (Santa Clara, CA)
- …enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project, you will address some of the ... distributed GPU environments. By bringing to bear sophisticated techniques in serving architecture, GPU resource management, and intelligent request handling, Dynamo… more
- Palo Alto Networks (Santa Clara, CA)
- …components that enable high-velocity data ingestion, transformation, and Machine Learning model serving for DLP detections + **Real-time Decisioning:** Architect ... from the office full time, with flexibility when it's needed. This model supports real-time problem-solving, stronger relationships, and the kind of precision that… more
- Zscaler (San Jose, CA)
- …speed and agility with a cloud-first strategy. We're looking for an experienced Sr. Software Engineer to join our Digital Experience team. This role is hybrid ... countries. Bring your vision and passion to our team of cloud architects, software engineers, security experts, and more who are enabling organizations worldwide to… more
- Oracle (San Jose, CA)
- …agents that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation ... will work in a collaborative environment with applied scientists, ML engineers, and software teams to deliver performant and reliable AI infrastructure. This is a… more