GenAI Inference Architect Scale Jobs | Juju

GenAI Inference Architect…

Databricks Inc. (San Francisco, CA)

…data and AI company in San Francisco seeks a Staff Software Engineer for GenAI inference to lead its architecture and optimization efforts. Candidates should ... with at least 6 years of experience and an understanding of ML inference internals. Key tasks include collaborating on model features, optimizing the inference… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - GenAI…

Databricks Inc. (San Francisco, CA)

Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the architecture, ... low latency, and robust scaling. Your work will encompass the full GenAI inference stack: kernels, runtimes, orchestration, memory, and integration with… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect , Autonomous…

NVIDIA Corporation (Santa Clara, CA)

Senior Solutions Architect , Autonomous Driving - GenAI page is loaded Senior Solutions Architect , Autonomous Driving - GenAI Apply locations US, CA, ... and we are looking for an expert AV and GenAI Solutions Architect to help assist customers...VLMs, DiT, etc. Experience in deploying LLM models at scale on mainstream cloud providers (eg, AWS, Azure, GCP).… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer, AI/ML, AWS…

Amazon (San Francisco, CA)

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web Services ... Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer, AI/ML, AWS Neuron,…

Amazon (San Francisco, CA)

Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development ... kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia...ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Staff AI Performance Architect

Qualcomm (San Diego, CA)

…power enhancements into the HW to enable state of the art training capabilities. AI inference and training systems must scale to a large number of accelerators, ... servers and racks. Our devices must be designed to scale to handle the largest of today's models. The...training systems. Analysis of current accelerator and GPU architectures. Architect enhancements required for efficient training of AI models.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior ML/AI Architect

Quantiphi, Inc. (Boston, MA)

…design, multi-agent orchestration, LLM-based architectures, and scalable enterprise platforms.* Architect and design enterprise- scale agentic AI platforms, ... experience **Experience Level:**We are seeking a highly experienced Senior ML/AI Architect who will lead the architecture, design, and implementation of… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Principal Data Scientist, Expert Network

Intuit Inc. (San Diego, CA)

…UX. Deep understanding of Generative AI and other evolving technologies. Application of GenAI at scale in a production environment. Deep knowledge of ... influence our Customer Success strategy and drive business growth at scale . You will partner directly with cross‑functional leaders-across Product Management,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Lead Full Stack Developer - AI/ML Focus Seattle,…

Michael Baker International, Inc. (Seattle, WA)

…a highly skilled Lead Full Stack Developer with deep AI/ML expertise to architect , build, and scale intelligent, data-driven applications across our enterprise ... AI deployments. Incorporate MCP servers and distributed compute frameworks to support large- scale AI/ML inference and training. Productionize ML models and… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr. Staff Applied AI Engineer

Icon Ventures (San Francisco, CA)

…Experience building on a modern MLOps stack (feature mgmt, orchestration, streaming, online inference at scale ) Compensation, Benefits & Perks Quizlet is an ... us to design and deliver AI-powered learning tools that scale across the world and unlock human potential. About...that boost learner outcomes and creator productivity. You will architect and ship a variety of models and modeling… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Principal Engineer, AI Agents

Teradata Corporation (SE) (Boston, MA)

…data and AI platform in the world. In this role, you will: Architect Teradata's foundational AI strategy - driving capabilities that enable agentic AI, ... GenAI , and advanced analytics on the Teradata VantageCloud platform....other ecosystem partners. Define reusable patterns for embedding AI/ML inference into customer-facing workloads, with strong emphasis on reliability,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Principal Engineer, AI Agents

Teradata Corporation (SE) (Honolulu, HI)

…data and AI platform in the world. In this role, you will: Architect Teradata's foundational AI strategy - driving capabilities that enable agentic AI, ... GenAI , and advanced analytics on the Teradata VantageCloud platform....other ecosystem partners. Define reusable patterns for embedding AI/ML inference into customer‑facing workloads, with strong emphasis on reliability,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Cloud Platform Engineer - Agent Cloud

Rubrik, Inc. (Palo Alto, CA)

…- streaming telemetry, audit trails, and behavioral analytics across thousands of agents. Architect and scale systems that handle millions of agent decisions per ... possible for organizations to operate production‑grade AI agents at scale . As a member of the Rubrik Agent Cloud...- including model gateways (like LiteLLM or MCP), fine‑tuning, inference optimization, or policy enforcement in AI workloads. Strong… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Backend Engineer - Core AI

Anzen Technologies, Inc. (San Francisco, CA)

…evaluating results and rolling them out with the appropriate infrastructure. Architect production‑grade systems that marry real‑time inference with ... best investors in the world, and are continuing to scale our team in Mexico, Canada, and the United...Broad ML toolbox, from classic ML (especially NLP) to GenAI . Strong backend fundamentals: distributed systems, streaming data pipelines.… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr. Staff Engineer, Machine Learning Engineering…

Qualcomm (San Diego, CA)

…the Edge - including model fine tuning, hardware acceleration, model quantization, edge inference and related fields. Come join us on this exciting journey. The ... and software engineers who work with cutting edge AI frameworks and tools. Architect , design, develop and test model optimization techniques that include - but are… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Sr. Applied Scientist, Alexa Connections

Amazon (San Francisco, CA)

…Proof Of Concepts advancing the state of the art in AI & ML for GenAI . Collaborate with cross-functional teams to architect and execute technically rigorous AI ... core capabilities, ensuring a seamless and intuitive user experience. Develop new inference and training techniques to improve the performance of Large Language… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Data Engineer ( Boston or Chicago )

Forsta (Boston, MA)

…automation to streamline the development and deployment of AI solutions. Architect robust, reliable solutions for specific AI applications using appropriate ... deliver complex data products to power training and online inference of AI systems. Deploy ML models, LLMs and... of AI systems. Deploy ML models, LLMs and GenAI systems into production, ensuring reliability, efficiency, and scalability… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Sr Engineer, Machine Learning Engineering (ML…

Qualcomm (San Diego, CA)

…the Edge - including model fine tuning, hardware acceleration, model quantization, edge inference and related fields. Come join us on this exciting journey. In this ... engineers who work with cutting edge AI frameworks and tools. You will architect , design, develop, test, and deploy on-device prototype software for cutting-edge AI… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Sr. ML Performance Engineer, AWS Neuron, Annapurna…

Amazon (Cupertino, CA)

…Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training performances. They are enabled through a state-of-the-art… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect , Autonomous…

NVIDIA (Santa Clara, CA)

…the world's leading AI company, and we are looking for an expert AV and GenAI Solutions Architect to help assist customers with adoption of NVIDIA's full-stack ... and ride sharing algorithms among other things. A Solutions Architect is the first line of technical expertise between...hands-on technical mentorship to partners and customers on Nvidia GenAI stack. Guide customers to develope and deploy Agentic… more

NVIDIA (10/23/25)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search