- NVIDIA (Santa Clara, CA)
- … solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions Architect ( Inference Focus), you'll collaborate closely with our ... and offer technical mentorship to customers implementing AI at scale. + Architect zero-downtime deployments , autoscaling (eg, HPA or equivalent experience with… more
- NVIDIA (Santa Clara, CA)
- …+ Experience with containerization and orchestration technologies, monitoring, and observability solutions for AI deployments + Strong knowledge of the ... NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that...and work on exciting projects and proof-of-concepts focused on inference for Generative AI and Large Language Models (LLMs).… more
- NVIDIA (Santa Clara, CA)
- …to define the next era of computing. NVIDIA is searching for an AI/ML Solutions Architect focusing on Hyperscale customers and Cloud Service Providers. Your ... to lead software customer technical engagement for AI training, inference and infrastructure being deployed at vast scale. You...as at the customer to ensure successful and trouble-free deployments . If you would you like to partner with… more
- NVIDIA (Santa Clara, CA)
- …bring AI solutions to our largest customers. We are seeking an expert Solutions Architect to assist customers in building AI/ML and HPC software solutions ... team, you will collaborate with strategic customers, providing end-to-end technology solutions and technical support based on our product strategy. Come join… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for an AI Solutions Architect with hands-on experience in efficient AI model training and/or deployment for a customer facing role. Primary ... and reduce infrastructure costs. + Leading and developing proof-of-concepts for AI solutions applied to the Consumer Internet industry, including areas like LLMs and… more
- NVIDIA (Santa Clara, CA)
- …Platforms team seeks a technical product manager to accelerate next-generation inference deployments through innovative libraries, communication runtimes, and ... the boundaries of what is possible with their AI deployments ! For Inference , we are the champions...to hear from you! What you'll be doing: + Architect developer-focused products that simplify high-performance inference … more
- Amazon (Santa Clara, CA)
- …spearhead the development of our large language model foundations and agentic AI solutions for customer service automation. You will architect scalable chatbot ... quantization, parallelization, and caching to balance performance and latency * Architect multi-lingual, multi-channel solutions spanning chat, email, and other… more