- Etched.ai, Inc. (San Jose, CA)
- A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This ... role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more
- Capital One (Washington, DC)
- Senior AI Engineer (Gen AI Platform Services, Agentic Systems) Overview: At Capital One, we are creating responsible and reliable AI systems, ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 -… more
- Capital One (Annapolis, MD)
- Senior Manager, AI Engineering (People Leader) (GenAI Platform Services) Overview : At Capital One, we are creating responsible and reliable AI systems, ... us to be at the forefront of enterprises leveraging AI . From informing customers about unusual charges to answering...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …most advanced AI workloads worldwide. They're now building a serverless inference platform , beginning with cost-efficient batch inference and expanding ... Join a stealth-mode hyperscale data center startup building an AI and cloud platform , powered by thousands...miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency… more
- Red Hat (Boston, MA)
- …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
- Red Hat, Inc. (Boston, MA)
- …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
- quadric.io, Inc (Burlingame, CA)
- …unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform ; [2] optimize the model deployment for efficient ... and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the...; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI… more
- Akamai Technologies GmbH (Cambridge, MA)
- …scale Demonstrate deep expertise across multiple AI /ML disciplines, including inference frameworks, model optimization, platform architecture, and emerging ... and products across the organization Serving as principal technical advisor on AI inference , providing expert guidance on complex issues requiring extensive… more
- The Association of Technology, Management and Applied… (Morgan Hill, CA)
- …of innovation in AI . We are building the next generation of Gen AI platform , empowering new AI initiatives across Consumer, Small Business, Global ... critical platform that will enable secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a… more
- Neara (Palo Alto, CA)
- …On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the real world. Formed by an exceptionally ... Architect, implement, and maintain distributed systems that support high-throughput, low-latency AI model inference and data services. Partner with ML… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …product requirements and prioritization. Partner closely with Engineering, Infrastructure, and Platform teams to deliver scalable, reliable inference services. ... roles with product responsibilities. Experience building and launching cloud infrastructure, platform , or AI /ML services used in production. Strong understanding… more
- Comfy (San Francisco, CA)
- A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ... ideal candidate will engage in building efficient AI models and tackling complex challenges. The role requires...limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual generative… more
- Inference (San Francisco, CA)
- …a Senior Full-Stack Engineer to develop frontend features for its AI platform . Responsibilities include optimizing performance, mentoring team members, and ... creating user-centric applications. Candidates should have 5+ years of experience with React, Tailwind, and Typescript. A competitive salary of $120,000 - $180,000 plus equity and benefits is offered. #J-18808-Ljbffr more
- Whoop, Inc. (Boston, MA)
- …transform continuous physiological data into clear insights and actionable recommendations. Our AI platform is central to this mission, turning raw physiological ... can act on every day. WHOOP is hiring a Senior AI /ML Engineer to help scale the...this role, you will own core components of the AI Platform that power our internal … more
- Jack & Jill/External ATS (San Francisco, CA)
- …and help you find others if you ask. Senior Data Scientist VC-backed AI product intelligence platform Job Description Join a pioneering team building a ... our customers. To apply, speak to Jack. He's an AI agent that sends you unmissable jobs and then...simulation engine for adaptive software. As a Senior Data Scientist, you'll enhance foundational data science infrastructure,… more
- Inference (San Francisco, CA)
- …performant web experiences that give users super-powers over our globally distributed LLM inference platform . If you love shipping React apps that feel snappy ... Inference .net is hiring a Senior Full-Stack... Inference .net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful,...status. Ready to build the front door to planet-scale AI ? Send a short note and a link to… more
- Capital One (San Francisco, CA)
- …Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, ... in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.*… more
- Capital One National Association (San Francisco, CA)
- Senior Manager, AI Engineering (People Leader) (GenAI Platform Services) Overview At Capital One, we are creating responsible and reliable AI systems, ... us to be at the forefront of enterprises leveraging AI . From informing customers about unusual charges to answering...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
- Qualcomm (San Diego, CA)
- …Group, Engineering Group > Software Engineering General Summary We are seeking a Senior AI Platforms Engineer to design, build, and operate the infrastructure ... language models (LLMs) at scale using AWS Bedrock, GCP Vertex, Azure AI Foundry and Kubernetes-based solutions. Optimize inference performance for throughput,… more
- Capital One (San Francisco, CA)
- …Experience developing AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, ... in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.*… more