Software Engineer Ai Inference Jobs

371 jobs (page 1)

Categories

All Categories

Engineering (139)

Software/IT (67)

Management (8)

Software Engineer , AI…

Menlo Ventures (San Francisco, CA)

…and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge ... capabilities of our embodied systems. You will be responsible for optimizing AI inference processes from lightweight to billion-parameter models, ensuring our… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer - AI /ML,…

Amazon (Seattle, WA)

…cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... Overview AWS Neuron is the complete software stack for the AWS Inferentia and Trainium...and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer III, AI…

jobr.pro (Sunnyvale, CA)

…UI design and mobile; the list goes on and is growing every day. As a software engineer , you will work on a specific project critical to Google's needs with ... Large Language Models (LLM) and other Machine Learning (ML) models for inference . Experience building GPU-related software . Experience with compilers or ML… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Software Development Engineer…

Amazon (San Francisco, CA)

Senior Software Development Engineer , AI /ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and...scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Development Engineer…

Amazon (San Francisco, CA)

Software Development Engineer , AI /ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... of applied scientists, system engineers, and product managers to deliver state‑of‑the‑art inference capabilities for Generative AI applications. Your work will… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior AI /ML Software…

Amazon (San Francisco, CA)

…technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI /ML projects. You will design and optimize machine ... learning models for deployment on custom hardware accelerators, ensuring maximum performance. Ideal candidates will have over 5 years of experience, strong Python and C++ skills, and knowledge in machine learning principles. This role fosters a collaborative… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
AI /ML Software Engineer…

Amazon (San Francisco, CA)

A leading e-commerce platform in San Francisco is seeking a Software Development Engineer to develop and optimize machine learning models for custom hardware ... accelerators. This role involves performance tuning, debugging, and close collaboration with customers to enhance their models on AWS's services. The ideal candidate has strong programming skills in C++ and Python, along with a solid understanding of machine… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Lead AI Engineer (FM Hosting, LLM…

Capital One (Fredericksburg, VA)

Lead AI Engineer (FM Hosting, LLM Inference ) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For ... customers interact with Capital One. Design, develop, test, deploy, and support AI software components including foundation model training, large language model… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network... AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal Software Engineer…

Akamai Technologies GmbH (Cambridge, MA)

Senior Principal Software Engineer - Akamai Inference Cloud (Remote) United States (Remote) Job Description Do you thrive on defining the future of AI ... deep understanding of business objectives. As a Senior Principal Software Engineer , you will be responsible for:...across the organization Serving as principal technical advisor on AI inference , providing expert guidance on complex… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineering - Inference…

Virtue AI (San Francisco, CA)

About Virtue AI Virtue AI sets the standard for advanced...to join our core team. What You'll Do As an Inference Engineer , you will own how models are ... Built on decades of foundational and award-winning research in AI security, its AI -native architecture unifies automated...Serve and optimize LLM, embedding, and other ML models' inference across multiple model families Design and operate … more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Staff Software Engineer - GenAI…

Databricks Inc. (San Francisco, CA)

Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference , you will lead the ... architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API. You'll bridge research advances and production… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer - GenAI…

Menlo Ventures (San Francisco, CA)

About This Role As a software engineer for GenAI inference , you will help design, develop, and optimize the inference engine that powers Databricks' ... are fast, scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and runtimes to orchestration and memory management. What… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Deep Learning Software…

NVIDIA Corporation (Santa Clara, CA)

Senior Deep Learning Software Engineer , Inference page is loaded## Senior Deep Learning Software Engineer , Inferencelocations: US, CA, Santa Clara: ... requisition id: JR2002670NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for...you will help design, build, and optimize the GPU-accelerated software that powers today's most sophisticated AI … more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer - Large Scale…

San Francisco Compute Co. (San Francisco, CA)

…GPU offtake. About the Role As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase ... solutions to maximize compute utilization Create automated compute purchasing software to optimally fulfill inference job demand...fit for you if You enjoy the craftsmanship of software You're a thoughtful high-agency engineer Have… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Technical Marketing Engineer…

NVIDIA Corporation (Santa Clara, CA)

Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at ... platforms integrate CPUs, GPUs, DPUs, networking, and a full-stack software ecosystem to power AI at scale....a consistent, high-impact go-to-market strategy.This role will focus on AI inference at scale, ensuring that customers… more

job goal (01/12/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer , AI…

Red Hat (Boston, MA)

…this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...research teams to scale SOTA deep learning products and software . As an ML Ops engineer , you… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Senior Principal MLOps Engineer , AI…

Red Hat, Inc. (Boston, MA)

…this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...research teams to scale SOTA deep learning products and software . As an ML Ops engineer , you… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Software Engineer , Networking…

OpenAI (San Francisco, CA)

…and low-latency connection management. Have 5+ years of experience as a software engineer and systems architect working on high-scale, high-reliability ... . About the Role We're looking for a senior engineer to design and build the load balancer that...will sit at the very front of our research inference stack - routing the world's largest AI… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source
Lead AI Engineer (FM Hosting, LLM…

Capital One (San Francisco, CA)

…developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, ... in engineering and mathematics, and your expertise in hardware, software , and AI enable you to see...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more

job goal (01/13/26)
- Save Job - Related Jobs - Block Source

"Juju

Recent Searches

Recent Jobs

Account Login

Sign Up

Forgot your password?

Advanced Search