Senior Deep Learning Inference Jobs

254 jobs (page 1)

BCG Platinion | Principal Architect - AI

Boston Consulting Group (Pittsburgh, PA)

…organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and functional expertise and a range of perspectives to spark ... and implement change. Innovative. They are creative thinkers who apply their deep technology architecture and AI Platform expertise to envision novel design patterns… more

JobLookup XML (12/05/25)

Vice President of Data Science & Analytics

Unknown (New York, NY)

…particularly in fast-paced environments. Deep expertise in advanced machine learning, statistical modeling, experimental design, and causal inference is a ... value-based care. This leadership position demands a visionary with deep technical expertise, capable of building and scaling a...a quantitative field and at least 5-7 years in senior leadership roles. The ideal candidate will have a… more

job goal (12/05/25)

Senior Deep Learning…

NVIDIA (Durham, NC)

We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer ... who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-software… more

NVIDIA (10/29/25)

Senior Deep Learning…

NVIDIA (Santa Clara, CA)

We are now looking for a Senior Deep Learning Architect for LLM Inference! NVIDIA is at the forefront of the generative AI revolution. Our Inference ... 6+ years of relevant industry experience + Detailed knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations.… more

NVIDIA (09/24/25)

Senior Deep Learning Software…

NVIDIA (Santa Clara, CA)

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM,...at the forefront of efficient large-scale model serving and inference. You will play a central role in improving… more

NVIDIA (12/05/25)

Senior Software Engineer, Deep…

NVIDIA (Santa Clara, CA)

Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Senior ... best practices with C++11 and C++14. + Familiarity with deep learning concepts and frameworks. + A...models (such as Large Language Models) & frameworks for inference. + Background with C++17. NVIDIA is widely considered… more

NVIDIA (10/02/25)

Senior Software Development Engineer,…

Amazon (Cupertino, CA)

…Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more

Amazon (11/27/25)

Senior Software Development Engineer,…

Amazon (Seattle, WA)

…Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more

Amazon (10/08/25)

Senior Software Engineer, AI…

NVIDIA (CA)

…Dynamo Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team, and we are a remote friendly work ... on the world. We are now looking for a Senior System Software Engineer to work on user facing...world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from… more

NVIDIA (11/29/25)

Senior GenAI Algorithms Engineer - Model…

NVIDIA (Santa Clara, CA)

…and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative ... cache optimization, parallelism strategies). + Drive continuous innovation in deep learning inference performance to strengthen NVIDIA platform integration… more

NVIDIA (09/23/25)

Senior Inference Technical Product…

NVIDIA (Santa Clara, CA)

…Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), ... We are looking for a Senior Technical Product Marketing Manager. This role will...rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with… more

NVIDIA (09/25/25)

Senior DL Algorithms Engineer…

NVIDIA (Santa Clara, CA)

…experience. + 3+ years of experience. + Strong background in deep learning and neural networks, in particular inference. + Experience with performance ... We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking ...help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid… more

NVIDIA (11/13/25)

Senior Software Engineer, AI…

NVIDIA (Santa Clara, CA)

…structures, operating systems, computer architecture, parallel programming, distributed systems, deep learning theories. + Knowledgeable and passionate about ... highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and… more

NVIDIA (11/27/25)

Senior Technical Marketing Engineer - AI…

NVIDIA (Santa Clara, CA)

…Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), ... power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers and partners understand… more

NVIDIA (11/06/25)

Senior Solutions Architect, Generative AI…

NVIDIA (Santa Clara, CA)

…different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our ... and work on exciting projects and proof-of-concepts focused on inference for Generative AI and Large Language Models (LLMs)....equivalent experience) + 8+ years of hands-on experience with Deep Learning frameworks such as PyTorch and… more

NVIDIA (09/26/25)

Senior Software Engineer - vLLM…

Red Hat (Boston, MA)

…ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...to solving challenging technical problems at the forefront of deep learning, this is the role for… more

Red Hat (12/06/25)

Software Development Manager, LLM Inference…

Amazon (Cupertino, CA)

…AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT-OSS to run ... fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team...day in the life You will work with your senior management and technical leaders to define the model… more

Amazon (12/06/25)

AI Inference Engineer

quadric.io, Inc (Burlingame, CA)

…for efficient inference; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI model algorithms, system ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph… more

quadric.io, Inc (11/25/25)

Software Development Manager, AI Inference…

Amazon (Seattle, WA)

…AWS Neuron, the complete software stack for Trainium, Amazon's custom cloudscale machine learning accelerators. Come optimize LLMs such as Llama and GPT OSS to run ... fast on Trainium. As the SDM for the Neuron Inference Technology building blocks team, you will guide your...day in the life You will work with your senior management and technical leaders to define the building… more

Amazon (11/14/25)

Software engineer -AI/ML, AWS Neuron…

Amazon (Seattle, WA)

…This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance ... required) - Experience with PyTorch - Working knowledge of Machine Learning and LLM fundamentals including transformer architecture, training/inference… more

Amazon (09/09/25)