Software Engineer GenAI Inference Jobs

71 jobs (page 1)

Staff Software Engineer…

Databricks Inc. (San Francisco, CA)

Staff Software Engineer - GenAI inference P-1285 About This Role As a staff software engineer for GenAI inference, you will lead the ... low latency, and robust scaling. Your work will encompass the full GenAI inference stack: kernels, runtimes, orchestration, memory, and integration with… more

job goal (01/13/26)
Software Engineer - GenAI…

Menlo Ventures (San Francisco, CA)

About This Role As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' ... scalable, and efficient. Your work will touch the full GenAI inference stack - from kernels and...BS/MS/PhD in Computer Science, or a related field Strong software engineering background (3+ years or equivalent) in performance‑critical… more

job goal (01/13/26)
GenAI Inference Engineer…

Databricks Inc. (San Francisco, CA)

A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference. In this role, you'll design, develop, and ... optimize the inference engine powering the Foundation Model API. You will...focusing on large-scale LLM applications. A strong background in software engineering, distributed systems, and machine learning techniques is… more

job goal (01/13/26)
GenAI Inference Architect: Scale…

Databricks Inc. (San Francisco, CA)

A leading data and AI company in San Francisco seeks a Staff Software Engineer for GenAI inference to lead its architecture and optimization efforts. ... with at least 6 years of experience and an understanding of ML inference internals. Key tasks include collaborating on model features, optimizing the inference… more

job goal (01/13/26)
Senior Software Development Engineer…

Amazon (San Francisco, CA)

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia… more

job goal (01/13/26)
Software Development Engineer…

Amazon (San Francisco, CA)

Software Development Engineer, AI/ML, AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK,… more

job goal (01/12/26)
Staff Software Engineer…

Menlo Ventures (San Francisco, CA)

About This Role As a staff software engineer for GenAI Performance and Kernel, you will own the design, implementation, optimization, and correctness of the ... high-performance GPU kernels powering our GenAI inference stack. You will lead development of highly-tuned, low-level compute paths, manage trade-offs between… more

job goal (01/13/26)
Staff GenAI Engineer

Apple Inc. (Seattle, WA)

…with data, ultimately helping teams derive insights that drive product success. As a Staff GenAI Engineer on the Apple Data Platform group's GenAI Platform ... Seattle, Washington, United States Software and Services The Apple Data Platform team...Apple. Description Join Apple's Data Platform as a Staff GenAI Engineer, where you'll be at the… more

job goal (01/13/26)
Senior AI Software Engineer…

NVIDIA Corporation (Santa Clara, CA)

NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks team. Megatron Core and NeMo Framework are open-source, scalable and cloud-native ... and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, alignment,… more

job goal (01/12/26)
Senior Principal MLOps Engineer, AI…

Red Hat (Boston, MA)

…closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work closely with our technical ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM… more

job goal (01/13/26)
Senior Principal MLOps Engineer, AI…

Red Hat, Inc. (Boston, MA)

…closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work closely with our technical ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM… more

job goal (01/13/26)
Senior Machine Learning Ops Engineer…

Rivian (Palo Alto, CA)

…challenges of electric vehicles through technology that will set the standards for software‑defined vehicles around the world. The road to the future is uncharted. ... more intelligent, more sustainable for everyone. Role Summary As an ML Ops Engineer, you will be instrumental in building and maintaining a scalable training and… more

job goal (01/12/26)
Software Engineer, Professional…

Freddie Mac (McLean, VA)

…your work contributes to a greater purpose. Position Overview: We are seeking a Software Engineer, Professional - Gen AI (Data) Scientist with a strong focus ... and sustainable results for our clients. Your Impact: As Software Engineer, Professional - Gen AI (Data) Scientist,...Design and implement scalable AI Agents, Agentic Workflows and GenAI applications to address diverse and complex business use… more

job goal (01/13/26)
Senior Software Engineer, Backend…

harvey.ai (San Francisco, CA)

…GenAI‑native applications - such as supporting high‑throughput model inference, managing streaming and long‑running API interactions, and designing abstractions ... today - and we're just getting started. Role Overview As a Backend Platform Engineer at Harvey, you will help build and operate the cohesive backend platform that… more

job goal (01/13/26)
Software Engineer - Agent Cloud

Rubrik, Inc. (Palo Alto, CA)

…infrastructure - including model gateways (like LiteLLM or MCP), fine‑tuning, inference optimization, or policy enforcement in AI workloads. Strong programming ... Rubrik's offerings also include Predibase to help further secure and deploy GenAI while delivering exceptional accuracy and efficiency for agentic applications. At… more

job goal (01/13/26)
Senior Applied AI Engineer

Icon Ventures (San Francisco, CA)

…learning coach that's recognized as best‑in‑class. About the Role As an Applied AI Engineer, you will be working at the forefront of our AI strategy, shaping ... roadmap for applied AI across personalization, ranking, search, recommendations, and GenAI/LLM systems; help connect modeling work to business metrics (engaged… more

job goal (01/13/26)
Sr. ML Kernel Performance Engineer, AWS…

Amazon (Cupertino, CA)

Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development ... kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library… more

job goal (01/13/26)
Sr. Machine Learning Engineer

Northeastern University (Boston, MA)

Sr. Machine Learning Engineer. Locations: Boston, MA (Main Campus). Time type: Full time. Posted: 2 Days Ago. Job requisition ID: R132702. About the ... Opportunity The Sr Machine Learning (ML) Engineer applies expertise in deploying and scaling AI pipelines...degree and at least 3+ years of experience in software engineering with a focus on cloud infrastructure plus… more

job goal (01/12/26)
Senior Machine Learning Engineer

Red Hat (Boston, MA)

Senior Machine Learning Engineer. Remote type: Hybrid. Locations: Boston. Posted on: Posted Today. Job requisition ID: ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM… more

job goal (01/13/26)
Machine Learning Engineer

Cisco Systems (Sunnyvale, CA)

…that power AI across Splunk and Cisco. We manage large-scale, multi-tenant LLM inference across major cloud providers and build platform services to support these ... SDKs, tools, and evaluation/guardrail capabilities that help teams quickly build reliable GenAI assistants and automation features. You'll join a group that sits at… more

job goal (01/13/26)