- Boston Consulting Group (Pittsburgh, PA)
- …organizations must blend digital and human capabilities. Our diverse, global teams bring deep industry and functional expertise and a range of perspectives to spark ... and implement change. Innovative. They are creative thinkers who apply their deep technology architecture and AI Platform expertise to envision novel design patterns… more
- Unknown (New York, NY)
- …particularly in fast-paced environments. Deep expertise in advanced machine learning , statistical modeling, experimental design, and causal inference is a ... value-based care. This leadership position demands a visionary with deep technical expertise, capable of building and scaling a...a quantitative field and at least 5-7 years in senior leadership roles. The ideal candidate will have a… more
- NVIDIA (Durham, NC)
- We are now looking for a Senior Deep Learning Inference Performance Architect! NVIDIA is seeking a Senior Performance Architect - a creative engineer ... who loves to squeeze out every cycle of performance from deep learning software. The Inference Architecture team does groundbreaking hardware-software… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Deep Learning Architect for LLM Inference ! NVIDIA is at the forefront of the generative AI revolution. Our Inference ... 6+ years of relevant industry experience + Detailed knowledge of deep learning inference serving, PyTorch programming, profiling, and compiler optimizations.… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, ... Our team is responsible for developing and maintaining high-performance deep learning frameworks, including SGLang and vLLM,...at the forefront of efficient large-scale model serving and inference . You will play a central role in improving… more
- NVIDIA (Santa Clara, CA)
- Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Senior ... best practices with C++11 and C++14. + Familiarity with deep learning concepts and frameworks. + A...models (such as Large Language Models) & frameworks for inference . + Background with C++17. NVIDIA is widely considered… more
- Amazon (Cupertino, CA)
- …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more
- Amazon (Seattle, WA)
- …Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning ... Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's...new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference… more
- NVIDIA (CA)
- …Dynamo Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team, and we are a remote friendly work ... on the world. We are now looking for a Senior System Software Engineer to work on user facing...world are using GPUs to power a revolution in deep learning , enabling breakthroughs in problems from… more
- NVIDIA (Santa Clara, CA)
- …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative ... cache optimization, parallelism strategies). + Drive continuous innovation in deep learning inference performance to strengthen NVIDIA platform integration… more
- NVIDIA (Santa Clara, CA)
- …Deep understanding of modern data center architectures, accelerated computing, distributed inference , deep learning frameworks (PyTorch, TensorFlow, JAX), ... We are looking for a Senior Technical Product Marketing Manager. This role will...rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with… more
- NVIDIA (Santa Clara, CA)
- …experience. + 3+ years of experience. + Strong background in deep learning and neural networks, in particular inference . + Experience with performance ... We are now looking for a Senior DL Algorithms Engineer! NVIDIA is seeking ...help us squeeze every last clock cycle out of Deep Learning workloads. If you are unafraid… more
- NVIDIA (Santa Clara, CA)
- …structures, operating systems, computer architecture, parallel programming, distributed systems, deep learning theories. + Knowledgeable and passionate about ... highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll architect and… more
- NVIDIA (Santa Clara, CA)
- …Deep understanding of modern data center architectures, accelerated computing, distributed inference , deep learning frameworks (PyTorch, TensorFlow, JAX), ... power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our growing accelerated...high-impact go-to-market strategy. This role will focus on AI inference at scale, ensuring that customers and partners understand… more
- NVIDIA (Santa Clara, CA)
- …different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our ... and work on exciting projects and proof-of-concepts focused on inference for Generative AI and Large Language Models (LLMs)....equivalent experience) + 8+ years of hands-on experience with Deep Learning frameworks such as PyTorch and… more
- Red Hat (Boston, MA)
- …ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops engineer, you will work ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...to solving challenging technical problems at the forefront of deep learning , this is the role for… more
- Amazon (Cupertino, CA)
- …AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT-OSS to run ... fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team...day in the life You will work with your senior management and technical leaders to define the model… more
- quadric.io, Inc (Burlingame, CA)
- …for efficient inference ; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI model algorithms, system ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph… more
- Amazon (Seattle, WA)
- …AWS Neuron, the complete software stack for Trainium, Amazon's custom cloudscale machine learning accelerators. Come optimize LLMs such as Llama and GPT OSS to run ... fast on Trainium. As the SDM for the Neuron Inference Technology building blocks team, you will guide your...day in the life You will work with your senior management and technical leaders to define the building… more
- Amazon (Seattle, WA)
- …This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance ... required) - Experience with PyTorch - Working knowledge of Machine Learning and LLM fundamentals including transformer architecture, training/ inference … more