- Red Hat (Boston, MA)
- A leading software company seeks a Machine Learning Engineer focused on vLLM Inference in Boston, MA. This role involves designing high-performance machine ... Ideal candidates are experienced in Python and Pydantic, understand LLM Inference Core Concepts, and possess strong communication skills. Enjoy comprehensive… more
- Red Hat (Boston, MA)
- Machine Learning Engineer , vLLM Inference - Tool Calling and Structured Output Join to apply for the Machine Learning Engineer , vLLM Inference - ... mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat Inference...optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM , you will be… more
- Red Hat (Boston, MA)
- Principal Machine Learning Engineer , Distributed vLLM Inference Join to apply for the Principal Machine Learning Engineer , Distributed vLLM ... mission to bring the power of open‑source LLMs and vLLM to every enterprise. Red Hat Inference ...or distributed tracing libraries/techniques like OpenTelemetry. Ph.D. in an ML ‑related domain is a significant advantage. The salary range… more
- Red Hat, Inc. (Boston, MA)
- …open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM , you will be...you. Join us in shaping the future of AI Inference ! What You Will Do Write robust Python and… more
- Red Hat (Boston, MA)
- Principal Machine Learning Engineer , AI Inference page is loaded## Principal Machine Learning Engineer , AI Inferenceremote type: Hybridlocations: ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference ...optimize, and scale LLM deployments.As a Principal Machine Learning Engineer focused on vLLM , you will be… more
- Red Hat (Boston, MA)
- …is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on distributed vLLM (https://github.com/ vllm...components in Go and/or Rust to integrate with the vLLM project and manage distributed inference workloads.… more
- Red Hat (Boston, MA)
- …open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM , you will be...you. Join us in shaping the future of AI Inference ! **What You Will Do** + Write robust Python… more
- Red Hat (Boston, MA)
- …is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... project (https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as-ai-leads-typescript-to-1/#the-top-open-source-projects-by-contributors) on Github. As a Machine Learning Engineer focused on vLLM , you will… more
- Red Hat (Boston, MA)
- …is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and ... to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model...scale LLM deployments. We are seeking an experienced Senior ML Ops engineer to work closely with… more
- Red Hat (Boston, MA)
- …Nsight Systems, PyTorch Profiler, among others + Hands-on experience with modern LLM inference server stacks (eg, vLLM , TensorRT-LLM, TGI, Triton Inference ... and Scale Engineering team is seeking a Senior Performance Engineer to join our PSAP (Performance and Scale for...you will drive the performance and scalability of distributed inference for Large Language Models (LLMs) as part of… more
- Oracle (Boston, MA)
- …that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation of scalable, ... workflows. You will work in a collaborative environment with applied scientists, ML engineers, and software teams to deliver performant and reliable AI… more