- Red Hat (Boston, MA)
- …open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings ... to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model...LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more
 
- Red Hat (Boston, MA)
- **Job Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will work ... + Familiarity with model parallelization, quantization, and memory optimization using vLLM , DeepSpeed, OpenVino and other inference libraries + Strong experience… more