- Red Hat (Raleigh, NC)
- …open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings ... to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model...LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research… more
- LinkedIn (Mountain View, CA)
- …to optimize their models and deliver the best performance possible. As a Senior Software Engineer , you will have first-hand opportunities to advance one ... and has many open source committers (TensorFlow, Horovod, Ray, vLLM , Hugginface, DeepSpeed etc.) in the team. Additionally, this...billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team, you will… more
- Steampunk (Mclean, VA)
- **Overview** We are looking for an experienced ** Senior ** **LLMOps** ** Engineer ** to design, implement, and maintain production-grade large-language-model (LLM) ... pipelines, deployment architectures, and monitoring systems across enterprise environments. The Senior LLMOps Engineer will play a critical role in… more
- Red Hat (Boston, MA)
- **Job Summary:** The Red Hat Ecosystems Engineering group is seeking a Senior Principal Software Engineer in our Boston, MA office. In this role, you will work ... + Familiarity with model parallelization, quantization, and memory optimization using vLLM , DeepSpeed, OpenVino and other inference libraries + Strong experience… more
- CAE USA INC (Orlando, FL)
- …and having fun! Summary We are seeking a highly skilled and experienced Machine Learning Engineer to join our growing AI & Data Science team in R&D . This role ... using tools like ONNX, TensorRT , DeepSpeed , or vLLM . + Implement retrieval-augmented generation (RAG) pipelines using...microservice architecture and REST APIs. + Strong understanding of MLOps tools and practices ( MLflow , Airflow, DVC).… more
- Stanford University (Stanford, CA)
- Junior AI Applications Engineer **Business Affairs: University IT (UIT), Redwood City, California, United States** **New** Information Technology Services Post Date ... 3 days ago Requisition # 107733 **Job Purpose** Are you an AI/GenAI engineer who loves shipping real systems? Join Stanford's Enterprise Technology team to design,… more
- Palo Alto Networks (Santa Clara, CA)
- …security posture from development through runtime. As a Principal Machine Learning Inference Engineer , you will serve as a technical authority and visionary for the ... Beyond individual contribution, you will lead complex technical projects, mentor senior engineers, and set the standard for performance, scalability, and engineering… more