- NVIDIA (Santa Clara, CA)
- …fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone ... We are seeking highly skilled and motivated software engineers to join our vLLM & MLPerf team. You will define and build benchmarks for MLPerf Inference, the… more
- Red Hat (Raleigh, NC)
- …seeking an experienced ML Ops engineer to work closely with our product and research teams to scale SOTA deep learning products and software. As an ML Ops ... mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates...AI ! **What you will do** + Collaborate with research and product development teams to scale machine learning… more
- NVIDIA (Santa Clara, CA)
- …fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone ... and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme.... What you'll be doing: + Contribute features to vLLM that empower the newest models with the latest… more
- Amazon (Cupertino, CA)
- …and efficiently on AWS silicon. We are seeking a Software Development Engineer to lead and architect our next-generation model serving infrastructure, with a ... particular focus on large-scale generative AI applications. Key job responsibilities * Architect and lead...workloads * Lead integration efforts with frameworks such as vLLM , SGLang, Torch XLA, TensorRT, and Triton * Develop… more
- LinkedIn (Mountain View, CA)
- …with our values-without slowing down innovation** . We sit at the intersection of ** AI modeling, large-scale infrastructure, and safety research ** . We build the ... and has many open source committers (TensorFlow, Horovod, Ray, vLLM , Hugginface, DeepSpeed etc.) in the team. Additionally, this...billions of user queries. Model Training Infrastructure: As an engineer on the AI Training Infra team,… more
- Stanford University (Stanford, CA)
- Junior AI Applications Engineer **Business Affairs: University IT (UIT), Redwood City, California, United States** **New** Information Technology Services Post ... Date 4 days ago Requisition # 107733 **Job Purpose** Are you an AI /GenAI engineer who loves shipping real systems? Join Stanford's Enterprise Technology team to… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …an impact on the world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual ... will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You will be responsible for the entire lifecycle of our AI… more
- NVIDIA (Santa Clara, CA)
- …fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU ... on the world. NVIDIA is seeking a Senior Software Engineer to serve as a Tech Lead, driving the...delivery of agentic blueprints and reference workflows, including the AI -Q Deep Researcher blueprint. This role focuses on crafting… more
- Elevance Health (Chicago, IL)
- **Gen AI Engineer ** **Location:** This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while providing ... position is not eligible for current or future visa sponsorship._ The **Gen** ** AI Engineer ** is responsible for analyzing and modeling organizational data for… more
- Bank of America (Addison, TX)
- Senior Engineer - AI Inference Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you must be at ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI -Inference\_25029879) **Job Description:** At Bank of America,… more
- Bank of America (Kennesaw, GA)
- Software Engineer III -Gen AI Inferencing Addison, Texas;Charlotte, North Carolina; Kennesaw, Georgia; Newark, Delaware **To proceed with your application, you ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Software- Engineer -III--Gen- AI -Inferencing-\_25032986) **Job Description:** At Bank of America,… more
- Robert Half Technology (Princeton, NJ)
- Description We're looking for a software engineer with a strong foundation in Python and deep experience in Artificial Intelligence and Machine Learning, ... testing, and scaling Python-based systems, along with a solid grasp of AI /ML methodologies and their practical applications. + Analyze software requirements and… more
- NVIDIA (Santa Clara, CA)
- …Performance tuning and optimizations of deep learning framework and software components. + Research , prototype, and develop robust and scalable AI tools and ... NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron...PyTorch, JAX), and/or inference and deployment environments (eg TRTLLM, vLLM , SGLang). + Proficient in Python programming, software design,… more
- Cisco (San Jose, CA)
- …Join our innovative engineering team focused on building next-generation AI /ML solutions. You'll collaborate with skilled colleagues across platform, security, ... Impact** Dive into the development and implementation of cutting-edge generative AI applications using the latest large language models-think GPT-4, Claude, Llama,… more
- NVIDIA (Santa Clara, CA)
- …will define the future of generative AI . We are looking for a research engineer who is passionate about open-source and excited to create our next-generation ... post-training software stack. You will work at the intersection of research and engineering, collaborating with the Post-Training and Frameworks teams to invent,… more
- Amazon (Seattle, WA)
- …cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible ... so on. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from… more
- Amazon (Sunnyvale, CA)
- …Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior Research Engineer with a strong hands-on machine learning background, to lead ... Large Language Models (LLMs) and Generative Artificial Intelligence (Gen AI ). You will have significant influence on our overall...upon industry leading frameworks (NeMo, Megatron Core, PyTorch, Jax, vLLM , TRT, etc) - Work with other team members… more
- Red Hat (Raleigh, NC)
- …(https://github.blog/news-insights/octoverse/octoverse-a-new-developer-joins-github-every-second-as- ai ... Summary** At Red Hat we believe the future of AI is open and we are on a mission...mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates… more
- NVIDIA (Santa Clara, CA)
- …across diverse GPU platforms, particularly for physical AI and generative AI applications. You will collaborate with research scientists, software engineers, ... We are now looking for a Senior DL Algorithms Engineer ! We are seeking a highly skilled Deep Learning...and hardware specialists to bring cutting-edge AI models from prototype to production. What you will… more
- NVIDIA (Santa Clara, CA)
- …strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... of research and engineering, pushing the boundaries of large-scale AI optimization. We are looking for passionate engineers with strong foundations in… more