- Comfy (San Francisco, CA)
- …company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will engage in ... building efficient AI models and tackling complex challenges. The role requires a strong background in PyTorch and a passion for pushing performance limits. Join a dynamic team focused on creating innovative AI solutions and shaping the future of visual… more
- Amazon (Seattle, WA)
- A leading technology company based in Seattle is seeking a Senior Software Engineer for its Machine Learning Inference Applications team. The role focuses on ... development and optimization for LLM inference on specialized hardware. Ideal candidates will have significant software development experience and an understanding… more
- Amazon (San Francisco, CA)
- …technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI/ ML projects. You will design and optimize machine ... learning principles. This role fosters a collaborative environment emphasizing continuous learning and performance tuning for clients' ML workloads. #J-18808-Ljbffr more
- Qualcomm (San Diego, CA)
- …working within a team to develop Qualcomm's next-gen high-performance inference accelerator. Candidates should have strong embedded software development skills ... and experience working with various programming languages. The position offers competitive compensation ranging from $162,600 to $244,000, reflecting the total compensation package including bonuses and benefits. #J-18808-Ljbffr more
- quadric.io, Inc (Burlingame, CA)
- A pioneering tech company is looking for an experienced AI Inference Engineer to bridge AI models and advanced processing platforms. This role requires expertise ... in AI model algorithms, strong C/C++ and Python skills, and experience with deployment frameworks. You will optimize and benchmark AI models, ensuring efficient deployment in edge devices. The position comes with comprehensive health benefits and flexible work… more
- Google Inc. (Sunnyvale, CA)
- A leading global technology company is looking for a Senior Software Engineer specializing in machine learning and model optimization. This role requires ... expertise in C++ and Python, particularly with advanced deep learning frameworks like PyTorch or JAX. You will work closely with customers, optimizing their machine learning models on TPU platforms and developing innovative model performance solutions. The… more
- Amazon (San Francisco, CA)
- Senior Software Development Engineer , AI/ ML , AWS Neuron, Model Inference Job ID: 3067759 | Amazon.com Services LLC The Annapurna Labs team at Amazon Web ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
- Amazon (San Francisco, CA)
- Software Development Engineer , AI/ ML , AWS Neuron, Model Inference The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software ... integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and… more
- Amazon (Seattle, WA)
- …cloud-scale machine-learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is ... for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc.… more
- Nutanix (San Diego, CA)
- A global technology leader in San Diego is seeking an AI Software Engineer to develop on-device software solutions using Python and C/C++. The ideal candidate will ... work in a dynamic environment, collaborating with researchers to advance Gen AI technology. A strong background in software engineering and experience with deep learning frameworks are essential. The role offers a competitive salary range and opportunities for… more
- Apple Inc. (San Francisco, CA)
- Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best map ... with Maps infrastructure. Responsibilities Own the technical architecture of large-scale ML inference platforms, defining long-term design direction for serving… more
- Akamai Technologies GmbH (Cambridge, MA)
- Senior Principal Software Engineer - Akamai Inference Cloud (Remote) United States (Remote) Job Description Do you thrive on defining the future of AI ... requiring a longer-term view and deep understanding of business objectives. As a Senior Principal Software Engineer , you will be responsible for: Defining the… more
- NVIDIA Corporation (Santa Clara, CA)
- Senior Technical Marketing Engineer - AI Inference at Scale page is loaded## Senior Technical Marketing Engineer - AI Inference at ... power AI at scale. We are looking for a Senior Technical Marketing Engineer to join our...consistent, high-impact go-to-market strategy.This role will focus on AI inference at scale, ensuring that customers and partners understand… more
- Hamilton Barnes Associates Limited (San Francisco, CA)
- …systems. Requirements 5+ years' experience building large-scale, fault-tolerant distributed systems ( ML inference , HPC, or similar). Proficiency in Python, Go, ... architectures. Exposure to hybrid cloud or multi-cluster environments. Contributions to open-source ML or inference systems projects. Proven track record of cost… more
- Apple Inc. (Cupertino, CA)
- A leading technology company in California seeks a senior /principal engineer to architect and build distributed ML infrastructure. This role involves ... optimizing GPU compute systems and collaborating with silicons teams to enhance AI capabilities. Candidates should have substantial experience in GPU programming and distributed systems, alongside a technical degree. The compensation package includes a base… more
- Red Hat (Boston, MA)
- …for enterprises to build, optimize, and scale LLM deployments.We are seeking an experienced ML Ops engineer to work closely with our product and research teams ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...scale SOTA deep learning products and software. As an ML Ops engineer , you will work closely… more
- Red Hat, Inc. (Boston, MA)
- …for enterprises to build, optimize, and scale LLM deployments. We are seeking an experienced ML Ops engineer to work closely with our product and research teams ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...scale SOTA deep learning products and software. As an ML Ops engineer , you will work closely… more
- Neara (Palo Alto, CA)
- …implement, and maintain distributed systems that support high-throughput, low-latency AI model inference and data services. Partner with ML researchers and ... Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI...and resilient distributed systems. Youll work closely with researchers, ML engineers, and product teams to bring cutting-edge AI… more
- Etched.ai, Inc. (San Jose, CA)
- …SDK flows, and reproducible performance metrics. This role requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work ... A tech company specializing in AI inference seeks a technical team member to build installation guides,… more
- NVIDIA Corporation (Santa Clara, CA)
- A leading technology company is seeking a Senior Technical Marketing Engineer for AI Inference at Scale. This role involves developing impactful messaging ... should have a strong background in product marketing, experience with AI/ ML workloads, and exceptional communication skills. The position offers a competitive… more