- NVIDIA (Santa Clara, CA)
- …how you can make a lasting impact on the world. We are seeking a highly skilled Senior On-Device Model Inference Optimization Engineer to join our team ... you'll be doing: + Develop and implement strategies to optimize AI model inference for on-device deployment. + Employ techniques like pruning, quantization,…
- Google (Mountain View, CA)
- …Write and test product or system development code. + Develop GenAI audio/speech features from model research to on-device inference. + Drive the research and ... will own ML model development, training and optimization, and contribute to model inference optimization into Android Kernel/HAL. You will design and fine…
- FocusKPI Inc. (Mountain View, CA)
- FocusKPI is looking for a Senior AI Web Development Engineer to join one...device (edge) LLM is a plus + ML model deployment and inference in the Android ... novel ways to use web content. They seek a Senior AI Web Development Engineer or Senior ...scraping frameworks like Puppeteer or a similar framework + On-device AI model optimization and quantization Desired…
- Amazon (Sunnyvale, CA)
- …optimize and balance between short-term monetization and long-term value creation for Amazon Device users. We are seeking a skilled Senior Economist to build the ... Key job responsibilities We are looking for a talented Senior Economist to drive the development of Devices and...the unique characteristics of the Devices and Services business model, and combining the models with A/B testing for…
- Walmart (Sunnyvale, CA)
- …next generation of AI-powered experiences on smart TVs: think on-device large language models, real-time content understanding, privacy-preserving audience insights, ... & train GenAI models targeting CTV use cases: on-device LLM quantization, multimodal video-text encoders, and diffusion...limited-memory devices. + Hands-on experience with edge inference (TensorRT, CoreML, TVM, ONNX, TFLite, WebGPU, or similar).…
- Google (Mountain View, CA)
- …in data structures and algorithms. Preferred qualifications: + Experience running AI inference on-device. + Experience in ML performance, systems data analysis, ... + Create new tools and fine-tuning LLMs to improve model performance and evaluate quality. + Improving the on-device performance of Gemini + Develop new Built-In AI…