- Bosch (Sunnyvale, CA)
- …following areas: multimodal transformers, diffusion models, NeRF/Gaussian Splatting, video generation, 3D scene understanding , autonomous driving, behavior ... Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data Visual...internship, you will conduct advanced research and engineering in 3D perception, scene understanding , and… more
- NVIDIA (Santa Clara, CA)
- …i nternships require research experience in at least one of the following areas: 3D Vision and Scene Understanding + Optical Flow/ Scene Flow + SLAM + ... submitting your resume, you're expressing interest in one of our 2026 Computer Vision and Deep Learning focused Research Internships. We'll review resumes on an… more
- Meta (Burlingame, CA)
- …ML and software in the domains of World Model/VLA, Long-context & consistent 4D scene understanding , 3D environment and object reconstruction, estimation and ... field 8. Experienced in advanced computer vision and ML, thorough understanding of 3D geometry fundamentals as well as state of the art ML/AI models… more
- Bosch (Sunnyvale, CA)
- …multimodal language models, diffusion models, NeRF or gaussian splatting, VLA modeling, 3D scene understanding , sensor calibration, autonomous driving, SfM, ... **Advance 3D perception capabilities** by integrating large-scale vision -language-action models, enhancing reasoning, explainability, and open-world understanding… more
- General Motors (Mountain View, CA)
- …error logging, and data curation. **Bonus:** + Expertise with Transformer-based models for 3D detection, tracking, and scene understanding . + Technical ... deployment of deep learning models for core perception tasks such as: + 3D Object Detection and Tracking (vehicles, pedestrians, cyclists). + Real-time map detection… more
- Amazon (Sunnyvale, CA)
- …related research and experimentation, applying advanced machine learning techniques in computer vision (CV), Generative AI, multimedia understanding and so on. ... of: harmonization, relighting, style transfer, lip-sync, segmentation, matting, depth estimation, 3D camera/ scene modeling. Amazon is an equal opportunity… more
- pony.ai (Fremont, CA)
- …models focusing on 3D object detection and tracking, segmentation, semantics understanding , video understanding , scene understanding , traffic ... in real-time on the cars. + Develop and deploy deep learning models, including vision language models (VLMs) and Large Language Models (LLMs) + Design and implement… more