- Cartesia (San Francisco, CA)
- …datasets at large scale to develop model capabilities. Stay on the cutting edge of research in synthetic data generation, data augmentation, and ... training will be built on a foundation of high-quality synthetic data. We are looking for a...a unique, high-impact role, where you will solve critical data bottlenecks and directly accelerate our research … more
- Sesame (San Francisco, CA)
- …We're looking for a skilled engineer or researcher to build high-value synthetic data pipelines that accelerate vision model development. The ideal candidate ... improve downstream computer vision tasks. Responsibilities: Build and maintain synthetic data generation pipelines (e.g., neural rendering, diffusion/score-based… more
- Amazon (San Francisco, CA)
- …development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and ... Senior Language Engineer, Artificial General Intelligence - Data ...responsible for the final deliverables Design and conduct complex data creation tasks using synthetic and model-based… more
- Fabrion (San Francisco, CA)
- ML/AI Research Engineer - Agentic AI Lab...intelligence layer that sits on top of our enterprise data fabric. This isn't a prompt engineer ... graphs, and multi‑tenant governance. We're looking for an ML/AI Research Engineer to join our AI Lab...for enterprise use cases with both structured and unstructured data Build and optimize RAG pipelines using LangChain, LangGraph,… more
- Amazon (San Francisco, CA)
- …development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and ... Language Engineer, Artificial General Intelligence - Data ...responsible for the final deliverables Design and conduct complex data creation tasks using synthetic and model-based… more
- Aldea Inc (San Francisco, CA)
- …The Role We are hiring a Data Engineer to build the data infrastructure that powers Aldea's multi-modal AI research. You will design and scale ... sources across language and speech domains, and generate high-quality synthetic data for model training. This is...training quality and efficiency. If you're passionate about building data systems that power cutting-edge AI research,… more
- Letta Inc. (San Francisco, CA)
- …mixtures, training algorithms, and models Building infrastructure for generating and collecting synthetic data at scale Building challenging evals for measuring ... of self-improving superintelligence. Advance the field through open publishing of research through papers, technical reports, blog posts, and open-source code. What… more
- Periodiclabs (Menlo Park, CA)
- …is seeking a candidate to train cutting-edge language models and develop methods for synthetic data generation. In this role, you will closely collaborate with ... researchers to guide scientific data curation and optimize reinforcement learning processes. Ideal candidates should have experience with LLM training and… more
- Amazon (San Francisco, CA)
- …responsibilities Develop simulations for reinforcement learning, closed‑loop simulations and synthetic data generation Implement essential robotics features, ... We are seeking a Simulation Engineer to join our AI robotics research...science or equivalent Experience with physical robots, reinforcement learning, synthetic data generation. Experience optimizing physics simulation… more
- Sesame (San Francisco, CA)
- …for running on mobile class hardware. Own the full development cycle: system design, data collection & curation, synthetic data generation, model training & ... field. Experience with wearables, IMUs, or tactile/force sensors. Familiarity with synthetic data generation and augmentation techniques. Experience in a… more
- NightDragon Acquisition Corp. (San Francisco, CA)
- About Capella Space Capella Space is a pioneer in Synthetic Aperture Radar (SAR) satellite technology and space-based signal intelligence. We empower government, ... commercial, and research organizations around the world with high-resolution, timely Earth...technology will support customers with the highest level of data fidelity, security, and speed. Capella was named one… more
- Latent Labs (San Francisco, CA)
- We are seeking a Senior Research Associate to join our team working at the interface of generative AI and biology. The ideal candidate is experienced in molecular ... related field with two years of industry or academic research experience in the above field(s); or Master's degree...automation. You are used to managing large amounts of data. You have experience using and improving LIMS, Benchling… more
- Code Metal (San Francisco, CA)
- …applying RLHF to LLMs, especially for code generation. Experience with large‑scale synthetic data generation. Benefits Health care plan with 100% premium ... using PyTorch (2+ years experience required). Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.… more
- Arena AI (San Francisco, CA)
- …on yesterday's tools. At Arena, we're building the world's first AI industrial engineer designed to solve the most complex hardware and manufacturing challenges. Our ... domains of physics. Paired with its ability to reason about multimodal industrial data, Atlas can test, debug, optimize, and repair physical systems and products in… more
- Periodiclabs (Menlo Park, CA)
- …serve as the foundation for reinforcement learning. You will develop methods for synthetic data generation, distillation, and continual learning at scale. You ... scaling laws and compute‑optimal hyperparameters Generating billions of tokens of high‑quality synthetic data Building evals that correlate with downstream task… more
- Volkswagen Group Services GmbH (Belmont, CA)
- …use cases, including AI digital twins, scene understanding, outlier detection, synthetic data generation, scenario modeling, traffic modeling, neural ... days) to receive an alert: Senior AI ML Ops Engineer - Must have Autonomous Driving Experience As ADMT,...of Excellence within the ADMT LLC focuses on applied research and pre-development with the goal to evaluate and/or… more
- Meta (Menlo Park, CA)
- …while having the chance to collaborate with researchers and engineers across MSL. **Required Skills:** Research Engineer, Media Data Research - MSL FAIR ... research engineers to help us build the data foundation for Meta's most advanced Large Language and...We are tackling complex challenges at trillion-scale, including organic data curation, synthetic data generation,… more