- Cartesia (San Francisco, CA)
- …datasets at large scale to develop model capabilities. Stay on the cutting edge of research in synthetic data generation, data augmentation, and ... training will be built on a foundation of high-quality synthetic data . We are looking for a...a unique, high-impact role, where you will solve critical data bottlenecks and directly accelerate our research … more
- Sesame (San Francisco, CA)
- …We're looking for a skilled engineer or researcher to build high-value synthetic data pipelines that accelerate vision model development. The ideal candidate ... improve downstream computer vision tasks. Responsibilities: Build and maintain synthetic data generation pipelines (eg, neural rendering, diffusion/score-based… more
- Sesame (San Francisco, CA)
- …Francisco is seeking a skilled engineer or researcher to build high-value synthetic data pipelines. The ideal candidate should have experience in classical ... computer vision techniques and be comfortable with modern machine learning tools. This full-time role offers excellent employee benefits including health and dental coverage, unlimited PTO, and a collaborative work environment. Diversity is embraced within the… more
- Periodiclabs (Menlo Park, CA)
- …is seeking a candidate to train cutting-edge language models and develop methods for synthetic data generation. In this role, you will closely collaborate with ... researchers to guide scientific data curation and optimize reinforcement learning processes. Ideal candidates should have experience with LLM training and… more
- Scale AI (San Francisco, CA)
- Machine Learning Research Engineer , Agent Data Foundation - Enterprise GenAI AI is becoming vitally important in every function of our society. At Scale, our ... Data Foundation team, you'll work on cutting edge research to define the data flywheel that...would love to hear from you! You will: Build synthetic data pipelines to generate enterprise environments… more
- Apple Inc. (Culver City, CA)
- Machine Learning Research Engineer - Large Language Models (LLMs), Siri Core Modeling Cupertino, California, United States Machine Learning and AI Join the Siri ... Planner team as a Machine Learning Research Engineer and play a pivotal role...Augmented Generation (RAG), or agentic systems Applying LLMs for synthetic data generation (eg for knowledge distillation)… more
- Fabrion (San Francisco, CA)
- ML/AI Research Engineer - Agentic AI Lab...intelligence layer that sits on top of our enterprise data fabric. This isn't a prompt engineer ... graphs, and multi‑tenant governance. We're looking for an ML/AI Research Engineer to join our AI Lab...for enterprise use cases with both structured and unstructured data Build and optimize RAG pipelines using LangChain, LangGraph,… more
- Workshop Labs (San Francisco, CA)
- …techniques to make models pick up people's reasoning & judgement & style. Put small data to use. Create synthetic data pipelines to let models squeeze ... judgment, and preferences without big tech-or us-ever seeing your data . Our core ML challenge: how do we train...AI startup, or top AI lab. Published machine learning research outputs , as peer‑reviewed papers or in‑depth technical… more
- Aldea Inc (San Francisco, CA)
- …The Role We are hiring a Data Engineer to build the data infrastructure that powers Aldea's multi-modal AI research . You will design and scale ... sources across language and speech domains, and generate high-quality synthetic data for model training. This is...training quality and efficiency. If you're passionate about building data systems that power cutting-edge AI research ,… more
- Zyphra Technologies Inc. (Palo Alto, CA)
- …grappling in detail with data and spending significant time involved in data engineering and synthetic data generation Postgraduate degree in scientific ... You will be deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies. You… more
- Voltai Inc. (Palo Alto, CA)
- …automated chip development. You will develop methods for generating and curating synthetic design data , performing model distillation, and enabling continual ... laws and optimizing compute budgets for chip-design-specific workloads Generating large-scale synthetic design data (eg, RTL variants, testbenches, verification… more
- Letta Inc. (San Francisco, CA)
- …mixtures, training algorithms, and models Building infrastructure for generating and collecting synthetic data at scale Building challenging evals for measuring ... of self-improving superintelligence. Advance the field through open publishing of research through papers, technical reports, blog posts, and open-source code. What… more
- Periodiclabs (Menlo Park, CA)
- …serve as the foundation for reinforcement learning. You will develop methods for synthetic data generation, distillation, and continual learning at scale. You ... scaling laws and compute‑optimal hyperparameters Generating billions of tokens of high‑quality synthetic data Building evals that correlate with downstream task… more
- Amazon (San Francisco, CA)
- …development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and ... Senior Language Engineer , Artificial General Intelligence - Data ...responsible for the final deliverables Design and conduct complex data creation tasks using synthetic and model-based… more
- Apple Inc. (San Francisco, CA)
- …a staff engineer to lead the creation of a groundbreaking tooling for synthetic data generation. You will architect the systems that create vast, diverse ... AIML - Sr. Machine Learning Infrastructure Engineer , Evaluation San Francisco, California, United States Software...and implementation of a platform dedicated to generating high-fidelity synthetic data at an unprecedented scale. You… more
- Tome (San Francisco, CA)
- …and execs Pioneer the training of new models that leverage both historical data and synthetic training data Prototype innovative, LLM-powered experiences, ... and capabilities Bonus Points Proven track record of leading successful AI/ML research projects in a product environment Publications in applied AI/ML scientific… more
- Amazon (San Francisco, CA)
- …development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and ... Language Engineer , Artificial General Intelligence - Data ...responsible for the final deliverables Design and conduct complex data creation tasks using synthetic and model-based… more
- Nuro, Inc. (Mountain View, CA)
- …About the Role Our robotics team is growing and we are looking for a Software Engineer to join our Sensor Data and Calibration team. We are searching for an ... of machine learning methods (eg, NeRF or Gaussian splatting) for generating synthetic sensor data (photorealistic images, realistic lidar and/or radar, etc.).… more
- Asimov (Boston, MA)
- …ability to design living systems. We're developing a mammalian synthetic biology platform--from cells to software--to enable biotechnologies with outsized ... we ship code. We're looking for a Senior Software Engineer to help us build it. You'll join an...join an interdisciplinary team that works directly with scientists, synthetic biologists, and computational biologists alongside your teammates in… more
- Amazon (San Francisco, CA)
- …responsibilities Develop simulations for reinforcement learning, closed‑loop simulations and synthetic data generation Implement essential robotics features, ... We are seeking a Simulation Engineer to join our AI robotics research...science or equivalent Experience with physical robots, reinforcement learning, synthetic data generation. Experience optimizing physics simulation… more