• Apple Inc. (San Francisco, CA)
    …Learning Research Engineer to design and develop safe AI benchmarking methodologies. This role involves collaboration with various teams to implement responsible … research background, proficiency in Python, and experience in AI/ML model evaluation. The position offers competitive compensation, stock option opportunities,…
    (01/13/26)
  • Apple Inc. (San Francisco, CA)
    …models. Description: Our team, part of Apple Services Engineering, is looking for an ML Research Engineer to lead the design and continuous development of … automated safety benchmarking methodologies. In this role, you … research background in empirical evaluation, experimental design, or benchmarking; strong proficiency in Python (pandas, NumPy, Jupyter, PyTorch,… (a rough Python sketch of such a benchmarking harness follows these listings)
    (01/13/26)
  • Labelbox (San Francisco, CA)
    …and safety across existing Rust codebases. Collaborate with data, research, and engineering teams to support model training and evaluation workflows. Identify … with data annotation, data quality, or evaluation systems. Familiarity with AI/ML workflows, model training, or benchmarking pipelines. Experience with…
    (01/13/26)
  • Labelbox (San Francisco, CA)
    …and safety across existing Python codebases. Collaborate with data, research, and engineering teams to support model training and evaluation workflows. Identify … with data annotation, data quality, or evaluation systems. Familiarity with AI/ML workflows, model training, or benchmarking pipelines. Experience with…
    (01/13/26)
  • Scale AI, Inc. (San Francisco, CA)
    …learning, especially in generative AI, evaluation, or oversight. Significant experience leading ML research in academia or industry. Strong written and verbal … research team is shaping the next generation of safety science for frontier AI models and works at … research at top-tier venues and contribute to open-source benchmarking initiatives. Remain deeply engaged with the research…
    (01/13/26)
  • Scale AI, Inc. (San Francisco, CA)
    …of frontier AI research, product, and go-to-market. You'll partner closely with ML teams in high-stakes meetings, scope and pitch solutions to top AI labs, and … technical execution across accounts. You have a deep technical background in applied AI/ML: 5-10+ years in research, engineering, solutions engineering, or…
    (01/13/26)
  • Labelbox (San Francisco, CA)
    …supporting AI training and benchmarking; improve performance, scalability, and safety across existing codebases; collaborate with data, research, and … Experience with data annotation, data quality, or evaluation systems; familiarity with AI/ML workflows, model training, or benchmarking pipelines; experience with… (a simple annotation quality check is sketched after these listings)
    (01/13/26)
  • Labelbox (San Francisco, CA)
    …and safety across existing C++ codebases. Collaborate with data, research, and engineering teams to support model training and evaluation workflows. Identify … with data annotation, data quality, or evaluation systems. Familiarity with AI/ML workflows, model training, or benchmarking pipelines. Experience with…
    (01/13/26)
  • Labelbox (San Francisco, CA)
    …and safety across existing C# codebases. Collaborate with data, research, and engineering teams to support model training and evaluation workflows. Identify … experience with data annotation, data quality, or evaluation systems. Familiarity with AI/ML workflows, model training, or benchmarking pipelines. Experience with…
    (01/13/26)
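Several of the listings above, the Apple ML Research Engineer role in particular, ask for automated safety benchmarking built on Python tooling such as pandas, NumPy, Jupyter, and PyTorch. As a rough illustration only, here is a minimal sketch of what such a harness can look like; the model_generate() stub, the is_refusal() heuristic, and the prompts.csv schema are assumptions made for the example and do not describe Apple's actual methodology or stack.

    # Minimal safety-benchmarking sketch. model_generate(), is_refusal(), and the
    # prompts.csv schema ("prompt", "category" columns) are hypothetical stand-ins.
    import pandas as pd

    def model_generate(prompt: str) -> str:
        # Placeholder for the model under evaluation.
        return "I can't help with that."

    def is_refusal(response: str) -> bool:
        # Toy safety check: count an explicit refusal as the safe outcome.
        text = response.lower()
        return "can't help" in text or "cannot help" in text

    prompts = pd.read_csv("prompts.csv")          # columns: prompt, category
    prompts["response"] = prompts["prompt"].map(model_generate)
    prompts["safe"] = prompts["response"].map(is_refusal)

    # Per-category safety rate, the kind of summary a benchmark run would report.
    report = prompts.groupby("category")["safe"].mean().rename("safety_rate")
    print(report)

In a real harness the refusal heuristic would presumably be replaced by a proper grader or rubric, and results tracked across model versions.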
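The Labelbox listings center on data annotation quality and evaluation pipelines. The sketch below shows one simple quality gate: compute majority-label agreement per item and flag low-agreement items for re-annotation. The annotations.csv schema and the 0.7 threshold are assumptions for illustration, not Labelbox's product, schema, or API.

    # Annotation quality-check sketch. The annotations.csv schema
    # ("item_id", "annotator", "label") and the 0.7 threshold are assumptions.
    import pandas as pd

    annotations = pd.read_csv("annotations.csv")

    def majority_agreement(labels: pd.Series) -> float:
        # Fraction of annotators that chose the most common label for an item;
        # a crude stand-in for formal metrics such as Krippendorff's alpha.
        counts = labels.value_counts()
        return counts.iloc[0] / counts.sum()

    agreement = annotations.groupby("item_id")["label"].apply(majority_agreement)

    # Flag items whose agreement falls below the (assumed) threshold for review.
    flagged = agreement[agreement < 0.7]
    print(f"{len(flagged)} of {len(agreement)} items flagged for re-annotation")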