Senior Machine Learning Engineer (AI Training)

About The Role
What if your deep expertise in machine learning could directly shape how AI reasons, plans, and solves complex problems for millions of people? We're looking for Senior Machine Learning Engineers to author high-fidelity reasoning traces that teach cutting-edge LLMs how to think, step by step and decision by decision. This is a fully remote, flexible contract role built for experienced ML professionals who understand model behavior at a deep level and can translate that understanding into structured, actionable training data.

Organization: Alignerr
Type: Hourly Contract
Location: Remote
Commitment: 10–40 hours/week

What You'll Do
Author complex, high-fidelity reasoning traces that capture how an AI should plan, use tools, and make decisions across sophisticated real-world tasks
Decompose difficult technical problems into clear, logical, well-documented steps that serve as ground truth for model training
Review structured traces produced by others and mentor their authors, ensuring accuracy, consistency, and well-documented planning
Design data strategies that help LLMs navigate intricate, multi-step decision-making scenarios
Collaborate with a global team of ML professionals working at the frontier of AI research

Who You Are
Experienced machine learning engineer or researcher with a strong foundation in model reasoning, training, or evaluation
Able to break down complex, ambiguous problems into structured, logical reasoning chains
Deeply familiar with how LLMs work, including their failure modes, capabilities, and training dynamics
Detail-oriented and rigorous in your thinking: you care about getting things exactly right
A clear, precise written communicator who can articulate complex reasoning accessibly
Self-directed and productive when working independently in an async environment

Nice to Have
Prior experience with data annotation, data quality workflows, or model evaluation systems
Familiarity with RLHF, chain-of-thought prompting, or related alignment techniques
Top-tier Kaggle competition results (Grandmaster or Master level) demonstrating elite model performance and feature engineering skills
Background in AI safety, interpretability, or responsible AI development

Why Join Us
Work at the frontier: your contributions directly influence how next-generation AI models reason and behave
Fully remote and flexible: work when and where it suits you, on your own schedule
Freelance autonomy with the structure and purpose of meaningful, high-impact technical work
Collaborate with world-class ML researchers and AI labs on projects that matter
Potential for ongoing work and contract extension as new projects launch