Our partner is seeking versatile and detail-oriented professionals to collaborate with our team on a range of cutting-edge AI evaluation projects. As a Generalist, you’ll participate in the development and assessment of AI systems across diverse domains — helping test, refine, and improve how advanced models understand and reason about real-world workflows.
1. Key Responsibilities
Evaluate AI-generated outputs for accuracy, clarity, and alignment with real-world reasoning
Contribute written assessments and structured feedback on model performance
Identify conceptual, logical, and stylistic strengths and weaknesses in AI responses
Collaborate asynchronously with the partner’s research and operations teams to maintain quality and consistency across evaluations
Exercise strong analytical judgment, communicate precisely, and apply evaluation criteria consistently
2. Ideal Qualifications
Strong English fluency and written communication skills
Sharp attention to detail and ability to identify subtle errors or inconsistencies
Analytical and critical thinking skills across a wide range of topics
No formal educational background or specific degree required — curiosity, clarity, and reasoning skill are key
3. Timeline
This listing does not qualify you for any particular project. It is an opportunity to join an important part of our talent pool.
Start Date: Rolling (immediate opportunities available)
Duration: Varies by project (1–3 months typical)
Commitment: Flexible and part-time (~10–20 hours/week, with potential to increase)
Schedule: Fully remote and asynchronous
4. Compensation & Contract
Paid opportunities across multiple projects, typically $30–$100 USD/hour, depending on domain expertise and task complexity
Independent contractor arrangement
Daily payments via Stripe Connect
5. Application & Qualification Process
Submit Application: Upload your resume or professional summary
Qualifying Assessment: Complete a short, unpaid work trial introducing a core data type and task format used in projects
Ongoing Opportunities: Candidates who perform well will be considered for future paid work across AI evaluation, training, and research collaborations