β Back to Gigs
Description
### Role Overview the platform is collaborating with a leading AI lab to contract experienced professionals for an AI model evaluation project. Contractors will assess the quality, accuracy, and safety of AI-generated responses across real estate related subjects. The project offers an opportunity to directly improve the reliability of AI systems in high-stakes contexts where inaccurate information carries serious risk. Key Responsibilities
- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs
- Score and rank multiple model responses using structured rubrics across dimensions
- Provide written justifications with specific evidence for each evaluation Ideal Qualifications
- Professional experience applying domain expertise in a practitioner or advisory capacity
- Familiarity with industry-specific standards, regulations, or clinical guidelines
- Strong written communication and critical reasoning skills More About the Opportunity
- Expected commitment: ~20 hours/week Application Process
- Submit your resume to begin
- Complete the Model Response Evaluation assessment
Details
Category
STEM Tutoring
Location
Remote
Employment Type
Independent Contractor
Languages Required
πΊπΈ English
Posted
4/2/2026
Interested? Apply directly.
Apply Now βRelated Opportunities
Review
βIs Mercor Legit?
Pay Data
βHow Much Do AI Jobs Pay?
Guide
βHow to Get Started