Job description
**Pay:** $20-$35 per hour (USD).
**Job Title:** AI Evaluation Specialist
**Job Type:** Contractor
**Location:** Remote
**Job Summary:** In this role, you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input.
**Key Responsibilities:**
1. Design and implement self-contained evaluation tasks, including prompts, supporting files, and detailed grading rubrics to assess AI performance on practical computer-based workflows.
2. Define clear, unambiguous written criteria for what constitutes successful and unsuccessful task completion across diverse administrative and workflow scenarios.
3. Meticulously observe and document AI agent behaviors, producing crisp, precise summaries and reports in high-quality English.
4. Iterate and refine evaluation tasks and rubrics based on feedback and team collaboration to ensure robust benchmarking methodologies.
5. Work cross-functionally across a wide range of domains, adapting evaluation frameworks as project requirements evolve.
6. Collaborate with the customer's team to share insights and help drive continuous improvement in AI evaluation techniques.
7. Champion meticulousness, structured observation, and clear written communication throughout all project deliverables.
**Required Skills and Qualifications:**
1. Minimum 3 years of experience in roles emphasizing written precision and structured thinking—such as paralegal, executive assistant, junior analyst, librarian, document archival specialist, research assistant, technical writer, QA analyst, etc.
2. Native or fluent in English writing, with a demonstrated ability to produce observations that are succinct, specific, and unambiguous.
3. Proven skill in designing or applying rubric-based evaluation, grading against set criteria, or building structured scoring frameworks.
4. High attention to detail and ability to notice subtle patterns or inconsistencies others might miss.
5. Exceptional written and verbal communication skills, especially for documenting nuanced observations and feedback.
6. Fluency in navigating computers, common SaaS tools, web browsers, file management, and document editing platforms.
7. Strong self-direction, with the ability to independently take ownership of ambiguous or loosely defined projects.
**Preferred Qualifications:**
1. Prior experience evaluating AI outputs or participating in technology-driven process improvement projects.
2. Background in developing or refining evaluation rubrics or scoring methodologies.
3. Comfort working across multiple domains and adapting quickly to new types of workflow challenges.
Ready to apply?
You'll be taken to Micro1 career page to submit your application. We'll also add this to your tracker if you want.

Originally posted via 4 Day Week. View source ↗
Keep looking
All roles at Micro1 →More roles like this

Physics Research Expert (Solver)
VerifiedMicro1 · Remote · SeniorPythonCommunicationProblem SolvingResearchTechnical Writing+5 more$80–$110/hr2w agovia 4 Day Week
Journalist
VerifiedMicro1 · Remote · MidCommunicationResearchCritical ThinkingJournalismEditing+6 more$20–$70/hr3w agovia 4 Day Week
CNC Manufacturing Documentation Reviewer
Micro1 · Remote · MidTechnical DrawingQuality ControlWritten CommunicationGD&TDocumentation review+9 more$40–$50/hr6d agovia 4 Day Week
Household Data Specialist - Video Capture
Micro1 · RemoteCommunicationAttention to DetailManual DexterityFeedback incorporationtask precision+3 more$30/hr1w agovia 4 Day Week