Type: Hourly contract
Compensation: $36.16 per hour
Location: Remote
Commitment: Part-time
Role Responsibilities
* Evaluate LLM-generated responses across diverse general topics
* Conduct fact-checking using trusted public sources and external tools
* Annotate strengths, weaknesses, and factual inaccuracies in model outputs
* Assess reasoning quality, clarity, tone, and completeness
* Ensure responses align with conversational guidelines and system standards
* Apply structured taxonomies, benchmarks, and evaluation frameworks consistently
Requirements
* Native or ILR 5 / CEFR C2 fluency in Italian
* Fluency in English (written and spoken)
* Strong writing skills with the ability to provide nuanced feedback
* Experience using large language models (LLMs)
* Strong analytical thinking and attention to detail
* Comfort working across varied topics and structured evaluation frameworks