As a Gen AI Data Annotation Analyst, you will play a critical role in developing high-quality datasets that power the next generation of Large Language Models (LLMs). You will evaluate, generate, and refine multi-modal data (text, image, or video) to ensure AI outputs are accurate, safe, and culturally resonant for the Polish locale.
Key Responsibilities
* Multi-Modal Annotation: Label and categorize high-quality training data across text, image, and video formats with surgical precision.
* Content Generation: Draft grammatically flawless, creative, and factually accurate text in both Polish and English, adapting tone and style for specific project needs.
* Edge-Case Resolution: Apply logical reasoning to resolve ambiguous data scenarios where guidelines may not explicitly cover a specific nuance.
* Continuous Feedback: Identify and report bugs within internal annotation tools and suggest workflow optimizations.
* Cross-Functional Sync: Collaborate with Language Leads and Project Managers to calibrate on quality benchmarks and project pivots.
System Requirements
Full set of desktop/Laptop with Windows 11 Pro. Bring a compliant device. Windows 11 Pro (Home editions not allowed) or macOS 12+, 8 GB RAM, 50 GB free disk. Tablets/Chromebooks/phones won’t be accepted.
Network
Stable Internet connection which should have a speed of up to 40MBPS minimum.
* Linguistic Agility: Ability to switch between creative, technical, and professional writing styles effortlessly.
* Analytical Rigor: A \"detective mindset\" for spotting subtle inaccuracies or biases in AI-generated content.
* Domain Breadth: Comfortable handling content ranging from social media trends to high-school-level science and technical documentation.
* Tool Proficiency: Quick to master complex web-based annotation platforms.
* Education: Bachelor’s degree or equivalent (Linguistics, Communications, or a related field preferred).
* Language Mastery:
* Polish: Native speakers of the relevant language and that market. Advanced level (CEFR B2 minimum; C1 preferred).
Preferred Qualifications
* Direct experience in RLHF (Reinforcement Learning from Human Feedback) or LLM fine-tuning.
* Experience moderating sensitive, nuanced, or complex datasets.
* Familiarity with the current Generative AI landscape (Terminologies, Principles, LLM limitations, etc.).
* Familiarity with the cultural and professional landscape of the relevant country, either as their country of origin or through extended residency.
* Foundational understanding of industrial operations in the Poland.
What We Offer
* An innovative and supportive global working environment.
* Opportunities for continuous learning and professional growth.
* Competitive compensation and flexible working arrangements.
* Engagement in impactful projects at the forefront of AI technology.
Join iMerit to be part of shaping high-quality datasets powering the next generation of generative AI solutions. If precision, critical analysis, and innovation excite you, we encourage you to apply!
#J-18808-Ljbffr