Appen Logo

Appen

Persian LLM Evaluator

Sorry, this job was removed at 08:07 a.m. (EST) on Sunday, Jun 15, 2025
Be an Early Applicant
Remote
Hiring Remotely in United States
Remote
Hiring Remotely in United States

Similar Jobs

6 Hours Ago
Remote or Hybrid
Santa Clara, CA, USA
60K-80K
Senior level
60K-80K
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Strategic Sourcing Manager will oversee IT hardware sourcing, supplier management, and category strategies while integrating AI and driving sustainability efforts in procurement processes.
Top Skills: Ai-Powered ToolsIaasIt HardwareRfx ProcessesSupplier Negotiation StrategiesSupply Chain Processes
6 Hours Ago
Remote
USA
72K-108K Annually
Senior level
72K-108K Annually
Senior level
Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Cybersecurity • Data Privacy
Manage multiple customer-facing projects, prepare clients for deployment, track deliverables, and ensure quality and team synergy throughout the project lifecycle.
Top Skills: Google SuiteMicrosoft Applications
10 Hours Ago
Remote or Hybrid
Pennsylvania, USA
55K-129K Annually
Mid level
55K-129K Annually
Mid level
AdTech • Digital Media • Marketing Tech
The Client Solutions Engineer develops technical solutions for clients in the media industry, optimizing product usage and providing technical guidance while managing implementations and documentation.
Top Skills: GrafanaHTML5HTTPKibanaRestful ApisSQLXML
Join Project Spearmint, a multilingual AI response evaluation project reviewing large language model (LLM) outputs in different languages, focused on either Tone or Fluency. Native-level fluency in a target language, along with strong English comprehension, is required.

As an evaluator, you will review short, pre-segmented datasets and assess model-generated replies based on specific quality dimensions. Your input will help validate evaluation frameworks and establish baseline quality metrics for future model development.

Key Responsibilities:
- Evaluate model replies in your native language based on either Tone or Fluency.
- Assess the overall quality, correctness, and naturalness of responses.
- Read the user prompt and two model replies, then rate each using a five-point scale.
- Provide brief rationales for any extreme ratings.

Project Breakdown:

Batch 1 – Tone: Determine whether replies are helpful, insightful, engaging, and fair. Flag formality mismatches, condescension, bias, or other tonal issues.

Batch 2 – Fluency: Assess grammatical accuracy, clarity, coherence, and natural flow.


This is a project-based opportunity with CrowdGen, where you will join the CrowdGen Community as an Independent Contractor. If selected, you will receive an email from CrowdGen regarding the creation of an account using your application email address. You will need to log in to this account, reset your password, complete the setup requirements, and proceed with your application for this role.

Make an impact on the future of AI – apply today and contribute from the comfort of your home.

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account