Native-level fluency in Javanese language. Strong comprehension of English. Residing in Indonesia. Commitment to working 25 hours per week.
Responsibilities:
Review short, pre-segmented datasets. Evaluate model-generated replies based on Tone or Fluency. Rate user prompts and two model replies using a five-point scale. Provide short rationales for extreme ratings.