AI Benchmark Engineer - Native Language Specialist, Japanese

New

LILT (Production)AI, Language Technology

Japan (Remote)ContractMiddle

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

1+ years in software or prompt engineering.
Proven track record at leading technology companies and/or graduation from top-tier engineering universities.
Native or near-native fluency in Japanese, with a deep understanding of its grammar, register, and phrasing rules.
High English proficiency.
Strong proficiency in Python.
Strong proficiency in standard shell scripting.
Strong proficiency in data processing.
Extensive experience with Terminal/CLI-based development workflows.
Working familiarity with coding agents.
Deep technical understanding of multilingual text processing pitfalls.
Experience with encoding/decoding robustness and Unicode normalization.
Knowledge of locale-dependent conventions (collation, casing, non-Gregorian dates).
Experience with Text I/O, toolchain interoperability, and safe string operations.

Design, build, and validate AI benchmarks focused on multilingual software challenges.
Create high-signal, high-quality tasks to test model's ability to handle multilingual environments.
Build realistic task environments using datasets and files in Japanese.
Find failure points where AI does not work in Japanese.
Support the development of robust reference implementations.
Write highly reliable, deterministic verifier scripts.
Analyze execution logs and calibrate task difficulty (Easy to Very Hard).
Participate in a rigorous, 4-layer human quality control process (creation, human review, calibration review, and audit).

View Full Description & ApplyYou'll be redirected to the employer's site