Familiarity with PyTorch, Huggingface, Gemma models, LORA, VLLM, Skypilot, Marimo Proficient in Python, FastAPI Experience with Google Cloud Platform (GCP) Knowledge of PostgreSQL, DuckDB Experience with Cloud Run
Responsibilities:
Finetuning small language models Improving the quality of existing data using scalable approaches Adding new signals by scrubbing, matching, and normalizing data Pushing solutions into production environments, touching data pipelines and/or backend systems