Design, ship, and maintain Python/FastAPI services for LLM workflows and 3D context retrieval Optimize latency and throughput with async pipelines and efficient GPU usage Enforce auth-first design for APIs and websockets Own GCP operations including Cloud Run/Functions, Pub/Sub, Postgres, Redis, CI/CD Establish observability through logging, tracing, and dashboards Partner with 3D/ML engineers to productize models via APIs