BS in Computer Science or equivalent professional experience. 6+ years designing, building, and operating large-scale distributed systems in production. Deep, hands-on expertise with Ray or Spark. Expert-level Python proficiency with strong software engineering fundamentals. Proven experience optimizing and scaling production data pipelines processing terabytes or petabytes of data. Strong SQL and data manipulation skills. Experience with cloud infrastructure (AWS preferred: S3, EC2, EKS, EMR, IAM). Experience mentoring junior engineers.