Bachelor’s or Master’s degree in Computer Science or a related field. 4–6 years of backend development experience. Proficiency in Java, Golang, or Python with strong coding and system design fundamentals. Experience designing and scaling distributed systems at production scale. Exposure to LLM inference setups (e.g., vLLM, Hugging Face Inference, Triton). Strong debugging, profiling, and performance tuning skills for latency-sensitive applications. Knowledge of storage systems, query optimization, and caching strategies. Hands-on experience with AWS (preferred), Kafka, and CI/CD pipelines. Ability to work autonomously and deliver in fast-paced environments. Passion for mentoring engineers and leading by example.