Adapt models for new conditioning inputs (emotion, speed, prosody, speaker control). Fine-tune and optimize speech models using DPO, LoRA, and other parameter-efficient methods. Implement post-training optimization techniques (quantization, pruning, distillation). Integrate and test novel architectures (neural codecs, diffusion, flow-matching). Design and implement new evaluation metrics for TTS systems, including automated MOS prediction. Stay updated with latest research in audio diffusion, autoregressive models, neural codecs, and multimodal LLMs.