My research interests encompass the extensive domain of speech and language intelligence, which includes speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, audio adversarial attacks & defense, among other related areas.
liusongxiang / deepul Goto Github PK
View Code? Open in Web Editor NEWThis project forked from rll/deepul