Abstract: We propose WHISPER-GPT: A generative large language model (LLM) for speech and music that allows us to work with continuous audio representations and discrete tokens simultaneously as part ...
Entering text into the input field will update the search result below Entering text into the input field will update the search result below ...