【S2ST】Direct Speech-to-Speech Translation With Discrete Units
Contents: Abstract; Introduction; Related work; Model; Speech-to-unit translation (S2UT) model; Multitask learning; Unit-based vocoder; Experiments; Data; System setup; Baseline; ASR; M