Self-training (ST) and self-supervised learning (SSL) methods have demonstrated strong improvements in automatic speech recognition (ASR). Despite these advances, to the best of our knowledge, there is no analysis of how the composition of the labelled and unlabelled datasets used in these methods affects the results. In this work we aim to analyse the effect of the number of speakers in the training data on a recent SSL algorithm (wav2vec 2.0) and a recent ST algorithm (slimIPL). We perform a systematic analysis on both labelled and unlabelled data by varying the number of speakers while…