Problem training GigaSpeech in Kaldi #1620

YangangCao · 2024-08-14T15:53:36Z

Hi dear author,

I only want to train a small acoustic model using GigaSpeech, but I ran into some problems when running the GigaSpeech recipe in Kaldi.

    if [ $stage -le 2 ]; then
      echo "======Train lm START | current time : $(date +%Y-%m-%d-%T)=============="
      mkdir -p $lm_dir || exit 1;
      sed 's|\t| |' data/$train_combined/text | \
        cut -d " " -f 2- > $lm_dir/corpus.txt || exit 1;
      echo "break point1"
      local/lm/train_lm.sh \
        --cmd "$train_cmd" --lm-order $lm_order \
        $lm_dir/corpus.txt $lm_dir || exit 1;
      echo "break point2"
      echo "======Train lm END | current time : $(date +%Y-%m-%d-%T)================"
    fi

This step requires me to install SRILM and train a language model (when I trained on LibriSpeech, I did not have to do either of these). Is it necessary? I only want to train an acoustic model and do not need to compute WER. For now, I have skipped this step.

Thanks very much!


nshmyrev · 2024-08-15T09:20:10Z

You can skip this step.

Still, it is recommended to install SRILM and evaluate the model; evaluation is an important part of accuracy testing.
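One way to make the LM stage optional rather than deleting it is to guard it with a switch. The following is only a minimal sketch: `train_lm` and `run_stages` are hypothetical names that do not exist in the Kaldi recipe, the `echo` lines stand in for the real commands, and it assumes the later training stages do not read anything from `$lm_dir`.

```shell
#!/usr/bin/env bash
# Sketch only: train_lm and run_stages are hypothetical, not part of the Kaldi recipe.

run_stages() {
  local stage=$1      # first stage to run, like the recipe's $stage
  local train_lm=$2   # hypothetical switch: "true" builds the LM, anything else skips it

  if [ "$stage" -le 2 ]; then
    if [ "$train_lm" = true ]; then
      # The real recipe would call local/lm/train_lm.sh here (requires SRILM).
      echo "stage 2: training LM"
    else
      echo "stage 2: skipping LM training"
    fi
  fi

  if [ "$stage" -le 3 ]; then
    # Placeholder for whatever the recipe runs after the LM stage.
    echo "stage 3: continuing"
  fi
}

# Run from the beginning with LM training switched off.
run_stages 0 false
```

Keeping the stage in place behind a switch makes it easy to turn LM training back on later when you want to decode and measure WER.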

Also, you probably want to use a more modern model instead of the GigaSpeech recipe. There are many of them, and the right choice depends on your requirements. They will be much more accurate.

YangangCao · 2024-08-15T13:15:33Z

Hi dear author, thanks for your reply. My goal is to train a text-limited ASR model. As far as I know, only the chain model supports that. Is there any other, more accurate method?

nshmyrev · 2024-08-19T20:37:25Z

A modern RNN-T or Conformer CTC model should be more accurate.
