Datasets:

joujiboi
/

japanese-anime-speech

Tasks:

Modalities:

Formats:

Languages:

Size:

Tags:

Libraries:

License:

Any stats about improvement in WER going from V1, V2, ..., V5?

by captain-ballot - opened Sep 13

Sep 13

Hello, I'm planning to collect a dataset for another language and would like to know if anyone has any training results to share.

Have you seen diminishing returns to increasing the training dataset from 20 to 30 to 50 to 60 to 110 hours?

Or does the improvement seem linear with respect to training dataset size?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment