Custom model training with multiple datasets
I'm trying to train a custom speech model using about 5 GB of data. I realize uploads are limited to 2 GB per archive, so I've split my data into multiple archives to upload them separately. However, I don't see any way in the UI to train a model using multiple archives. In particular, the "select training data" flow uses radio buttons, so I am only able to select a single dataset. Is it possible to train a model with multiple archives of data?

1 comment
Joe commented
The documentation implies that I should be able to select multiple datasets in the UI:
> From the Select training data page, choose one or multiple audio + human-labeled transcription datasets that you'd like to use for training.
But I cannot: the UI uses radio buttons, which prevent me from selecting more than one. If I submit a POST request to the API with multiple datasets in the request body, I get a 400 error, even though a request with a single dataset in the `datasets` list goes through fine.
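
For anyone trying to reproduce this, here is a minimal sketch of the two request bodies being compared. Only the `datasets` field name comes from the comment above; the `name` field, the `id` key inside each entry, the dataset IDs, and the helper `build_training_payload` are all hypothetical placeholders, and the real API may expect a different shape. No request is sent, since the endpoint is not given in this thread.

```python
import json


def build_training_payload(name, dataset_ids):
    """Build a model-creation request body containing a `datasets` list.

    Assumption: each dataset is referenced as {"id": ...}. Only the
    `datasets` field name is taken from the comment; the rest is
    illustrative.
    """
    if not dataset_ids:
        raise ValueError("at least one dataset id is required")
    return {
        "name": name,
        "datasets": [{"id": dataset_id} for dataset_id in dataset_ids],
    }


# Case reported to work: a single entry in `datasets`.
single = build_training_payload("my-model", ["dataset-part-1"])

# Case reported to fail with a 400: several entries in `datasets`.
multiple = build_training_payload(
    "my-model", ["dataset-part-1", "dataset-part-2", "dataset-part-3"]
)

print(json.dumps(single, indent=2))
print(json.dumps(multiple, indent=2))
```

Posting both bodies to the same endpoint and comparing the response text of the 400 should show whether the rejection is about the list length or about something else in the body.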