The TelVue Connect Closed Captioning Service uses speech-to-text Artificial Intelligence technology to automatically create captioning from the audio in uploaded videos for both Broadcast and Streaming/OTT workflows. The core speech-to-text engine uses a general model of common, everyday words and phrases. For some applications, it can be helpful to additionally train the base model with Custom Language Models (custom words, phrases, and names) for improved accuracy.
A Corpus (singular of Corpora) is a collection of words, names, and phrases used to train the captioning engine with uncommon, unique, and domain-specific terms. For example, you could define a Corpus for generic government meetings with words and phrases like “recuse”, “call to order”, “make a motion”, and “is there a second.” You could define another Corpus for a specific meeting type, such as Board of Education, containing the names of the board members.
To create a Corpus:
To edit or delete a Corpus:
A Model is a grouping of one or more Corpora. A given Corpus can be included in multiple Models. For example, you might have separate Models for Planning Board, Board of Education, and City Council meetings that all share a common Generic Meeting Corpus, while each also includes its own specific Corpus.
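The relationship between Corpora and Models can be pictured with a small sketch. This is only an illustration of the grouping described above, not TelVue Connect's actual data model or API: the `Corpus` and `Model` classes, the `vocabulary` method, and the board member names are all hypothetical placeholders.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Corpus:
    """Hypothetical stand-in for a Corpus: a named set of custom terms."""
    name: str
    terms: tuple[str, ...]


@dataclass
class Model:
    """Hypothetical stand-in for a Model: a grouping of one or more Corpora."""
    name: str
    corpora: list[Corpus] = field(default_factory=list)

    def vocabulary(self) -> set[str]:
        """Collect every term contributed by the Corpora in this Model."""
        return {term for corpus in self.corpora for term in corpus.terms}


# One shared Corpus plus one specific Corpus per meeting type.
generic = Corpus("Generic Meeting", ("recuse", "call to order", "make a motion"))
boe = Corpus("Board of Education", ("Jane Example",))      # placeholder name
council = Corpus("City Council", ("John Sample",))         # placeholder name

# The same generic Corpus is included in both Models.
boe_model = Model("Board of Education", [generic, boe])
council_model = Model("City Council", [generic, council])
```

In this sketch, both Models pick up the shared meeting terms, while each board member name appears only in the Model that includes its specific Corpus.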
To create a Custom Language Model:
To edit or delete a Custom Language Model:
Upon adding or editing a Custom Language Model, a Language Model Update background activity will be queued to train the model. The progress of the training background activity can be tracked on the Activities Tab.
Once the Language Model Update is complete, subsequent caption generation will use the updated model.
If you have the “Auto generate captions for all uploaded files” option enabled, the Default Custom Model setting determines which Custom Language Model, if any, is used. Likewise, when you manually trigger captioning, the Default Custom Model setting determines which Custom Language Model, if any, is selected by default; you can then choose to override that selection.
To set the Default Custom Model:
When manually triggering captioning for media using the Bulk Generate Closed Captions or individual “generate closed caption file” actions, you will be presented with the option to select one of your Custom Language Models. If you do not want to use a Custom Language Model, select “No Custom Model”.
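The selection rules described above can be summarized as a short sketch. It is an illustration of the documented behavior only, under the assumption that a manual choice always takes precedence over the default; the `resolve_model` function and its parameters are hypothetical and do not correspond to any TelVue Connect API.

```python
from typing import Optional

# Label of the menu option that disables custom models, as named in this article.
NO_CUSTOM_MODEL = "No Custom Model"


def resolve_model(default_model: Optional[str],
                  manual_choice: Optional[str] = None) -> Optional[str]:
    """Return the Custom Language Model name to use, or None for the base model only.

    default_model: the Default Custom Model setting (None if no default is set).
    manual_choice: the selection made when manually triggering captioning
                   (None for automatic captioning, where no choice is offered).
    """
    if manual_choice is not None:
        # A manual selection overrides the default, including opting out entirely.
        return None if manual_choice == NO_CUSTOM_MODEL else manual_choice
    # Automatic captioning, or an untouched manual dialog, falls back to the default.
    return default_model
```

For example, `resolve_model("City Council")` yields the default Model, while `resolve_model("City Council", NO_CUSTOM_MODEL)` yields `None`, meaning only the base speech-to-text model is used.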