800.885.8886 sales@telvue.com
  • Facebook
  • Twitter
  • Facebook
  • Twitter
TelVue
  • Home
  • Products
    • Playback & Automation
      • HyperCaster®
      • ProVue®
        • ProVue® HD
        • ProVue® GFX
        • ProVue® CC
      • TelVue TeleCastTM 2
    • Streaming/Cloud
      • TelVue CloudCast™
      • TelVue Connect™
      • Media Exchange
      • Mobile & OTT Apps
      • TelVue SmartCaption™
    • Digital Signage
      • TelVue InfoVue™
    • Automatic Captioning
      • SmartCaption™
    • Ad Insertion
      • TelVue AdCaster™
  • Solutions
    • Cable/Telco
    • Community Media
    • Media Companies
  • Support
    • Knowledge Base
    • Webinars
    • Training Videos
    • Documentation
    • Contact Support
  • News
    • Blog
    • Press
    • Awards
  • Client Logins
    • Connect/CloudCast
    • InfoVue
  • Contact Us
    • General & Sales
    • Technical Support
Select Page

Connect/CloudCast

Connect User Manual
  • TelVue Connect™ User Manual (v2.6)
  • Organization Settings
  • Getting Started
  • Upload and Transcode Content
  • Enter Metadata
  • Metadata Presets
  • Broadcast Server Integration
  • Distribution
  • Manage Storage Space in Connect
  • Manage User Accounts
  • Email Notifications
  • Publishing and Moderation
  • Create and Schedule Series
  • Fill a Series with Episodes
  • Create and Schedule Playlists
  • TelVue Connect Media Exchange
  • How to Syndicate Content to the Media Exchange
  • Series Syndication
  • Syndication Reports
  • SmartCaption™ Automatic Closed Captioning
  • Custom Language Models
  • SmartCaption™ Caption Translation
  • Manage CloudCast Content
  • Add Live Streams to CloudCast
  • CloudCast Video Player (Connect VOD Player)
  • Share and Embed Videos
  • Manage Connect/CloudCast Categories
  • Add Chapters
  • Add Attachments
  • CloudCast Banner Ads
  • Getting Started with Roku
  • Manage Roku App
  • Getting Started with Apple TV
  • Manage Apple TV App
  • CloudCast MRSS
  • Podcasting
  • Bulk Action Feature Requirements
  • RSS Feeds
  • TelVue Connect Contributors’ Guide (PEG version)
  • TelVue Connect Contributors’ Guide (CableLabs version)
  • Cable Option
CloudCast User Manual
  • Manage CloudCast Content
  • Add Live Streams to CloudCast
  • CloudCast Video Player (Connect VOD Player)
  • Share and Embed Videos
  • Manage Connect/CloudCast Categories
  • Add Chapters
  • Add Attachments
  • CloudCast Banner Ads
  • Getting Started with Roku
  • Manage Roku App
  • Getting Started with Apple TV
  • Manage Apple TV App
  • SmartCaption™ Automatic Closed Captioning
  • Custom Language Models
  • SmartCaption™ Caption Translation
  • CloudCast MRSS
  • CloudCast Training Videos
Media Exchange
  • How to License Content from the Media Exchange
Connect APIs
  • JSON REST API – version 2 – Media
  • JSON REST API – version 2 – Videos
  • JSON REST API – version 2 – Playlists
  • JSON REST API – version 2 – Playlist Items
  • JSON REST API – version 2 – Series
  • JSON REST API – version 2 – Categories
  • JSON REST API – version 2 – Roku Applications
  • Home
  • Knowledge Base
  • Connect/Cloudcast Knowledge Base
  • Connect/CloudCast
  • CloudCast User Manual
  • Custom Language Models

Custom Language Models

Custom Language Models

The TelVue Connect Closed Captioning Service uses speech-to-text Artificial Intelligence technology to automatically create captioning from the audio in uploaded videos for both Broadcast and Streaming/OTT workflows. The core speech-to-text engine uses a general model of common, everyday words and phrases. For some applications, it can be helpful to additionally train the base model with Custom Language Models (custom words, phrases, and names) for improved accuracy.

Corpora

A Corpus (singular of Corpora) is a collection of words, names, and phrases used train the captioning engine with uncommon, unique, and domain specific terms. For example, you could define a Corpus for generic government meetings with words and phrases like “recuse”, “call to order”, “make a motion”, and “is there a second.” You could define another Corpus for a specific meeting type, such as Board of Education with the names of the board members.

To create a Corpus:

  1. Hover over the Media Tab and select Captions Generation.
  2. Select the Corpora Tab.
  3. Click the New Custom Corpus button.
  4. Name the specific corpus.
  5. Add any words, phrases, and names to be associated to this Corpus, per line, with a return/carriage return in between.
  6. Click the Save button to save your Corpus.

To edit or delete a Corpus:

  1. Hover over the Media Tab and select Captions Generation.
  2. Select the Corpora Tab.
  3. Hover over the Gear actions menu for the Corpus you would like to edit or delete.
  4. Select the desired action.

Models

A Model is a grouping of one or more Corpus. A given Corpus can be included in multiple Models. For example, you might have a Model for Planning Board, Board of Education, and City Council meetings, that share a common Generic Meeting Corpus, but each has its own specific Corpus.

To create a Custom Language Model:

  1. Hover over the Media Tab and select Captions Generation.
  2. Select the Models Tab.
  3. Click the New Language Model button.
  4. Name specific Model.
  5. Select the Spoke Language in the source material. Captions using this model with be generated in the selected Spoken Language. Available languages and dialects include: English (US), English (UK), Arabic, Portuguese (Brazilian), Chinese (Mandarin), Dutch, French, German, Italian, Japanese, Korean, Spanish (Argentinian, Castilian, Chilean, Colombian, Mexican, Peruvian). Once set, the spoken language cannot be later modified.
  6. Click the Save button.
  7. Hover over the Gear actions menu for the model you just created, and select Edit.
  8. Select one or more Custom Corpora to be included within this Model for training.
  9. Set the Customization Weight via the slider (on a scale from .1 to 1). The Customization Weight determines how weight to give words from the Custom Language Model compared to those from the base model. The higher the value, the higher the weight. The default of .3 is recommended, though you can test other settings to see what works best. If the weight is too high, it is possible that spoken language that is not actually a match for your custom model will be mistaken for a match. If the weight is too low, spoken language that matches your custom model may not be detected as such.
  10. Click the Save button to save your Language Model.

Connect Edit Custom Language Model

To edit or delete a Custom Language Model:

  1. Hover over the Media Tab and select Captions Generation.
  2. Select the Models Tab (Corpora is plural of Corpus).
  3. Hover over the Gear actions menu for the Model you would like to edit or delete.
  4. Select the desired action.

Upon adding or editing a Custom Language Model, a Language Model Update background activity will be queued to train the model. The progress of the training background activity can be tracked on the Activities Tab.

Once the Language Model Update is complete, subsequent Captions creation will use the updated models.

Setting the Default Custom Model

If you have the “Auto generate captions for all uploaded files” option enabled, the Default Custom Model setting will determine which if any Custom Language Model is used. Likewise, when manually triggering captioning, the Default Custom Model setting determines which Custom Language Model if any is the default selection, that you can then choose to override.

To set the Default Custom Model:

  1. Hover over the Administration Tab and select Organization Settings.
  2. Select the Captions tab.
  3. Select your Default Custom Language Model, or select [No Custom Model] to not use a Custom Model by default.
  4. Click the Save button to save your changes.

Selecting a Custom Language Model for Captioning

When manually trigger captioning for media using the Bulk Generate Closed Captions or individual “generate closed caption file” actions, you will be presented with the option to select one of your Custom Language Models. If you do not want to use a Custom Language Model, select “No Custom Model”.

Share This Article :
  • Facebook
  • Twitter
  • LinkedIn
  • Pinterest
Still stuck? How can we help?

How can we help?

Updated on December 1, 2022
SmartCaption™ Automatic Closed CaptioningSmartCaption™ Caption Translation

PRODUCTS

  • Playback & Automation
    • HyperCaster®
    • ProVue®
      • ProVue® HD
      • ProVue® GFX
      • ProVue® CC
    • TelVue® TeleCastTM 2
  • Streaming/Cloud
    • TelVue CloudCast™
    • TelVue Connect™
    • Media Exchange
    • Mobile & OTT Apps
    • TelVue SmartCaption™
  • Digital Signage
    • TelVue InfoVue™
  • Automatic Captioning
    • SmartCaption™
  • Ad Insertion
    • AdCaster™

SUPPORT

  • Contact Support
  • Knowledge Base
  • Training Videos
  • Product Documentation

ABOUT

  • Contact Us
  • TelVue News for Hyperlocal Broadcast, PEG news, and TV Technology
  • Press Releases
  • Executive Profiles
  • Sales Team
  • Company
  • Partners

BLOG POSTS

  • HyperCaster Integrated Streaming & Social Streaming Automation
  • Live Dual-language Captioning, Multiple Trim Editing, and Multistreaming Automation with HyperCaster™ 8.3
  • AIO Integrated Streaming, Program Guide & Clock Overlays with HyperCaster™ 8.2
  • Channel Streaming Rights Management with HyperCaster™ 8.1
  • Enhanced Captioning Accuracy and Security with HyperCaster™ 8.0
  • Facebook
  • Twitter
16000 Horizon Way, Suite 100, Mt. Laurel, NJ 08054 - TelVue Corporation ® All Rights Reserved 2020