Speech-To-Text Filter

1. Download the Needed Models

Download the required models from either the package provided by Capella or from GitHub:

If you downloaded the Capella provided "whisper-models" package, there will be 5 models available.
These models are multilingual and should work with different languages.

Click "Browse..." under "Output Filename" and select a destination for the output file.
Name and set the extension of the output file to anything you want.

Under "Speech Extraction Model", choose a model to use. You can select any of the 5 packaged models.
- Note: Smaller models are much faster but less accurate in extraction.
- Note: Extraction quality also depends on the language.

A: This error means your CPU is too old to use this feature. The CPU must support at least AVX2 to run this function.

A: Yes, you can use a string replacement variable to make the output file name the same as the source name.

For example, when using the "Audio Speech Extraction" filter in WatchFolder, you can set the output path as:

C:\Users\Public\Documents\CapellaOutput\%sourceName%.mp4

This will automatically replace %sourceName% with the original file name.