Skip to main content

AI Tools Menu

The AI Tools Menu provides access to artificial intelligence-powered features including automatic transcription, translation, shot change detection, and audio description rendering. These tools leverage cloud-based AI services and local processing capabilities to accelerate caption and subtitle creation workflows. Most AI features require an active internet connection and are only available in the online version of Closed Caption Creator.

Batch Transcription​

Batch Transcription enables simultaneous transcription of multiple source files in a single operation, which is essential for high-volume production environments. This feature is available exclusively to users with Pro or Enterprise subscriptions and requires online connectivity.

The batch transcription interface allows you to queue multiple audio or video files for transcription using your selected AI provider. Each file is processed independently, and you can track the status of each job through the transcription dashboard. Once transcription jobs are complete, you can import the results into organized event groups or download the transcript files for use in other projects.

Batch processing reduces the overhead of submitting individual transcription jobs and allows you to efficiently process entire seasons of content, multiple language versions, or large media libraries. The system manages job queuing and completion tracking so you can continue working on other projects while transcription operations run in the background.

Automatic Transcription​

Automatic Transcription converts source audio from your loaded media file to timed text using supported AI transcription providers. This feature dramatically accelerates the caption creation process by generating a complete transcript with word-level timing data that can be imported as a subtitle event group or used as the basis for caption spotting workflows.

To use automatic transcription, your project must have media loaded from a compatible source. Local media files and cloud storage sources are fully supported, but streaming sources including HLS manifests, YouTube links, and Vimeo embeds may be restricted based on provider requirements and source accessibility. Before submitting a transcription job, you select your transcription provider, specify the source language, and configure any provider-specific options such as speaker diarization or custom vocabulary support.

Once submitted, transcription jobs are processed by the AI provider and appear in your transcription import dashboard when complete. You can then import the transcription results as a formatted subtitle event group with automatic line breaking and timing, or import the full word-level timing data for use with automatic sync or manual caption spotting workflows.

Automatic Translation​

Automatic Translation generates translated versions of your existing caption content using AI-powered translation services. This feature is available to Pro and Enterprise subscribers and requires online connectivity. Translation operates on the text content of your selected event group and produces a new event group containing the translated text with timing synchronized to your source events.

Before initiating translation, you specify the target language or languages for translation. The system supports multiple simultaneous translations, allowing you to generate several language versions in a single operation. The translation process preserves your original timing structure, formatting attributes, and speaker assignments while replacing the text content with professionally-grade AI translations.

Automatic translation is particularly valuable for international distribution workflows where you need to produce multiple language versions from a single source language master. While AI translation provides excellent results for most content, it is recommended that professionally-translated captions receive review and quality control from native speakers before final delivery.

Shot Change Detection​

Shot Change Detection analyzes your source video to identify edit points, scene transitions, and camera angle changes. This information is used to optimize caption timing by ensuring that captions align with scene boundaries and avoid appearing across shot changes whenever possible.

Shot change detection is available exclusively in the desktop version of Closed Caption Creator and requires that your media is loaded from local storage or cloud storage sources that allow direct file access. Streaming sources are not compatible with shot change detection due to access limitations.

The detection process uses computer vision algorithms to analyze your video frames and identify points where significant visual changes occur. Detected shot changes are marked in your timeline and can be used as reference points when spotting captions or as constraints for automatic timing operations that attempt to avoid placing caption breaks mid-scene.

This feature supports best practices in captioning by ensuring that captions synchronize naturally with the visual editing of your program, which improves viewer comprehension and reduces distraction from the on-screen action.

Audio Rendering for Audio Description​

The audio rendering commands are available when working with audio description event groups and require the Audio Description plugin. Force Render Audio generates text-to-speech audio for selected audio description events using your configured voice provider and voice settings. This operation converts the text content of each event into narrated audio files that are linked to the corresponding events.

Force Render All Audio extends this capability to automatically render every event in your audio description event group in a single operation. This batch rendering is useful when you have completed text editing and are ready to generate the full audio description mix. The system processes each event sequentially and generates audio files with filenames linked to their source events for proper timeline association.

Rendered audio files are stored in your project structure and can be previewed directly in the audio description timeline. The audio file durations are automatically calculated and can be used with the Trim to Duration timing command to ensure events display for the appropriate length based on their narration duration.

Transcription Import​

Transcription Import provides access to the transcription dashboard where you can view the status of submitted transcription jobs and import completed transcripts into your project. The dashboard displays all transcription jobs associated with your account, including jobs that are processing, completed, or failed.

From this interface you can import transcription results as subtitle event groups with automatic formatting applied, or import raw transcription data including word-level timing information for use in advanced timing workflows. The import options allow you to specify maximum character counts, maximum line limits, and other formatting constraints that control how the transcript is converted into caption format.

Translation Import​

Translation Import provides access to the translation dashboard where you can monitor the status of submitted translation jobs and import completed translations into your project. This interface displays all translation jobs you have submitted and allows you to import the translated event groups once processing is complete.

Translation import preserves the timing and structure of your source event group while applying the translated text content. You can configure import options including event group naming, formatting preferences, and metadata handling to ensure that translated versions integrate seamlessly with your project structure.

Availability and Requirements​

Most features in the AI Tools Menu require online connectivity because they rely on cloud-based AI services for processing. Automatic transcription, translation, and audio rendering features are not available in offline mode. Shot change detection operates locally on the desktop version but requires that source media be accessible through the local file system.

Subscription tier restrictions apply to several AI features. Batch transcription and automatic translation are available exclusively to Pro and Enterprise subscribers, while standard automatic transcription is available to all subscription tiers. Audio description rendering requires the Audio Description plugin, which is available as part of Pro and Enterprise plans with the Audio Description add-on.