This Workflow transcribes a video or audio with multiple speakers. After transcription, it allocates the names of the speakers and groups and cleans passages from the two main speakers for further analysis.
Associated Tutorial
This workflows is part of the tutorial Transcribing Audio and Video files with Automated Speech Recognition, available in the GTN
Features
- Includes Galaxy Workflow Tests
- Includes a Galaxy Workflow Report
Thanks to...
Workflow Author(s): Daniela Schneider
Tutorial Author(s): Daniela Schneider
Tutorial Contributor(s): Armin Dadras, Saskia Hiltemann, Daniela Schneider
Funder(s): German Competence Center Cloud Technologies for Data Management and Processing, German Network for Bioinformatics Infrastructure Service, Training, Cooperations & Cloud Computing, Ministry of Science, Research and Arts
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| Video or Audio File | Video or Audio File | Upload the video or audio file you want to transcribe. |
|
Steps
| ID | Name | Description |
|---|---|---|
| 1 | Speech to Text with Diarization | Transcribe your file to create text from the video or audio that you can continue analysing. toolshed.g2.bx.psu.edu/repos/bgruening/whisperx/whisperx/3.4.2+galaxy1 |
| 2 | Allocate Moderator | Rename the auto-given name SPEAKER_03 to Moderator. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
| 3 | Allocate Kennedy | Rename the auto-given name SPEAKER_00 to Kennedy. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
| 4 | Allocate Nixon | Rename the auto-given name SPEAKER_01 to Nixon. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
| 5 | Allocate Journalist | Rename the auto-given name SPEAKER_02 to Fleming, the name of the journalist. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
| 6 | Extract Passages spoken by Nixon | This step selects only those lines spoken by Nixon. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy2 |
| 7 | Extract Passages spoken by Kennedy | This step selects only those lines spoken by Kennedy. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy2 |
| 8 | Delete Speaker Name for Nixon | This step deletes the speaker information to generate a cleaned corpus for further analysis. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
| 9 | Delete Speaker Name for Kennedy | This step deletes the speaker information to generate a cleaned corpus for further analysis. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 |
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| Nixon-transcribed-cleaned | Nixon-transcribed-cleaned | n/a |
|
| Kennedy-transcribed-cleaned | Kennedy-transcribed-cleaned | n/a |
|
Version History
Creators and SubmitterCreators
Not specifiedSubmitter
Views: 26 Downloads: 4 Runs: 3
Created: 13th Apr 2026 at 14:29
AttributionsNone
Visit source
Run on Galaxy
master