home

CapsQual User Manual

1. Introduction

1.1 Overview

CapsQual is a specialized transcription workstation designed for converting subtitle files into publishable interview transcripts using GAT2 (Gesprächsanalytisches Transkriptionssystem 2) conventions.

1.2 Key Features

Quick Speaker Assignment: Rapidly assign speakers to subtitle segments using keyboard shortcuts
Basic Editing: Split, merge, and edit transcript segments
GAT2 Symbols: Insert conversation analysis symbols for pauses, breathing, and annotations
Audio Synchronization: Link audio files to transcripts with auto-sync functionality
Export Transcripts: Generate formatted transcripts in HTML or plain text

2. Projects

2.1 Basic Project File Management

Start a new transcription project by selecting File → New Project or pressing Ctrl+N.
Open saved projects with File → Open Project or Ctrl+O. CapsQual uses the .capsqual file format.
Recently opened projects may be opened through the "Open Recent" submenu.
Save your work with File → Save Project (Ctrl+S) or Save Project As to create a new file.

2.2 Project Memos

Add project notes and descriptions via Edit → Project Memo. This information can be included in transcript exports.

3. Importing Subtitle Files

3.1 Supported File Formats

SRT (.srt): Standard subtitle format with timestamps
JSON (.json): Various JSON formats including token-based and segment-based
Text (.txt): Plain text files (one block per line)
TSV (.tsv): Tab-separated values with start/end times and text

3.2 Import Methods

Use File → Import → Subtitles to load subtitle files. When importing audio files, CapsQual automatically searches for matching subtitle files in the same directory and offers to import them.

3.3 JSON Import Options

When importing JSON files with token data, you can choose from three import methods:

One Continuous Block: Import all text as a single segment
Tokens as Separate Blocks: Create individual blocks for each token
Auto-segment: Automatically detect pauses and create segments accordingly

4. Basic Formatting Functions

4.1 Speaker Assignment

Assign speakers using the number keys 1-4 (configurable up to 8 speakers). Unassign with U.

4.2 Navigation

Keyboard Shortcuts:

Next block: N or →
Previous block: P or ←
Jump to unassigned: Click on block in "Unassigned Blocks" list

4.3 Editing Functions

Split Block: Space - Opens split dialog to divide current segment
Merge Blocks: Delete - Combine current block with next block
Edit Content: E - Open text editor for current block
Insert Empty Line: Enter - Add blank line for formatting

4.4 Transcript Symbols

Access the symbols dialog with * or the Symbols button.
Switch between symbols for GAT2, Dresing & Pehl, TiQ and custom symbol tabs using the [Tab] key.
Custom symbols can be defined in the custom symbol tab. These can also be saved and restored as .JSON files.
Custom symbols can also be managed from the menu bar (Edit -> Custom Symbols).

5. Audio Functions

5.1 Importing Audio

Load audio files via File → Import → Audio File. Supported formats: MP3, WAV, OGG, M4A, FLAC.

5.2 Automatic Subtitle Detection

When importing audio, CapsQual automatically searches the directory for matching subtitle files (SRT, JSON, TXT, TSV) and offers to import them.

5.3 Audio Controls

Playback Controls:

Play/Pause: [End] or click ▶ or ⏸ button
Rewind 5s: [PgUp] or click ⏪ button
Fast Forward 5s: [PgDn] or click ⏩ button
Jump to Time: Use the "Jump" button or [Ctrl]+[J] for precise navigation
Play from segment: Press the "Play from segment" button or [Shift]+[Enter] to start playback from the current segment.
Adjust speed: Use the mouse wheel on the knob, the buttons or the + and - keys to adjust playback speed (VLC Player needs to be installed for this feature to work!)

5.4 Additional Audio Features

Auto-sync to Audio: Automatically highlights the current transcript block during playback (Using audio controls [PgUp/PgDwn] instead of scrolling is recommended)
Autopause: Automatically pauses audio when opening editing dialogs
Progress Tracking: Visual progress bar with time display

6. Exporting Transcripts

6.1 Export Formats

Generate transcripts in these formats:

HTML: Formatted with CSS styling
Word document: A publication-ready .DOC file
Plain Text: Simple text format for maximum copy-paste compatibility
Subtitle file: An .SRT file containing time stamps and speaker diarization

6.2 Transcript Conventions

Transcript exports can be customized to better suit one of several transcription conventions:

GAT2-Transcription: More commonly used in conversation analysis; each segment takes up a line, a mono-spaced font is used, speaker-labels and line numbering follow GAT2 guidelines.
Dresing & Pehl: More commonly used for semantic analysis; segments are arranged by speaker turns.
TiQ (Talk in Qualitative Research): Segments are arranged by speaker turns, often used for reconstructive research (especially for group discussions).

Please note that choosing transcription conventions does not alter the transcript content, but only the basic transcript formatting. The choice should be based on whichever format best fits the conventions used during transcription.

6.3 Export Options

Customize your export with these options:

Line-Wrapping: When enabled, line breaks will be implemented in the exported transcript. Enter the amount of characters, after which line breaks should occur. Enabling "Force character-based wrapping" results in line breaks occurring regardless of words, whereas disabling this checkbox results in line breaks after spaces.
Include Timestamps: Add timecodes to transcript (when available); these can follow different formats. For custom timestamp formats, the letter-combinations HH, mm, ss and xx can be used to represent hours, minutes, seconds and tenths of seconds respectively. Any other symbols, letters, numbers, characters etc. can be used alongside these to make up a custom timestamp formula.
Include Diarization (Speaker Labels): Choose whether to include speaker labels (only applicable for subtitle exports)
Add empty line after every speaker turn: An empty line is added after every turn. Empty lines are handled differently for different conventions. For TiQ, empty lines are numbered, whereas for GAT2 and Dresing & Pehl they are not.
Project Title: Include project name as header
Project Memo: Include project description
Audio File Info: Include source audio file name

6.4 Export Preview

Use the preview feature to review your transcript formatting before exporting. The preview shows an approximation of how the final transcript will appear.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly