Text Analysis Info - transcribing

Last update: 18. August 2008


Anvil 4.0
program:
Anvil 4.0
author: Michael Kipp
distributor: Michael Kipp, DFKI, Germany
documentation: PC manual in PDF format
download: e-mail the author for a free copy
operating system(s): MS-Windows, Mac OS X, Linux and Solaris
description:
Anvil - written in Java - is a free video annotation tool, used at research institutes world-wide (see the Anvil User Web). It offers frame-accurate, hierarchical multi-layered annotation driven by user-defined annotation schemes. The intuitive annotation board shows color-coded elements on multiple tracks in time-alignment. Special features are cross-level links, non-temporal objects and a project tool for managing multiple annotations. Originally developed for Gesture Research, Anvil has also proved suitable for research in Human-Computer Interaction, Linguistics, Ethology, Anthropology, Psychotherapy, Embodied Agents, Computer Animation and many other fields.

Anvil can import data from the widely used, public domain phonetic tools PRAAT and XWaves, which allow precise and convenient speech transcription. Anvil can display waveform and pitch contour. Anvil's data files are XML-based. Special ASCII output can be used for import into statistical toolkits like SPSS. The Anvil system is written in Java and should run on Windows, Macintosh and Unix (Solaris/Linux) computers.


CLAN - Computerized Language Analysis
program:
CLAN
author: Brian MacWhinney
distributor: CHILDES Project
documentation: CLAN manual and introduction in Chinese, French, and Spanish
download: freeware for MS-Windows, Mac OS X, and Unix
operating system(s): MS-Windows, MacOS X, Unix
description: Unicode versions are available, and QuickTime is also required. You can work with digitized audio or video.

Functionally, CLAN has two parts. The first part is the CLAN editor, which can be used to edit files in either CHAT or CA (Conversation Analysis) format. The editor also provides a wide range of additional functions, such as audio and video playback, linkage to audio and video, fonts for Roman and non-Roman orthographies, data validation, adding codes to files, and shipping data to other programs. The second part of CLAN is the set of data analysis programs. These programs are run from a separate window called the Commands window. The results of the analytic programs are sent to the CLAN Output window.


Express Scribe
program:
Express Scribe
author: unknown
distributor: NCH Swift Sound software
documentation: the manual is included in the download file, and there is also a tutorial
download: free version
operating system: MS-Windows, MacOS-X, Linux (under Wine)
description: Express Scribe is professional audio player software designed to assist the transcription of audio recordings. It is installed on the typist's computer and controlled using the keyboard (with 'hot' keys) and/or a transcription pedal. This computer transcriber application features variable-speed wave playback, foot pedal operation, file management and more.


F4 3.0.3
program:
F4 3.0.3
author: unknown
distributor: dresing & pehl GbR
documentation: tutorial
download: free version
operating system: MS-Windows (requires Media Player 8.0 or later and DirectX 9.0 or later)
description: F4 is a free transcription system. It supports slowed-down playback and can be operated with a foot pedal.


SALT 2008 - Systematic Analysis of Language Transcripts
program:
SALT 2008
operating systems: MS-Windows, MacOS
authors: Robin S. Chapman and Jon F. Miller
distributor: Language Lab, University of Wisconsin-Madison
documentation: brochure must be requested
download: test version
description:
The SALT program contains an assortment of standard analyses. The information they provide includes:

  • types of utterances including distribution of imitations, responses to questions, incomplete, unintelligible, and nonverbal utterances;
  • calculation of total number of words, type-token ratio, mean length of utterance, and Brown's linguistic stage;
  • number and length of pauses and rate of speaking;
  • lists and frequencies of word roots, bound morphemes, and codes;
  • distributions of utterances by length in terms of words and morphemes;
  • distribution of speaker turns by length in terms of consecutive utterances;
  • frequencies for sets of words, including question words, negatives, conjunctions, modal and semi-auxiliaries, pronouns, and any set of words you define; and
  • number and types of mazes (filled pauses, repetitions, revisions).

The values of these variables can be compared with the SALT Reference Database, which contains empirically collected data from Wisconsin children of different age groups (3-13 years), gender, sampling context, and transcript length. Matched records are selected from the database, and mean, range, and standard deviation statistics are given for many of the analysis variables.
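
Two of the measures listed above, type-token ratio and mean length of utterance, are simple ratios. The following Python sketch illustrates the arithmetic only; it is not SALT's implementation. It assumes utterances are plain strings, counts words by splitting on whitespace (SALT can also count morphemes), and uses hypothetical function names and sample data.

```python
def type_token_ratio(utterances):
    """Distinct words (types) divided by total words (tokens)."""
    tokens = [word.lower() for u in utterances for word in u.split()]
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def mean_length_of_utterance(utterances):
    """Average number of words per utterance (word-based, not morpheme-based)."""
    lengths = [len(u.split()) for u in utterances]
    return sum(lengths) / len(lengths) if lengths else 0.0


# Hypothetical sample, not taken from any SALT transcript.
sample = ["the dog runs", "the dog sleeps", "I see the dog"]

ttr = type_token_ratio(sample)          # 6 types / 10 tokens
mlu = mean_length_of_utterance(sample)  # 10 words / 3 utterances

print(f"type-token ratio: {ttr:.2f}")
print(f"mean length of utterance: {mlu:.2f}")
```

For the sample, six distinct words occur among ten tokens, and the three utterances contain ten words in total.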


    TAMS 3.44b4
    program:
    TAMS 3.44b4 - Text Analysis Markup System
    author: Matthew Weinstein
    documentation: manual
    download: free binaries and sources
    operating system: MacOS X
    description:
    TAMS Analyzer is a program that lets you select passages of a text and code them by double-clicking the name of the code in a list. It then allows you to extract and save coded information. The TAMS command line program lets you extract marked-up text files for analysis by a database or spreadsheet program. Both are released under the GPL. Multimedia support turns TAMS into a digital transcription machine.



    Transana 2.22
    program:
    Transana 2.22
    author: Transana was originally created by Chris Fassnacht. It is now developed and maintained by David K. Woods at the Wisconsin Center for Education Research, University of Wisconsin-Madison, USA.
    documentation: online help
    download: free binaries and sources of version 1.5.1
    operating system: MS-Windows, MacOS X
    description:
    Transana is an open source project and can be used to



    Transcriber 1.5.2
    program:
    Transcriber 1.5.2
    authors: Karim Boudahmane, Mathieu Manta, Fabien Antoine, Sylvain Galliano, Claude Barras, and many others
    documentation: reference manual (in English; a French version is planned)
    download: binaries and sources
    operating system: MacOS, Linux/x86, Unix (Sun Solaris, SGI IRIX), MS-Windows
    description:
    Transcriber is a tool for assisting the manual annotation of speech signals. It provides a user-friendly graphical user interface for segmenting long duration speech recordings, transcribing them, and labeling speech turns, topic changes and acoustic conditions. It is more specifically designed for the annotation of broadcast news recordings, for creating corpora used in the development of automatic broadcast news transcription systems, but its features might be found useful in other areas of speech research.

    Transcriber is developed with the scripting language Tcl/Tk and C extensions. It relies on the Snack sound extension, which allows support for most common audio formats, and on the tcLex lexer generator. It has been tested on various Unix systems (Linux, Sun Solaris, Silicon Graphics) and Windows NT. Transcriber is distributed as free software under the GNU General Public License.

