|
Text Analysis Info - transcribing |
Last update: 11. February 2008
program: Anvil 4.0
author: Michael Kipp
author: Michael Kipp
distributor: Michael Kipp, DFKI, Germany
documentation: PC manual in PDF format
download: e-mail the author for a free copy
operating system(s): MS-Windows, Mac OS X, Linux and Solaris
description:
Anvil - written in Java - is a free video annotation tool, used at research institutes world-wide (see the Anvil User Web). It offers frame-accurate, hierarchical multi-layered annotation driven by user-defined annotation schemes. The intuitive annotation board shows color-coded elements on multiple tracks in time-alignment. Special features are cross-level links, non-temporal objects and a project tool for managing multiple annotations. Originally developed for Gesture Research, Anvil has also proved suitable for research in Human-Computer Interaction, Linguistics, Ethology, Anthropology, Psychotherapy, Embodied Agents, Computer Animation and many other fields.
Anvil can import data from the widely used, public domain phonetic tools PRAAT and XWaves which allow precise and comfortable speech transcription. Anvil can display waveform and pitch contour. Anvil's data files are XML-based. Special ASCII output can be used for import in statistical toolkits like SPSS. The Anvil system is written in Java and should run on Windows, Macintosh and Unix (Solaris/Linux) computers.
|
CLAN - Children's Language Analyser |
program: CLAN
author: Brian MacWhinney

distributor: CHILDES Project
documentation: CLAN manual and
introduction in Chinese, French, and Spanish
download
MS-Windows: freeware
Mac OS-X: freeware
Unix: freeware
operating system(s): MS-Windows, MacOS X, Unix
description: there are Unicode versions available, QuickTime is necessary, too. You can work with digitized audio or video.
Functionally, however, CLAN has two parts. The first part is the CLAN editor which can be used to edit files in either CHAT or CA (Conversation Analysis) format. The editor also provides a wide range of additional functions, such as audio and video playback, linkage to audio and video, fonts for Roman and non-Roman orthographies, data validation, adding codes to files, and shipping data to other programs. The second part of CLAN is the set of data analysis programs. These programs are run from a separate window called the Commands window. The results of the analytic programs are sent to the CLAN Output window.
program: Express Scribe
author: unknown
distributor: NCH Swift Sound software
documentation:the manual is included in the download file, and there is also a
tutorial
download: free version
operating system: MS-Windows, MacOS-X. Linux (under Wine)
description: Express Scribe is professional audio player software designed to assist the transcription of audio recordings. It is installed on the typist's computer and controlled using the keyboard (with 'hot' keys) and / or can be used with a transcription pedal. This computer transcriber application features variable speed wave playback, foot pedal operation, file management and more.
program: F4 3.0.3
author: unknown
distributor: dresing & pehl GbR
documentation:tutorial
download: free version
operating system: MS-Windows, Media Player 8.0 or better, Direct X 9.0 or better
description: F4 is a free transcription system, working with slower speed and foot pedal is possible.
|
SALT 2008 - Systematic Analysis of Language Transcripts |
program: SALT 2008
operating systems: MS-Windows, MacOS
authors: Robin S. Chapman
and
Jon F. Miller

distributor: Language Lab, University of Wisconsin-Madison
documentation: brochure must be requested
download: test version
description:
The SALT program contains an assortment of standard analyses. Some of the
information from the analyses include:
types of utterances including distribution of imitations, responses to
questions, incomplete, unintelligible, and nonverbal utterances;
calculation of total number of words, type token
ratio, mean length of utterance, and Brown's linguistic stage;
number and length of pauses and rate of speaking;
lists and frequencies of word roots, bound morphemes, and codes;
distributions of utterances by length in terms of words and morphemes;
distribution of speaker turns by length in terms of consecutive utterances;
frequencies for sets of words, including question words, negatives,
conjunctions, modal and semi-auxiliaries, pronouns, and any set of words you
define; and
number and types of mazes (filled pauses, repetitions, revisions).
The values of these variables can be compared with the SALT Reference
Database that contain empirically collected data from Wisconsin Children
of different age groups (3-13 years), gender, sampling context, and transcript
length. Matched records are selected from the database and mean, range, and
standard deviation statistics are given for many of the analysis variables.
program: TAMS 3.41b4 - Text Analysis Markup System
author: Matthew Weinstein
documentation: manual
download: free binaries and sources
operating system: MacOS X
description:
TAMS Analyzer is a program that lets you select passages of a text and just code them by double clicking the name of the code on a list. It then allows you to extract and save coded information. TAMS command line program lets you extract marked up text files for analysis by a database or spreadsheet program. Both are released under GPL
Multimedia support turns TAMS into a digital transcription machine.
program: Transana 2.21
author: Transana was originally created by Chris Fassnacht. It is now developed and maintained by David K. Woods at the Wisconsin Center for Education Research, University of Wisconsin-Madison, USA.
documentation: online help
download: free binaries and sources
operating system: MS-Windows, MacOS X
description:
Transana is an open source project and can be used to
- Identify and easily access the analytically significant portions of their video data.
- Organize video clips (from the same or from different video files) into meaningful categories, as a mechanism for developing and expanding the theoretical understanding of what the video shows.
- Apply searchable analytic keywords to these video clips.
- Engage in complex data mining and hypothesis testing across large video collections.
- Share analytic markup with distant colleagues to facilitate collaborative analysis.
program: Transcriber 1.5.1
authors: Mathieu Manta, Fabien Antoine, Sylvain Galliano, Claude Barras, and many others
documentation: reference manual (in English, in French it is planned)
download: binaries and sources
operating system: MacOS, Linux/x86, Unix (Sun Solaris, SGI IRIX), MS-Windows
description:
Transcriber is a tool for assisting the manual annotation of speech signals. It provides a
user-friendly graphical user interface for segmenting long duration speech recordings, transcribing them, and labeling speech turns, topic changes and acoustic conditions. It is more specifically designed for the annotation of broadcast news recordings, for creating corpora used in the development of automatic broadcast news transcription systems, but its features might be found useful in other areas of speech research.
Transcriber is developed with the scripting language Tcl/Tk and C extensions. It relies on the Snack sound extension, which allows support for most common audio formats, and on the tcLex lexer generator. It has been tested on various Unix systems (Linux, Sun Solaris, Silicon Graphics) and Windows NT. Transcriber is distributed as free software under GNU General Public License.