Transcription - Speech & Technology

There are many different specialized tools for manual digital transcription. While it is not neccesary to use specialized tools (one can also use Microsoft Word or Google Docs), knowing your options can make transcribing much less of a hassle and expand the possibilities of research using your transcriptions.

One example of an advantage of transcription software is that it offers "playing" with "typing" and that the resulting transcription is time-aligned. i.e. the start and end time of each text-fragment (a word, a couple of words, a phrase or even a paragraph) is known. This time-alignment makes it possible to search for spoken words and to generate subtitles.

Transcriptions made with an ordinary text editor (Notepad, Word, etc.) lack this time-alignment and the result is just text. Combining this text with forced alignment however will result in the same time-aligned transcriptions as with dedicated transcription software, which will be explored on our page on post-transcription.

Granularity
The "time resolution" of the transcription software depends on the human editor who selects short fragments (words or even phonemes) or rather long fragments (paragraph). Another often used method for the time-alignment is to place time-stamps in a fixed interval (e.g. each 30 sec or each 5 minutes).

Once the transcription is made (with a text editor with or without time-stamps or dedicated transcription software on an utterance level) a final foreced alignment will result in a more precise determination of the start- and end-times of each word and, if desired, the start- and end-times of the spoken phonemes.

For Oral Historians, time-aligment on the utterance level will be "enough", but modern technolgy makes it extremely simple to automatically add a higher granularity on the time-aligned transcriptions.

Tools

Here, I will list three tools that can be used especially for transcription. The first is a transcription-centered text editor for plain transcripts, the second is a tool useful for manually transcribing on a sentence-by-sentence basis, and the third goes even deeper, making it possible to transcribe on a word-by word or phoneme basis.

oTranscribe

Simple transcription - Lightweight - Easy-to-use

Free - Webbased

https://otranscribe.com/

oTranscribe is a simple, lightweight website which eases the process of manual transcription. It combines a straight-foward text editor á la Google Docs with a simple media player that uses keyboard shortcuts for play/pause/forward/backward/etcetera. You can load your own audio and video files, or use a video from YouTube. The online tool does not store data on their own servers, but uses your own browser storage. This is good for privacy, but bad for reliability: back-up your transcription every 10 minutes or so!

Subtitle Edit

ELAN

20 May

Joint COLING and LREC conference

20-05-2024 - 25-05-2024

The three-day main conference (22-23-24, May) will be accompanied by a total of three days of workshops and tutorials (20-21-25, May) held in the days immediately before and after.

Two major international key players in the area of computational linguistics, the ELRA Language Resources Association (ELRA) and the International Committee on Computational Linguistics (ICCL), are joining forces to organize the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) to be held in Torino (Italy) on 20-25 May, 2024.

The hybrid conference will bring together researchers and practitioners in computational linguistics, speech, multimodality, and natural language processing, with special attention to evaluation and the development of resources that support work in these areas. Following in the tradition of the well-established parent conferences COLING and LREC, the joint conference will feature grand challenges and provide ample opportunity for attendees to exchange information and ideas through both oral presentations and extensive poster sessions, complemented by a friendly social program.

31 Aug

The Young Female* Researchers in Speech Workshop

31-08-2024

YFRSW 2024 The Young Female* Researchers in Speech Workshop (YFRSW) is a workshop for female* Bachelor’s and Master’s students currently working in speech science and technology. The workshop aims to promote interest in research in our field among women* who have not yet committed to pursuing a PhD in speech science or technology, but who have already gained research experience at their universities through individual or group projects.

The workshop will be held prior to Interspeech 2024 on Saturday, August 31st, 2024. The event will take place in Greece. The workshop will feature panel discussions with PhD students and senior researchers in the field, student poster presentations, and a mentoring session. Student poster presentations should give an overview of a current or planned research project in which the student is involved, with an emphasis on promoting discussion.

For more information, see: https://sites.google.com/view/yfrsw-2024/abstract-submission