Sound Indexing: février 2007

Bibliographic reference

MAKHOUL, John et al / Speech and language technologies for audio indexing and retrieval. Proceedings of the IEEE, Vol. 88, N° 8. AUGUST 2000. URL: http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf

Text

URL: http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf

Dublin Core Metadata:

Title: Speech and language technologies for audio indexing and retrieval
Creator: John MAKHOUL
Subject: Audio indexing, information extraction, information retrieval, speech recognition, segmentation, classification.
Description: This paper explain how to extract audio information. It proposes figures of the Rough'n Ready System witch it is possible to indexing and retrieval sounds. It also talks about the speech recognition and the segmentation.
Contributor: -
Date: August 2000
Type: Pdf
Identifier: URL: http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf
Source: Proceedings of the IEEE
Language: En
Coverage: World
Rights: IEEE

Bibliographic reference:

AJMERA, Jitendra. McCOWAN, Iain. BOURLARD, Herve / An Online Audio Indexing System. IDIAP, Switzerland. 2002. URL: ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf

Text:

Sumary: "This paper presents overview of an online audio indexing system which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives continuously, the system first finds boundaries of the acoustically homogenous segments. Next, each of these segments is classified as speech, music or \it mixture classes, where mixtures are defined as regions where speech and other non-speech sounds are present simultaneously and noticeably. The speech segments are then clustered together to provide consistent speaker labels. The speech and mixture segments are converted to text via an ASR system. The resulting words are time-stamped together with other metadata information (speaker identity, speech confidence score) in an XML file to rapidly identify and access target segments. In this paper, we analyze the performance at each stage of this audio indexing system and also compare it with the performance of the corresponding offline modules."

URL: ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf

Dublin Core Metadata:

Title: An Online Audio Indexing System
Creator: AJMERA Jitendra, McCOWAN Iain, BOURLARD Herve
Subject: audio files, audio indexing, automatic speech recognition, speaker clustering.
Description: "This paper presents an overview of an online audio indexing systemwhitch creates a searchable index of speech content embedded in digitized audio files."
Contributor: -
Date: 2004
Type: Paper
Format: Pdf
Identifier:URL: ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf
Source: IDIAP Research Institute
Language: En
Coverage: World
Rights: IDIAP

Sound Indexing

Speech and language technologies for audio indexing and retrieval

An Online Audio Indexing System

Subject Index

Who am I ?

Some links

Blog Archives