<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4202417623550891385</id><updated>2011-04-21T18:08:02.957-07:00</updated><category term='Semantic Hifi'/><category term='extraction'/><category term='processing'/><category term='speech recognition'/><category term='speaker clustering'/><category term='audio files'/><category term='musical indexing classification'/><category term='audio documents'/><category term='sound indexing'/><category term='retrieval'/><category term='audio indexing'/><category term='sound information'/><category term='recognition'/><category term='musical signal'/><category term='segmentation'/><title type='text'>Sound Indexing</title><subtitle type='html'>Welcome !
This blog is produced for my studies in Gestion de l'Information et du Document dans les Organisations (GIDO), at the IUT Michel de Montaigne (Bordeaux, France).
Here you will find documents concerning  sound indexing and an English/French glossary on this theme.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>7</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-2916210186374817464</id><published>2007-05-03T06:44:00.000-07:00</published><updated>2007-05-03T07:29:00.758-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Semantic Hifi'/><category scheme='http://www.blogger.com/atom/ns#' term='musical indexing classification'/><title type='text'>Semantic Hifi</title><content type='html'>&lt;span style="font-weight: bold; color: rgb(204, 204, 204);font-size:130%;" &gt;&lt;span style="color: rgb(102, 102, 102);"&gt;Bibliographic reference:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;IRCAM. Semantic Hifi. paper. IRCAM: Information society technologies.&lt;br /&gt;URL: &lt;a href="http://shf.ircam.fr/?L=1"&gt;http://shf.ircam.fr/?L=1&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(102, 102, 102); font-weight: bold;font-size:130%;" &gt;Text:&lt;/span&gt;&lt;br /&gt;&lt;h2 style="text-align: justify; font-weight: bold;"&gt;&lt;span style="font-size:100%;"&gt;Objectives&lt;/span&gt;&lt;/h2&gt;&lt;p style="text-align: justify;"&gt;In the context of large-scale digital music distribution, the goal of the project is to develop a new generation of HIFI systems, offering new functionality for browsing, interacting, rendering, personalizing and editing musical material. &lt;/p&gt;&lt;div style="text-align: justify;"&gt; &lt;/div&gt;&lt;p style="text-align: justify;"&gt;This next generation of hard-disk based HIFI systems will drastically change the home users’ relationship to music and multimedia content. They will be able to interact with music, blurring the traditional limits between playing, performing and remixing. These HIFI systems will be as much open instruments as listening stations. &lt;/p&gt;&lt;div style="text-align: justify;"&gt;&lt;a name="1675"&gt;&lt;/a&gt;&lt;/div&gt;&lt;h2 style="font-weight: bold; text-align: justify;"&gt;&lt;span style="font-size:100%;"&gt;Main functions&lt;/span&gt;&lt;/h2&gt;&lt;ul style="text-align: justify;"&gt;&lt;li&gt;Personalized classification and content-based management of music pieces; query by humming, automated playlist generation specified by global and content-based criteria, automatic production of musical summaries; &lt;/li&gt;&lt;li&gt;Browsing within musical pieces through the analysis of their content: temporal maps, browsing by lyrics, advanced variable speed playback, navigation within the orchestral polyphony with spatial audio rendering; &lt;/li&gt;&lt;li&gt;Personalized editing and composition tools, DJ application; &lt;/li&gt;&lt;li&gt;Instrumental and vocal tools and automatic accompaniment; &lt;/li&gt;&lt;li&gt;Sharing of the indexing, composition and performance work through P2P networks.&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold; color: rgb(153, 153, 153);font-size:130%;" &gt;Dublin Core Metadata: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Title&lt;/span&gt;: Semantic Hifi&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creator&lt;/span&gt;: IRCAM&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Subject&lt;/span&gt;: Hifi system / classification / indexing / music / summary.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Description&lt;/span&gt;: In the context of large-scale digital music distribution, the goal of the project is to develop a new generation of HIFI systems, offering new functionality for browsing, interacting, rendering, personalizing and editing musical material.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Contributor&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Date&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Type&lt;/span&gt;: Paper&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Identifier&lt;/span&gt;: &lt;a href="http://shf.ircam.fr/?L=1"&gt;http://shf.ircam.fr/?L=1&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Source&lt;/span&gt;: IRCAM Information society technologies&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Language&lt;/span&gt;: en&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Coverage&lt;/span&gt;: World&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Rights&lt;/span&gt;: -&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-2916210186374817464?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/2916210186374817464/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=2916210186374817464' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/2916210186374817464'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/2916210186374817464'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/05/semantic-hifi.html' title='Semantic Hifi'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-3546939921449023337</id><published>2007-04-26T06:21:00.000-07:00</published><updated>2007-05-03T06:36:38.771-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='segmentation'/><category scheme='http://www.blogger.com/atom/ns#' term='sound indexing'/><category scheme='http://www.blogger.com/atom/ns#' term='musical signal'/><title type='text'>Segmentation and indexing of sounds</title><content type='html'>&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:85%;" &gt;&lt;span style="color: rgb(51, 51, 51);"&gt;This document is a thesis &lt;/span&gt;&lt;/span&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:85%;" &gt;&lt;span style="color: rgb(51, 51, 51);"&gt;on the signal processing&lt;/span&gt;&lt;/span&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:85%;" &gt;&lt;span style="color: rgb(51, 51, 51);"&gt; realised by a french student . The thesis is in french but you can find an english abstract. &lt;/span&gt;&lt;/span&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;&lt;br /&gt;&lt;br /&gt;Bibliographic reference:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;ROSSIGNOL, Stéphane. Segmentation et indexation des signaux sonores musicaux. July 2000, thesis, Thèse de doctorat, Université Paris VI.&lt;br /&gt;URL: &lt;a href="http://stephanerossignol.ifrance.com/"&gt;http://stephanerossignol.ifrance.com/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Abstract:&lt;/span&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: justify;"&gt;"I defended my PhD thesis in Signal Processing in July 2000 at the University of Jussieu -- Paris VI, Paris; IRCAM -- Centre Georges Pompidou, Paris; and Supélec (engineer school), Metz (1996-2000). This work was supported by France Télécom Rennes. It deals with the Segmentation and the Indexing of Acoustic Musical Signals. Below is a summary of my PhD thesis.  &lt;/div&gt;&lt;p style="text-align: justify;"&gt; This work deals with temporal segmentation and indexation of  musical signals. Three interdependent schemes of segmentation  are defined, which correspond to different levels of signal  attributes.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt;  1) The first scheme, named "source" scheme, concerns mainly  the distinction between speech and music on movie sound tracks  and on radio broadcasts.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; Features have been examined: they intend to measure distinct  properties of speech and music. They are combined into several  multidimensional classification frameworks. The performance  of the system is discussed.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; 2) The second scheme, named "feature" scheme, refers to labels  such as: silence/sound, voiced/unvoiced, harmonic/inharmonic,  monophonic/polyphonic, with vibrato/without vibrato. Most of  these characteristics are features used by the third scheme.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; Vibrato detection, vibrato parameter (its frequency and its  magnitude) estimation, and vibrato extraction from the fundamental frequency  trajectory has been particularly studied. Several techniques  are described. The performance of the system is discussed.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; The vibrato is extracted from the fundamental frequency trajectory  to obtain a no-vibrato melodic evolution. This "flat" fundamental frequency  is useful for segmentation of musical excerpts into notes (third  scheme), and can also be used for sound modification or processing.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; The vibrato detection is operated only when music is identified  on the first scheme.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; 3) The third scheme leads to segmentation into "notes or into  phones or more generally into stable sounds", according to the nature  of the sound: instrumental part, singing voice excerpt, speech,  percussive part...  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; The analysis is composed of four steps. The first step is to  extract a large set of features. A feature will be all the more appropriate  as its time evolution presents strong and short peaks when transitions  occur, and as its variance and its mean remain at very low levels when describing a steady state part. Three kinds of transitions exist: fundamental frequency transients, energy transients and frequency  content transients. Secondly, each of these features is automatically  thresholded. Thirdly, a final decision function based on the  set of the thresholded features has been built and provides  the segmentation marks. Lastly, for monophonic and harmonic  sounds, the automatic transcription is done. The performance  of the system is discussed.  &lt;/p&gt;&lt;p style="text-align: justify;"&gt; The data obtained in a given scheme are propagated from lower  numbered to higher numbered schemes in order to improve their  performance."&lt;/p&gt;&lt;p style="color: rgb(153, 153, 153);" align="justify"&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-size:130%;" &gt;Dublin Core Metadata:&lt;/span&gt;&lt;/p&gt;&lt;span style="font-weight: bold;"&gt;Title&lt;/span&gt;: Segmentation and indexing of sounds&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creator&lt;/span&gt;: Stéphane ROSSIGNOL&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Subject&lt;/span&gt;: Signal processing, segmentation, indexing, sounds, musical signals.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Description&lt;/span&gt;:  "This work deals with temporal segmentation and indexation of  musical signals. Three interdependent schemes of segmentation  are defined, which correspond to different levels of signal  attributes."&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Contributor&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Date&lt;/span&gt;: july 2000&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Type&lt;/span&gt;: thesis (summaryin english)&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Identifier&lt;/span&gt;: &lt;a href="http://stephanerossignol.ifrance.com/"&gt;http://stephanerossignol.ifrance.com/&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Source&lt;/span&gt;: Web site of Stéphane Rossignol&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Language&lt;/span&gt;: Fr&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Coverage&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Rights&lt;/span&gt;: -&lt;br /&gt;&lt;br /&gt;&lt;a name="partsegen"&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-3546939921449023337?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/3546939921449023337/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=3546939921449023337' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/3546939921449023337'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/3546939921449023337'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/04/segmentation-and-indexing-of-sounds.html' title='Segmentation and indexing of sounds'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-5378818253534071078</id><published>2007-03-22T08:25:00.000-07:00</published><updated>2007-05-03T06:37:03.453-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='recognition'/><category scheme='http://www.blogger.com/atom/ns#' term='speech recognition'/><category scheme='http://www.blogger.com/atom/ns#' term='audio files'/><category scheme='http://www.blogger.com/atom/ns#' term='audio indexing'/><title type='text'>Indexing Sound Files on Search Engines that Can’t Hear Them</title><content type='html'>&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Référence bibliographique:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;SLAWSKI, Bill. Indexing Sound Files on Search Engines that Can’t Hear Them. Creative Flow, May 27th, 2004. URL: &lt;a href="http://blog.cre8asite.net/archives/125"&gt;http://blog.cre8asite.net/archives/125&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Text:&lt;/span&gt;&lt;br /&gt;&lt;p style="text-align: justify;"&gt;"What do you do if most of the content your company creates is in audio or video format, and you want to include it on the web, and make it so that people can find it? And the content is news, which relies upon timely delivery? &lt;/p&gt;&lt;div style="text-align: justify;"&gt; &lt;/div&gt;&lt;p style="text-align: justify;"&gt;In the case of National Public Radio (NPR), the answer was to put the audio files online, and also offer transcriptions of them. Since NPR started doing that&lt;span style="color: rgb(0, 0, 0);"&gt; &lt;/span&gt;&lt;a style="color: rgb(0, 0, 0);" href="http://asia.cnet.com/newstech/personaltech/0,39001147,39181083,00.htm"&gt;a few weeks ago&lt;/a&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;,&lt;/span&gt;  they’ve noticed a substantial increase in traffic to their site for topical subjects from the search engines.&lt;/p&gt;&lt;div style="text-align: justify;"&gt; &lt;/div&gt;&lt;p style="text-align: justify;"&gt;One of the things I like about this practice is that people with hearing disabilities are now able to access stories that they couldn’t hear on the radio. What a great result.&lt;/p&gt;&lt;div style="text-align: justify;"&gt; &lt;/div&gt;&lt;p style="text-align: justify;"&gt;The transcription is presently done by speech recognition technology to get stories online very quickly after they are broadcast on the radio. It’s likely that humans will take over from the software presently used, which sometimes garbles results.&lt;/p&gt;&lt;div style="text-align: justify;"&gt; &lt;/div&gt;&lt;p style="text-align: justify;"&gt;If at some point, the search engines become capable of indexing audio, I hope that sites providing transcripts continue to do so. It’s great to see such a great improvement in accessibility, even if it is done inadvertently."&lt;/p&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Dublin Core Metadata:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Title:&lt;/span&gt; Indexing Sound Files on Search Engines that Can’t Hear Them&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creator:&lt;/span&gt; Bill SLAWSKI&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Subject: &lt;/span&gt;Audio file / audio format / video format / indexing audio / National public radio / recognition / speech recognition.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Description:&lt;/span&gt; "What do you do if most of the content your company creates is in audio or video format, and you want to include it on the web, and make it so that people can find it? And the content is news, which relies upon timely delivery?"&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Contributor:&lt;/span&gt; -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Date:&lt;/span&gt; 2004/05/27&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Type:&lt;/span&gt; Article&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Identifier:&lt;/span&gt; &lt;a href="http://blog.cre8asite.net/archives/125"&gt;http://blog.cre8asite.net/archives/125&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Source:&lt;/span&gt; Creative Flow&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Language:&lt;/span&gt; En&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Coverage: &lt;/span&gt;World&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Rights:&lt;/span&gt; -&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-5378818253534071078?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/5378818253534071078/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=5378818253534071078' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5378818253534071078'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5378818253534071078'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/03/indexing-sound-files-on-search-engines.html' title='Indexing Sound Files on Search Engines that Can’t Hear Them'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-5307187772759147403</id><published>2007-03-15T07:24:00.000-07:00</published><updated>2007-05-03T06:37:29.594-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='audio documents'/><category scheme='http://www.blogger.com/atom/ns#' term='sound information'/><category scheme='http://www.blogger.com/atom/ns#' term='processing'/><title type='text'>Audio and Sound Information</title><content type='html'>&lt;div style="text-align: justify;"&gt;This site was created by an old student of &lt;span style="font-weight: bold;"&gt;GIDO&lt;/span&gt; (Gestion de l'Information et du Document dans les Organisations) at the &lt;span style="font-weight: bold;"&gt;IUT Michel de Montaigne&lt;/span&gt; (France, Bordeaux) in 2006.&lt;br /&gt;It gives information on the &lt;span style="font-weight: bold;"&gt;processing of audio documents&lt;/span&gt; thanks to web sites links. It is composed to two parts, the first on the sound information and the second on the processing on the audio documents.&lt;br /&gt;It contents nine links, you can visited the &lt;span style="font-style: italic;"&gt;Assocation for recorded sound collections (ARSC)&lt;/span&gt;, the &lt;span style="font-style: italic;"&gt;Association des détenteurs de documents audiovisuels et sonores (AFAS)&lt;/span&gt;, or the &lt;span style="font-style: italic;"&gt;International Association of Sound and Audiovisual Archives (IASA)&lt;/span&gt; for example.&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;Clic on the link to visit the site &lt;span style="font-style: italic;"&gt;Audio and Sound Information&lt;/span&gt;: &lt;a href="http://www.iut.u-bordeaux3.fr/doc/sitos2006/Info%20son/Index.htm"&gt;http://www.iut.u-bordeaux3.fr/doc/sitos2006/Info%20son/Index.htm&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-5307187772759147403?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/5307187772759147403/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=5307187772759147403' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5307187772759147403'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5307187772759147403'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/03/audio-and-sound-information.html' title='Audio and Sound Information'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-650002000199499410</id><published>2007-02-08T07:13:00.000-08:00</published><updated>2007-05-03T06:37:47.547-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='segmentation'/><category scheme='http://www.blogger.com/atom/ns#' term='recognition'/><category scheme='http://www.blogger.com/atom/ns#' term='retrieval'/><category scheme='http://www.blogger.com/atom/ns#' term='extraction'/><category scheme='http://www.blogger.com/atom/ns#' term='audio indexing'/><title type='text'>Speech and language technologies for audio indexing and retrieval</title><content type='html'>&lt;span style="font-weight: bold; color: rgb(153, 153, 153);font-size:130%;" &gt;Bibliographic reference&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;MAKHOUL, John et al / Speech and language technologies for audio indexing and retrieval. Proceedings of the IEEE, Vol. 88, N° 8. AUGUST 2000. URL: &lt;a href="http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf"&gt;http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Text&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;URL: &lt;a href="http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf"&gt;http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; color: rgb(153, 153, 153);font-size:130%;" &gt;Dublin Core Metadata:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Title&lt;/span&gt;: Speech and language technologies for audio indexing and retrieval&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creator&lt;/span&gt;: John MAKHOUL&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Subject&lt;/span&gt;: Audio indexing, information extraction, information retrieval, speech recognition, segmentation, classification.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Description&lt;/span&gt;: This paper explain how to extract audio information. It proposes figures of the Rough'n Ready System witch it is possible to indexing and retrieval sounds. It also talks about the speech recognition and the segmentation.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Contributor&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Date&lt;/span&gt;: August 2000&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Type&lt;/span&gt;: Pdf&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Identifier&lt;/span&gt;: URL: &lt;a href="http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf"&gt;http://www.bbn.com/docs/whitepapers/Audio-Indexing-Retrieval.pdf&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Source: &lt;/span&gt;Proceedings of the IEEE&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Language&lt;/span&gt;: En&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Coverage&lt;/span&gt;: World&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Rights&lt;/span&gt;: IEEE&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-650002000199499410?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/650002000199499410/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=650002000199499410' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/650002000199499410'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/650002000199499410'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/02/speech-and-language-technologies-for.html' title='Speech and language technologies for audio indexing and retrieval'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-1825524404382278324</id><published>2007-02-01T06:58:00.000-08:00</published><updated>2007-05-03T06:38:02.205-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='speaker clustering'/><category scheme='http://www.blogger.com/atom/ns#' term='speech recognition'/><category scheme='http://www.blogger.com/atom/ns#' term='audio files'/><category scheme='http://www.blogger.com/atom/ns#' term='audio indexing'/><title type='text'>An Online Audio Indexing System</title><content type='html'>&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Bibliographic reference:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;AJMERA, Jitendra. McCOWAN, Iain. BOURLARD, Herve / An Online Audio Indexing System. IDIAP, Switzerland. 2002. URL: &lt;a href="ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf"&gt;ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf&lt;br /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Text:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: justify;"&gt;&lt;span style="font-weight: bold;"&gt;Sumary&lt;/span&gt;: "This paper presents overview of an online audio indexing system which creates a searchable index of speech content embedded in digitized audio files. This system is based on our recently proposed offline audio segmentation techniques. As the data arrives continuously, the system first finds boundaries of the acoustically homogenous segments. Next, each of these segments is classified as speech, music or \it mixture classes, where mixtures are defined as regions where speech and other non-speech sounds are present simultaneously and noticeably. The speech segments are then clustered together to provide consistent speaker labels. The speech and mixture segments are converted to text via an ASR system. The resulting words are time-stamped together with other metadata information (speaker identity, speech confidence score) in an XML file to rapidly identify and access target segments. In this paper, we analyze the performance at each stage of this audio indexing system and also compare it with the performance of the corresponding offline modules."&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;URL&lt;/span&gt;:&lt;a href="ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf"&gt; ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(153, 153, 153); font-weight: bold;font-size:130%;" &gt;Dublin Core Metadata:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Title&lt;/span&gt;: An Online Audio Indexing System&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Creator&lt;/span&gt;: AJMERA Jitendra, McCOWAN Iain, BOURLARD Herve&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Subject&lt;/span&gt;: audio files, audio indexing, automatic speech recognition, speaker clustering.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Description&lt;/span&gt;: "This paper presents an overview of an online audio indexing systemwhitch creates a searchable index of speech content embedded in digitized audio files."&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Contributor&lt;/span&gt;: -&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Date&lt;/span&gt;: 2004&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Type&lt;/span&gt;: Paper&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Format&lt;/span&gt;: Pdf&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Identifier&lt;/span&gt;:URL: &lt;a href="ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf"&gt;ftp://ftp.idiap.ch/pub/reports/2003/rr03-39b.pdf&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Source:&lt;/span&gt; IDIAP Research Institute&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Language&lt;/span&gt;: En&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Coverage&lt;/span&gt;: World&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Rights&lt;/span&gt;: IDIAP&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-1825524404382278324?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/1825524404382278324/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=1825524404382278324' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/1825524404382278324'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/1825524404382278324'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/02/online-audio-indexing-system.html' title='An Online Audio Indexing System'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4202417623550891385.post-5313877315929018066</id><published>2007-01-28T10:04:00.000-08:00</published><updated>2007-05-03T06:38:26.561-07:00</updated><title type='text'>Glossary English / French</title><content type='html'>&lt;p  style="font-weight: bold; text-align: justify;font-family:times new roman;" class="MsoNormal"&gt;&lt;span style="font-size:100%;"&gt;&lt;span style="" lang="EN-GB"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;div style="text-align: justify;"&gt;&lt;span style="font-weight: bold;"&gt;Automatic sound indexing&lt;/span&gt;&lt;br /&gt;“These methods allow the system to automatically organize new sounds introduced by the user, by analyzing their content in relation to predefined categories."&lt;br /&gt;Name: IRCAM Centre Pompidou.&lt;br /&gt;URL: &lt;a href="http://www.ircam.fr/307.html?&amp;L=1&amp;amp;tx_ircam_pi4%5BshowUid%5D=15&amp;cH"&gt;http://www.ircam.fr/307.html?&amp;amp;L=1&amp;tx_ircam_pi4[showUid]=15&amp;amp;cH&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.ircam.fr/307.html?&amp;L=1&amp;amp;tx_ircam_pi4%5BshowUid%5D=15&amp;cH"&gt;ash=72de9812b5&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Indexation automatique des sons&lt;/span&gt;&lt;br /&gt;“ Méthode permettant au système de ranger automatiquement les nouveaux sons introduits par l’utilisateur à partir d’une analyse de leur contenu selon des catégories qu’il aura prédéfinies.»&lt;br /&gt;Nom : IRCAM Centre Pompidou.&lt;br /&gt;URL : &lt;a href="http://www.ircam.fr/307.html?&amp;L=0&amp;amp;tx_ircam_pi4%5BshowUid%5D=15&amp;cH"&gt;http://www.ircam.fr/307.html?&amp;amp;L=0&amp;tx_ircam_pi4[showUid]=15&amp;amp;cH&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.ircam.fr/307.html?&amp;L=0&amp;amp;tx_ircam_pi4%5BshowUid%5D=15&amp;cH"&gt;ash=72de9812b5&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Cataloging&lt;/span&gt;&lt;br /&gt;"The compilation and maintenance of primary information by systematically describing objects in the collection, and the arranging of this information into an object catalog record."&lt;br /&gt;Name: International Guidelines for Museum Object Information: The CIDOC Information Categories.&lt;br /&gt;URL: &lt;a href="http://www.willpowerinfo.myby.co.uk/cidoc/guide/guideglo.htm"&gt;http://www.willpowerinfo.myby.co.uk/cidoc/guide/guideglo.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Catalogage&lt;/span&gt;&lt;br /&gt;"Consiste à analyser le document en tant que support."&lt;br /&gt;Nom : Methodoc.&lt;br /&gt;URL : &lt;a href="http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/catal"&gt;http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/catal&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/catal"&gt;ogage.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Indexing    &lt;/span&gt;&lt;br /&gt;"The process of converting a collection of data into a database suitable for easy search and retrieval."&lt;br /&gt;Name: Virtech E-solution-SEO: E-Solutions and Web Development for Today's Internet&lt;br /&gt;URL: &lt;a href="http://www.virtechseo.com/seoglossary.htm"&gt;http://www.virtechseo.com/seoglossary.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Indexation&lt;/span&gt;&lt;br /&gt;"Consiste à analyser le document pour les informations qu'il contient."&lt;br /&gt;Nom : Methodoc.&lt;br /&gt;URL : &lt;a href="http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/"&gt;http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.scd.univ-lille3.fr/methodoc/cours/typedocument/"&gt;indexation.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Record    &lt;/span&gt;&lt;br /&gt;"A group of fields relating to a particular object or transaction."&lt;br /&gt;Name: International Guidelines for Museum Object Information: The CIDOC Information Categories.&lt;br /&gt;URL: &lt;a href="http://www.willpowerinfo.myby.co.uk/cidoc/guide/guideglo.htm"&gt;http://www.willpowerinfo.myby.co.uk/cidoc/guide/guideglo.htm&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Enregistrement&lt;/span&gt;&lt;br /&gt;"Fait de recueillir et de conserver (une donnée) au moyen d'appareils appropriés."&lt;br /&gt;Nom : TLFI : Trésor de la langue française informatisé.&lt;br /&gt;URL: &lt;a href="http://atilf.atilf.fr/dendien/scripts/tlfiv5/saveregass.exe?43;s=2841"&gt;http://atilf.atilf.fr/dendien/scripts/tlfiv5/saveregass.exe?43;s=2841&lt;/a&gt;&lt;br /&gt;&lt;a href="http://atilf.atilf.fr/dendien/scripts/tlfiv5/saveregass.exe?43;s=2841"&gt;038385;r=2;;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Segmentation    &lt;/span&gt;&lt;br /&gt;"The process by which speech signals are divided into phonemes, syllables or words."&lt;br /&gt;Name: Keith Yates Design Group.&lt;br /&gt;URL: &lt;a href="http://www.keithyates.com/index.html"&gt;http://www.keithyates.com/index.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Segmentation  &lt;/span&gt;&lt;br /&gt;"Consiste à détecter les variations brusques du signal, à détecter les transitions entre deux zones stables successives."&lt;br /&gt;Nom : Thèse : Segmentation et indexation des sons.&lt;br /&gt;URL: &lt;a href="http://stephanerossignol.ifrance.com/"&gt;http://stephanerossignol.ifrance.com/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Sound recording    &lt;/span&gt;&lt;br /&gt;"The fixation of a series of musical, spoken or other sounds."&lt;br /&gt;Name: The sound of American music company.&lt;br /&gt;URL: &lt;a href="http://www.americanmusicco.com/license/helpfulHints.asp"&gt;http://www.americanmusicco.com/license/helpfulHints.asp&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold; font-style: italic;"&gt;Enregistrement sonore    &lt;/span&gt;&lt;br /&gt;"Opération qui consiste à garder la trace d'un son de façon durable sur un support analogique comme la bande magnétique ou le disque vinyle, ou sur un support numérique comme le disque compact, en vue de pouvoir le diffuser au plus proche de l'identique et éventuellement le modifier (le traiter)."&lt;br /&gt;Nom : Techno-science.net&lt;br /&gt;URL: &lt;a href="http://www.techno-science.net/?onglet=glossaire&amp;definition=1256"&gt;http://www.techno-science.net/?onglet=glossaire&amp;amp;definition=1256&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4202417623550891385-5313877315929018066?l=sound-indexing.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sound-indexing.blogspot.com/feeds/5313877315929018066/comments/default' title='Publier les commentaires'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4202417623550891385&amp;postID=5313877315929018066' title='0 commentaires'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5313877315929018066'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4202417623550891385/posts/default/5313877315929018066'/><link rel='alternate' type='text/html' href='http://sound-indexing.blogspot.com/2007/01/glossary.html' title='Glossary English / French'/><author><name>Anne-Cécile</name><uri>http://www.blogger.com/profile/00362460633066418683</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
