A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

Amaral, R.; Meinedo, H.; Caseiro, D.; Trancoso, I.; Neto, J.

doi:10.1155/2007/37507

Research Article
Open access
Published: 01 December 2007

A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

R. Amaral^1,2,3,
H. Meinedo^1,3,
D. Caseiro^1,3,
I. Trancoso^1,3 &
…
J. Neto^1,3

EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 037507 (2007) Cite this article

1257 Accesses
10 Citations
Metrics details

Abstract

This paper describes ongoing work on selective dissemination of broadcast news. Our pipeline system includes several modules: audio preprocessing, speech recognition, and topic segmentation and indexation. The main goal of this work is to study the impact of earlier errors in the last modules. The impact of audio preprocessing errors is quite small on the speech recognition module, but quite significant in terms of topic segmentation. On the other hand, the impact of speech recognition errors on the topic segmentation and indexation modules is almost negligible. The diagnostic of the errors in these modules is a very important step for the improvement of the prototype of a media watch system described in this paper.

References

Meinedo H, Neto J: A stream-based audio segmentation, classification and clustering pre-processing system for broadcast news using ANN models. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 237–240.
Google Scholar
Amaral R, Trancoso I: Improving the topic indexation and segmentation modules of a media watch system. Proceedings of the 8th International Conference on Spoken Language Processing (INTERSPEECH-ICSLP '04), October 2004, Jeju Island, Korea 1609–1612.
Google Scholar
Trancoso I, Neto J, Meinedo H, Amaral R: Evaluation of an alert system for selective dissemination of broadcast news. Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH-INTERSPEECH '03), September 2003, Geneva, Switzerland 1257–1260.
Google Scholar
NIST : Fall 2004 rich transcription (rt-04f) evaluation plan. 2004.
Google Scholar
Siegler M, Jain U, Raj B, Stern R: Automatic segmentation, classification and clustering of broadcast news audio. Proceedings of DARPA Speech Recognition Workshop, February 1997, Chantilly, Va, USA 97–99.
Google Scholar
Chen S, Gopalakrishnan P: Speaker, environment and channel change detection and clustering via the Bayesian information criterion. Proceedings of DARPA Speech Recognition Workshop, February 1998, Lansdowne, Va, USA 127–132.
Google Scholar
Žibert J, Mihelič F, Martens J-P, et al.: The COST278 broadcast news segmentation and speaker clustering evaluation—overview, methodology, systems, results. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 629–932.
Google Scholar
Tranter SE, Reynolds DA: An overview of automatic speaker diarization systems. IEEE Transactions on Audio, Speech and Language Processing 2006,14(5):1557-1565.
Article Google Scholar
Zhu X, Barras C, Meignier S, Gauvain J-L: Combining speaker identification and BIC for speaker diarization. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 2441–2444.
Google Scholar
Meinedo H, Caseiro D, Neto J, Trancoso I: AUDIMUS.media: a broadcast news speech recognition system for the European Portuguese language. Proceedings of the 6th International Workshop on Computational Processing of the Portuguese Language (PROPOR '03), June 2003, Faro, Portugal 9–17.
Chapter Google Scholar
Mohri M, Pereira F, Riley M: Weighted finite-state transducers in speech recognition. Proceedings of Automatic Speech Recognition: Challenges for the New Millenium (ASR '00), September 2000, Paris, France 97–106.
Google Scholar
Caseiro D, Trancoso I: A specialized on-the-fly algorithm for lexicon and language model composition. IEEE Transactions on Audio, Speech and Language Processing 2006,14(4):1281-1291.
Article Google Scholar
Williams D: Knowing what you don't know: roles for confidence measures in automatic speech recognition, Ph.D. thesis. University of Sheffield, Sheffield, UK; 1999.
Google Scholar
Berger AL, Della Pietra VJ, Della Pietra SA: A maximum entropy approach to natural language processing. Computational Linguistics 1996,22(1):39-71.
Google Scholar
Matsoukas S, Prasad R, Laxminarayan S, Xiang B, Nguyen L, Schwartz R:The 2004 BBN recognition systems for English broadcast news and conversational telephone speech. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 1641–1644.
Google Scholar
Nguyen L, Xiang B, Afify M, et al.: The BBN RT04 English broadcast news transcription system. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 1673–1676.
Google Scholar
Galliano S, Geoffrois E, Mostefa D, Choukri K, Bonastre J-F, Gravier G: The ESTER phase II evaluation campaign for the rich transcription of French broadcast news. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 1149–1152.
Google Scholar
Gauvain JL, Lamel L, Adda-Decker M: Developments in continuous speech dictation using the ARPA WSJ task. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95), May 1995, Detroit, Mich, USA 1: 65–68.
Google Scholar
Shriberg E: Spontaneous speech: how people really talk, and why engineers should care. Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH '05), September 2005, Lisbon, Portugal 1781–1784.
Google Scholar
Barzilay R, Collins M, Hirschberg J, Whittaker S: The rules behind roles: identifying speaker role in radio broadcasts. Proceedings of the 7th National Conference on Artificial Intelligence and the 12th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAI '00), July 2000, Austin, Tex, USA 679–684.
Google Scholar
Gelbukh A, Sidorov G, Guzmán-Arenas A: Document indexing with a concept hierarchy. Proceedings of the 1st International Workshop on New Developments in Digital Libraries (NDDL '01), July 2001, Setúbal, Portugal 47–54.
Google Scholar
Lo YY, Gauvain JL: The LIMSI topic tracking system for TDT 2002. Proceedings of DARPA Topic Detection and Tracking Workhsop, November 2002, Gaithersburg, Md, USA
Google Scholar
Werner S, Iurgel U, Kosmala A, Rigoll G: Tracking topics in broadcast news data. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '02), September 2002, Lausanne, Switzerland
Google Scholar
Garofolo J, Auzanne G, Voorhees E: The TREC spoken document retrieval track: a success story. Proceedings of the Recherche d'Informations Assiste par Ordinateur (RIAO '00), April 2000, Paris, France
Google Scholar
Neto J, Meinedo H, Amaral R, Trancoso I: A system for selective dissemination of multimedia information resulting from the ALERT project. Proceedings of ISCA Workshop on Multilingual Spoken Document Retrieval (MSDR '03), April 2003, Hong Kong 25–30.
Google Scholar
Martins C, Texeira A, Neto J: Dynamic vocabulary adaptation for a daily and real-time broadcast news transcription system. Proceedings of IEEE/ACL Spoken Language Technology Workshop, December 2006, Aruba, The Netherlands 146–149.
Google Scholar
Chelba C, Mahajan M, Acero A: Speech utterance classification. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), April 2003, Hong Kong 1: 280–283.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Instituto Superior Técnico, Universidade Técnica de Lisboa, Lisboa, 1049-001, Portugal
R. Amaral, H. Meinedo, D. Caseiro, I. Trancoso & J. Neto
Escola Superior de Tecnologia, Instituto Politécnico de Setúbal, Setúbal, 2914-503, Portugal
R. Amaral
Spoken Language Systems Lab L2F, Institute for Systems and Computer Engineering: Research and Development (INESC-ID), Lisboa, 1000-029, Portugal
R. Amaral, H. Meinedo, D. Caseiro, I. Trancoso & J. Neto

Authors

R. Amaral
View author publications
You can also search for this author in PubMed Google Scholar
H. Meinedo
View author publications
You can also search for this author in PubMed Google Scholar
D. Caseiro
View author publications
You can also search for this author in PubMed Google Scholar
I. Trancoso
View author publications
You can also search for this author in PubMed Google Scholar
J. Neto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Amaral.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://doi.org/creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Amaral, R., Meinedo, H., Caseiro, D. et al. A Prototype System for Selective Dissemination of Broadcast News in European Portuguese. EURASIP J. Adv. Signal Process. 2007, 037507 (2007). https://doi.org/10.1155/2007/37507

Download citation

Received: 08 September 2006
Accepted: 14 April 2007
Published: 01 December 2007
DOI: https://doi.org/10.1155/2007/37507

A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords