TY - JOUR
AU - Shafran, I.
AU - Rose, R.
PY - 2003
DA - 2003//
TI - Robust speech detection and segmentation for real-time ASR applications
JO - Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03)
VL - 1
UR - https://doi.org/10.1109/ICASSP.2003.1198810
DO - 10.1109/ICASSP.2003.1198810
ID - Shafran2003
ER -

TY - JOUR
AU - Beyerlein, P.
AU - Aubert, X.
AU - Haeb-Umbach, R.
PY - 2002
DA - 2002//
TI - Large vocabulary continuous speech recognition of broadcast news - the Philips/RWTH approach
JO - Speech Communication
VL - 37
UR - https://doi.org/10.1016/S0167-6393(01)00062-0
DO - 10.1016/S0167-6393(01)00062-0
ID - Beyerlein2002
ER -

TY - JOUR
AU - Gauvain, J.-L.
AU - Lamel, L.
AU - Adda, G.
PY - 2002
DA - 2002//
TI - The LIMSI broadcast news transcription system
JO - Speech Communication
VL - 37
UR - https://doi.org/10.1016/S0167-6393(01)00061-9
DO - 10.1016/S0167-6393(01)00061-9
ID - Gauvain2002
ER -

TY - JOUR
AU - Woodland, P. C.
PY - 2002
DA - 2002//
TI - The development of the HTK broadcast news transcription system: an overview
JO - Speech Communication
VL - 37
UR - https://doi.org/10.1016/S0167-6393(01)00059-0
DO - 10.1016/S0167-6393(01)00059-0
ID - Woodland2002
ER -

TY - CHAP
AU - Magrin-Chagnolleau, I.
AU - Parlangeau-Vallès, N.
PY - 2002
DA - 2002//
TI - Audio indexing: what has been accomplished and the road ahead
BT - Proceedings of Joint Conference on Information Sciences (JCIS '02)
ID - Magrin-Chagnolleau2002
ER -

TY - JOUR
AU - Makhoul, J.
AU - Kubala, F.
AU - Leek, T.
PY - 2000
DA - 2000//
TI - Speech and language technologies for audio indexing and retrieval
JO - Proceedings of the IEEE
VL - 88
UR - https://doi.org/10.1109/5.880087
DO - 10.1109/5.880087
ID - Makhoul2000
ER -

TY - CHAP
AU - Istrate, D.
AU - Scheffer, N.
AU - Fredouille, C.
AU - Bonastre, J.-F.
PY - 2005
DA - 2005//
TI - Broadcast news speaker tracking for ESTER 2005 campaign
BT - Proceedings of Interspeech
ID - Istrate2005
ER -

TY - CHAP
AU - Moraru, D.
AU - Ben, M.
AU - Gravier, G.
PY - 2005
DA - 2005//
TI - Experiments on speaker tracking and segmentation in radio broadcast news
BT - Proceedings of Interspeech
ID - Moraru2005
ER -

TY - JOUR
AU - Reynolds, D. A.
AU - Torres-Carrasquillo, P. A.
PY - 2005
DA - 2005//
TI - Approaches and applications of audio diarization
JO - Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05)
VL - 5
ID - Reynolds2005
ER -

TY - CHAP
AU - Sinha, R.
AU - Tranter, S. E.
AU - Gales, M. J. F.
AU - Woodland, P. C.
PY - 2005
DA - 2005//
TI - The Cambridge University March 2005 speaker diarisation system
BT - Proceedings of Interspeech
ID - Sinha2005
ER -

TY - CHAP
AU - Zhu, X.
AU - Barras, C.
AU - Meignier, S.
AU - Gauvain, J.-L.
PY - 2005
DA - 2005//
TI - Combining speaker identification and BIC for speaker diarization
BT - Proceedings of Interspeech
ID - Zhu2005
ER -

TY - JOUR
AU - Saunders, J.
PY - 1996
DA - 1996//
TI - Real-time discrimination of broadcast speech/music
JO - Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP '96)
VL - 2
ID - Saunders1996
ER -

TY - JOUR
AU - Greenberg, S.
PY - 1995
DA - 1995//
TI - The ears have it: the auditory basis of speech perception
JO - Proceedings of the 13th International Congress of Phonetic Sciences (ICPhS '95)
VL - 3
ID - Greenberg1995
ER -

TY - JOUR
AU - Samouelian, A.
AU - Robert-Ribes, J.
AU - Plumpe, M.
PY - 1998
DA - 1998//
TI - Speech, silence, music and noise classification of TV broadcast material
JO - Proceedings of International Conference on Spoken Language Processing (ICSLP '98), November-
VL - 3
ID - Samouelian1998
ER -

TY - JOUR
AU - Scheirer, E.
AU - Slaney, M.
PY - 1997
DA - 1997//
TI - Construction and evaluation of a robust multifeature speech/music discriminator
JO - Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)
VL - 2
ID - Scheirer1997
ER -

TY - JOUR
AU - Picone, J. W.
PY - 1993
DA - 1993//
TI - Signal modeling techniques in speech recognition
JO - Proceedings of the IEEE
VL - 81
UR - https://doi.org/10.1109/5.237532
DO - 10.1109/5.237532
ID - Picone1993
ER -

TY - JOUR
AU - Hermansky, H.
PY - 1990
DA - 1990//
TI - Perceptual linear predictive (PLP) analysis of speech
JO - Journal of the Acoustical Society of America
VL - 87
UR - https://doi.org/10.1121/1.399423
DO - 10.1121/1.399423
ID - Hermansky1990
ER -

TY - BOOK
AU - Ajmera, J.
PY - 2004
DA - 2004//
TI - Robust audio segmentation, M.S. thesis
PB - École Polytechnique Fédérale de Lausanne
CY - Lausanne, Switzerland
ID - Ajmera2004
ER -

TY - CHAP
AU - Hain, T.
AU - Johnson, S. E.
AU - Tuerk, A.
AU - Woodland, P. C.
AU - Young, S. J.
PY - 1998
DA - 1998//
TI - Segment generation and clustering in the HTK broadcast news transcription system
BT - Proceedings of the
ID - Hain1998
ER -

TY - JOUR
AU - Ajmera, J.
AU - McCowan, I.
AU - Bourlard, H.
PY - 2003
DA - 2003//
TI - Speech/music segmentation using entropy and dynamism features in a HMM classification framework
JO - Speech Communication
VL - 40
UR - https://doi.org/10.1016/S0167-6393(02)00087-0
DO - 10.1016/S0167-6393(02)00087-0
ID - Ajmera2003
ER -

TY - JOUR
AU - Karnebäck, S.
PY - 2002
DA - 2002//
TI - Expanded examinations of a low frequency modulation feature for speech/music discrimination
JO - Proceedings of 7th International Conference on Spoken Language Processing (ICSLP '02 - Interspeech '02)
VL - 2
ID - Karnebäck2002
ER -

TY - JOUR
AU - Williams, G.
AU - Ellis, D. P. W.
PY - 1999
DA - 1999//
TI - Speech/music discrimination based on posterior probabilities
JO - Proceedings of Eurospeech '99
VL - 2
ID - Williams1999
ER -

TY - CHAP
AU - Siegler, M.
AU - Jain, U.
AU - Raj, B.
AU - Stern, R.
PY - 1997
DA - 1997//
TI - Automatic segmentation, classification and clustering of broadcast news data
BT - Proceedings of the DARPA Speech Recognition Workshop
ID - Siegler1997
ER -

TY - CHAP
AU - Žibert, J.
AU - Mihelič, F.
AU - Martens, J.-P.
PY - 2005
DA - 2005//
TI - The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results
BT - Proceedings of Interspeech
ID - Žibert2005
ER -

TY - JOUR
AU - Lu, L.
AU - Zhang, H.-J.
AU - Li, S. Z.
PY - 2003
DA - 2003//
TI - Content-based audio classification and segmentation by using support vector machines
JO - ACM Multimedia Systems Journal
VL - 8
UR - https://doi.org/10.1007/s00530-002-0065-0
DO - 10.1007/s00530-002-0065-0
ID - Lu2003
ER -

TY - CHAP
AU - Chen, S. S.
AU - Gopalakrishnan, P. S.
PY - 1998
DA - 1998//
TI - Speaker, environment and channel change detection and clustering via the Bayesian information criterion
BT - Proceedings of the DARPA Speech Recognition Workshop
ID - Chen1998
ER -

TY - CHAP
AU - Logan, B.
PY - 2000
DA - 2000//
TI - Mel frequency cepstral coefficients for music modeling
BT - Proceedings of the International Symposium on Music Information Retrieval (ISMIR '00)
ID - Logan2000
ER -

TY - CHAP
AU - Reynolds, D. A.
AU - Campbell, J. P.
AU - Campbell, W. M.
PY - 2003
DA - 2003//
TI - Beyond cepstra: exploiting high-level information in speaker recognition
BT - Proceedings of the Workshop on Multimodal User Authentication
ID - Reynolds2003
ER -

TY - BOOK
AU - Garofolo, J. S.
AU - Lamel, L. F.
AU - Fisher, W. M.
AU - Fiscus, J. G.
AU - Pallett, D. S.
AU - Dahlgren, N. L.
PY - 1993
DA - 1993//
TI - DARPA TIMIT acoustic-phonetic continuous speech corpus
PB - U.S. Department of Commerce, NIST, Gaithersburg, Md
CY - USA
UR - https://doi.org/10.6028/NIST.IR.4930
DO - 10.6028/NIST.IR.4930
ID - Garofolo1993
ER -

TY - JOUR
AU - Tritschler, A.
AU - Gopinath, R.
PY - 1999
DA - 1999//
TI - Improved speaker segmentation and segments clustering using the Bayesian information criterion
JO - Proceedings of Eurospeech '99
VL - 2
ID - Tritschler1999
ER -

TY - JOUR
AU - Mihelič, F.
AU - Gros, J.
AU - Dobrišek, S.
AU - Žibert, J.
AU - Pavešić, N.
PY - 2003
DA - 2003//
TI - Spoken language resources at LUKS of the University of Ljubljana
JO - International Journal of Speech Technology
VL - 6
UR - https://doi.org/10.1023/A:1023462002932
DO - 10.1023/A:1023462002932
ID - Mihelič2003
ER -

TY - BOOK
AU - Young, S.
AU - Evermann, G.
AU - Gales, M.
PY - 2004
DA - 2004//
TI - The HTK Book (for HTK Version 3.2)
PB - Cambridge University Engineering Department
CY - Cambridge, UK
ID - Young2004
ER -

TY - JOUR
AU - Lee, K.-F.
AU - Hon, H.-W.
PY - 1989
DA - 1989//
TI - Speaker-independent phone recognition using hidden Markov models
JO - IEEE Transactions on Acoustics, Speech, and Signal Processing
VL - 37
UR - https://doi.org/10.1109/29.46546
DO - 10.1109/29.46546
ID - Lee1989
ER -

TY - CHAP
AU - Potamianos, G.
AU - Neti, C.
AU - Luettin, J.
AU - Matthews, I.
ED - Bailly, G.
ED - Vatikiotis-Bateson, E.
ED - Perrier, P.
PY - 2004
DA - 2004//
TI - Audio-visual automatic speech recognition: an overview
BT - Issues in Visual and Audio-Visual Speech Processing
PB - MIT Press
CY - Cambridge, Mass, USA
ID - Potamianos2004
ER -

TY - CHAP
AU - Žibert, J.
AU - Mihelič, F.
PY - 2004
DA - 2004//
TI - Development of Slovenian broadcast news speech database
BT - Proceedings of the International Conference on Language Resources and Evaluation (LREC '04)
ID - Žibert2004
ER -

TY - CHAP
AU - Vandecatseye, A.
AU - Martens, J. P.
AU - Neto, J.
PY - 2004
DA - 2004//
TI - The COST278 pan-European broadcast news database
BT - Proceedings of the International Conference on Language Resources and Evaluation (LREC '04)
ID - Vandecatseye2004
ER -

TY - CHAP
AU - Baker, B.
AU - Vogt, R.
AU - Sridharan, S.
PY - 2005
DA - 2005//
TI - Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification
BT - Proceedings of Interspeech
ID - Baker2005
ER -

TY - JOUR
AU - Hatch, A. O.
AU - Peskin, B.
AU - Stolcke, A.
PY - 2005
DA - 2005//
TI - Improved phonetic speaker recognition using lattice decoding
JO - Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05)
VL - 1
ID - Hatch2005
ER -