Big speech data analytics for contact centers - BISON

European Horizon 2020 project No. 645323
2015-2017

Publications

2017

BASKAR Murali K., KARAFIÁT Martin, BURGET Lukáš, VESELÝ Karel, GRÉZL František and ČERNOCKÝ Jan. Residual Memory Networks: Feed-forward approach to learn long-term temporal dependencies. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 4810-4814. ISBN 978-1-5090-4117-6.
BENEŠ Karel, BASKAR Murali K. and BURGET Lukáš. Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 284-288. ISSN 1990-9772.
KARAFIÁT Martin, BASKAR Murali K., MATĚJKA Pavel, VESELÝ Karel, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 719-723. ISSN 1990-9772.
MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, BURGET Lukáš, DIEZ Sánchez Mireia and ČERNOCKÝ Jan. Analysis of Score Normalization in Multilingual Speaker Recognition. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1567-1571. ISSN 1990-9772.
ONDEL Lucas, BURGET Lukáš, ČERNOCKÝ Jan and KESIRAJU Santosh. Bayesian phonotactic language model for acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 5750-5754. ISBN 978-1-5090-4117-6.
PLCHOT Oldřich, MATĚJKA Pavel, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir and BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1348-1352. ISSN 1990-9772.
SILNOVA Anna, BURGET Lukáš and ČERNOCKÝ Jan. Alternative Approaches to Neural Network based Speaker Verification. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1572-1575. ISSN 1990-9772.
VESELÝ Karel, BASKAR Murali K., DIEZ Sánchez Mireia and BENEŠ Karel. MGB-3 BUT System: Low-resource ASR on Egyptian YouTube data. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 368-373. ISBN 978-1-5090-4788-8.
VESELÝ Karel, BURGET Lukáš and ČERNOCKÝ Jan. Semi-supervised DNN training with word selection for ASR. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 3687-3691. ISSN 1990-9772.
ZEINALI Hossein, SAMETI Hossein and BURGET Lukáš. HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2017, vol. 25, no. 7, pp. 1421-1435. ISSN 2329-9290.
ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš and ČERNOCKÝ Jan. Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models. Computer Speech and Language. Amsterdam: Elsevier Science, 2017, vol. 46, pp. 53-71. ISSN 0885-2308.

2016

Segura, C., Balcells, D., Umbert, M., Arias, J., & Luque, J. (2016). Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls. In Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, IberSPEECH 2016, Lisbon, Portugal, November 23-25, 2016, Proceedings 3 (pp. 255-265). Springer International Publishing. Print ISBN 978-3-319-49168-4, Online ISBN 978-3-319-49169-1.

The original publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-49169-1_25.

Zewoudie, A.W., Luque, J., Hernando, J. (2016) Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System. Proc. Odyssey 2016, 400-406.
Woubie, A., Luque, J., Hernando, J. (2016) Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features. Proc. Interspeech 2016, 372-376.
BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, MATĚJKA Pavel, PLCHOT Oldřich, DIEZ Sánchez Mireia, SILNOVA Anna, JIANG Xiaowei, NOVOTNÝ Ondřej, ROHDIN Johan A., GLEMBEK Ondřej, GRÉZL František, BURGET Lukáš, ONDEL Lucas, PEŠÁN Jan, ČERNOCKÝ Jan, KENNY Patrick, ALAM Jahangir, BHATTACHARYA Gautam and ZEINALI Hossein et al. ABC NIST SRE 2016 SYSTEM DESCRIPTION. San Diego: National Institute of Standards and Technology, 2016.
EGOROVA Ekaterina and SERRANO Jordi Luque. Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 114-120. ISSN 1877-0509.
GRÉZL František and KARAFIÁT Martin. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 144-151. ISSN 1877-0509.
GRÉZL František, EGOROVA Ekaterina and KARAFIÁT Martin. Study of Large Data Resources for Multilingual Training and System Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 15-22. ISSN 1877-0509.
MATĚJKA Pavel, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis Of DNN Approaches To Speaker Identification. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5100-5104. ISBN 978-1-4799-9988-0.
NOVOTNÝ Ondřej, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 199-204. ISBN 978-1-5090-4903-5.
ONDEL Lucas, BURGET Lukáš and ČERNOCKÝ Jan. Variational Inference for Acoustic Unit Discovery. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 80-86. ISSN 1877-0509.
PEŠÁN Jan, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Networks for Spoken Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 3285-3289. ISBN 978-1-5108-3313-5.
PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai and MATĚJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5090-5094. ISBN 978-1-4799-9988-0.
Cevenini, Claudia / Denti, Enrico / Omicini, Andrea / Cerno, Italo (2016): "Privacy Through Anonymisation in Large-Scale Socio-Technical Systems: Multi-lingual Contact Centres Across the EU", In Internet Science. 3rd International Conference on Internet Science (INSCI 2016): Openness, Collaboration and Collective Action, Ch. 25, Lecture Notes in Computer Science 9934, pages 291-305, 12-14 September 2016. Springer International Publishing. Print ISBN 978-3-319-45981-3, Online ISBN 978-3-319-45982-0.

The original publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-45982-0_25.

2015

Glembek, Ondřej / Matějka, Pavel / Burget, Lukáš / Schwarz, Petr / Pešán, Jan / Plchot, Oldřich (2015): "Voice-print transformation for migration between automatic speaker identification systems", In Abstract book of the 7th European Academy of Forensic Science Conference. Praha: Criminal Police Department Prague, 2015. ISBN 978-80-260-8659-8.
Karafiát, Martin / Grézl, František / Burget, Lukáš / Szöke, Igor / Černocký, Jan (2015): "Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge", In INTERSPEECH-2015, 2454-2458.
Llimona, Quim / Luque, Jordi / Anguera, Xavier / Hidalgo, Zoraida / Park, Souneil / Oliver, Nuria (2015): "Effect of gender and call duration on customer satisfaction in call center big data", In INTERSPEECH-2015, 1825-1829.
Woubie, Abraham / Luque, Jordi / Hernando, Javier (2015): "Using voice-quality measurements with prosodic and spectral features for speaker diarization", In INTERSPEECH-2015, 3100-3104.