ENS - Ecole Normale Supérieure
Non-reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2019). End-to-End Speech Recognition from the raw waveform. In Interspeech-2018.

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Synnaeve, G., Collobert, R. & Dupoux, E. (2018). End-to-End Speech Recognition From the Raw Waveform. In Interspeech 2018. doi:10.21437/Interspeech.2018-2414

Reviewed conference proceeding  

Zeghidour, N., Usunier, N., Kokkinos, I., Schatz, T., Synnaeve, G. & Dupoux, E. (2018). Learning Filterbanks from Raw Speech for Phoneme Recognition. In ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Usunier, N. & Dupoux, E. (2016). Joint Learning of Speaker and Phonetic Similarities with Siamese Networks. In INTERSPEECH-2016, 1295-1299.

Reviewed conference proceeding  

Zeghidour, N., Synnaeve, G., Versteegh, M. & Dupoux, E. (2016 ). A Deep Scattering Spectrum - Deep Siamese Network Pipeline For Unsupervised Acoustic Modeling. In ICASSP-2016, 4965-4969.

Reviewed conference proceeding  

Warlaumont, A., Vandam, M., Bergelson, E. & Cristia, A. (2017). HomeBank: A repository for long-form real-world audio recordings of children. In Proceedings of Interspeech Show & Tell.

Non-reviewed conference proceeding  

Wang, X., Du, J., Cristia, A., Sun, L. & Lee, C. (2020). A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 7304-7308. doi:10.1109/ICASSP40776.2020.9053875

Reviewed conference proceeding  

Versteegh, M., Anguera, X., Jansen, A. & Dupoux, E. (2016). The Zero Resource Speech Challenge 2015: Proposed Approaches and Results. , Vol. 81: In SLTU-2016 Procedia Computer Science, 67-72.

Reviewed conference proceeding  

Versteegh, M., Thiollière, R., Schatz, T., Cao, X., Anguera, X., Jansen, A. & Dupoux, E. (2015). The Zero Resource Speech Challenge 2015. In INTERSPEECH-2015, 3169-3173.

Reviewed conference proceeding  

Versteegh, M., Seidl, A. & Cristia, A. (2014). Acoustic correlates of phonological status. In Proceedings of Interspeech, 91-95.

Reviewed conference proceeding  

Varadarajan, B., Khudanpur, S. & Dupoux, E. (2008 ). Unsupervised Learning of Acoustic Subword Units. In Proceedings of ACL-08: HLT, 165-168.

Reviewed conference proceeding  

Tsuji, S. & Cristia, A. (2017). Which acoustic and phonological factors shape infants' vowel discrimination? Exploiting natural variation in InPhonDB. . In Proceedings of Interspeech. doi:10.21437/Interspeech.2017-1468

Reviewed conference proceeding  

Tsuji, S., Bergmann, C. & Cristia, A. (2017). MetaLab: A repository for meta-analyses on language development, and more. . In Proceedings of Interspeech Show & Tell.

Reviewed conference proceeding  

Titeux, H., Riad, R., Cao, X., Hamilakis, N., Madden, K., Cristia, A., Bachoud-Levi, A. & Dupoux, E. (2020). Seshat: A tool for managing and verifying annotation campaigns of audio data. In LREC - 2th Language Resources and Evaluation Conference, Marseille, France.

Reviewed conference proceeding  

Thual, A., Dancette, C., Karadayi, J., Benjumea, J. & Dupoux, E. (2018). A K-nearest neighbours approach to unsupervised spoken term discovery. In EEE Spoken Language Technology SLT-2018.

Reviewed conference proceeding  

Thiollière, R., Dunbar, E., Synnaeve, G., Versteegh, M. & Dupoux, E. (2015 ). A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling. In INTERSPEECH-2015, 3179-3183.

Reviewed conference proceeding  

Synnaeve, G. & Dupoux, E. (2016). A temporal coherence loss function for learning unsupervised acoustic embeddings. , Vol. 81: In SLTU-2016 Procedia Computer Science, 95-100.

Reviewed conference proceeding  

Synnaeve, G., Dautriche, I., Boerschinger, B., Johnson, M. & Dupoux, E. (2014). Unsupervised word segmentation in context. In Proceedings of 25th International Conference on Computational Linguistics (CoLing), 2326-2334.

Reviewed conference proceeding  

Synnaeve, G., Versteegh, M. & Dupoux, E. (2014). Learning words from images and speech. In NIPS Workshop on Learning Semantics.

Reviewed conference proceeding  

Synnaeve, G., Schatz, T. & Dupoux, E. (2014 ). Phonetics embedding learning with side information. In IEEE Spoken Language Technology Workshop, 106 - 111. doi:10.1109/slt.2014.7078558

Non-reviewed conference proceeding  

Semenzin, C. , Hamrick, L., Seidl, A., Kelleher, B. & Cristia, A. (2021). Towards Large-Scale Data Annotation of Audio from Wearables: Validating Zooniverse Annotations of Infant Vocalization Types. In 2021 IEEE Spoken Language Technology Workshop (SLT), Shenzhen, China: IEEE, 1079-1085. doi:10.1109/SLT48900.2021.9383511

Non-reviewed conference proceeding  

Semenzin, C. , Hamrick, L., Seidl, A., Kelleher, B. & Cristia, A. (2021). Theory evaluation in the age of cumulative science. In 2021 IEEE Spoken Language Technology Workshop (SLT), Shenzhen, China, 1079-1085. doi:10.1109/SLT48900.2021.9383511

Non-reviewed conference proceeding  

Seidl, A., Warlaumont, A. & Cristia, A. (2019). Towards detection of canonical babbling by citizen scientists: Performance as a function of clip length. In Proceedings of Interspeech, Graz, Austria.

Non-reviewed conference proceeding  

Schuller, B., Batliner, A., Bergler, C., Pokorny, F., Krajewski, J., Cychosz, M., Vollmann, R., Roelen, S., Schnieder, S., Bergelson, E. & Cristia, A. (2019). The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity. In Proceedings of Interspeech, Graz, Austria.

Reviewed conference proceeding  

Schatz, T., Turnbull, R., Bach, F. & Dupoux, E. (2017). A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability. In INTERSPEECH-2017.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Cao, X., Bach, F., Hynek, H. & Dupoux, E. (2014 ). Evaluating speech features with the Minimal-Pair ABX task (II): Resistance to noise. In INTERSPEECH-2014, 915-919.

Reviewed conference proceeding  

Schatz, T., Peddinti, V., Bach, F., Jansen, A., Hynek, H. & Dupoux, E. (2013 ). Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline. In INTERSPEECH-2013, 1781-1785.

Non-reviewed conference proceeding  

Ryant, N., Church, K., Cieri, C., Cristia, A., Ganapathy, S. & Liberman, M. (2019). Second DIHARD Diarization Challenge: Dataset, task, and baselines. In Proceedings of Interspeech, Graz, Austria.

Reviewed conference proceeding  

Rivière, M. & Dupoux, E. (2021). Towards unsupervised learning of speech features in the wild. In 2021 IEEE Spoken Language Technology Workshop (SLT), 156-163.

Reviewed conference proceeding  

Rivière, M., Mazaré, P., Joulin, A. & Dupoux, E. (2020). Unsupervised pretraining transfers well across languages. In IEEE (Eds.), In ICASSP-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). doi:10.1109/ICASSP40776.2020.9054548