, Journals: IEEE transactions on speech and language processing
,
, projects evaluated). I was an expert for the EU ERC program, I have been an expert for ANR since 2013, 2015.
, I was an expert for the following associate professor hiring committees in French universities (computer science, CNU, vol.27
, , 2018.
, , 2017.
, , 2015.
, , 2012.
, , vol.1, 2012.
, AMU), 2018.
, , p.2017
, , p.2017
, , p.2017
, , p.2017
, , p.2017
, , p.2017
, , p.2015
, , p.2012
, I was a member of the thesis steering committees (comité de suivi de thèse) of: ? Julien Dejasmin, 2019.
, , 2017.
, , 2017.
, , 2015.
, , 2015.
, , pp.2015-2017
, I was an external expert for the doctoral grant committee at UAPV (Avignon) in, 2013.
, I am the co-author of 112 publications: 4 book chapters, 9 peer-reviewed international journal articles (+1 submitted), 69 peer-reviewed international conference articles, 16 peer-reviewed French conference articles, 14 other publications including theses and reports. H-Index of 22
Replicating Speech Rate Convergence Experiments on the Switchboard Corpus, 4REAL workshop at LREC, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01807796
Finding the Structure of Documents, Multilingual natural language processing applications, pp.21-48, 2011. ,
Navigation dans les documents audio par le résumé automatique, Vers une recherche d'information contextuelle, assistée et personnalisée, 2011. ,
Open-domain Multi-Document Summarization via Information Extraction: Challenges and Prospects, Multi-source Multilingual Information Extraction and Summarization. Lecture Notes in Computer Science, 2011. ,
Speech segmentation and spoken document processing, Handbook of Natural Language Processing and Machine Translation, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194291
, 2 International peer-reviewed journals
The SENSEI Project: Making Sense of Human Conversations, Lecture Notes on Artificial Intelligence LNAI, vol.9577, pp.10-33, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01454923
Understand the Global Economic Crisis: A Text Summarization Approach, vol.20, pp.89-110, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01194253
Re-ranking Summaries Based on Cross-Document Information Extraction, Information Retrieval Technology, pp.432-442, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01194271
Long story short-Global unsupervised models for keyphrase based meeting summarization, Speech Communication, vol.52, issue.10, pp.801-815, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01194272
Jing Tien, Dimitra Vergyri, and Fan Yang, The CALO Meeting Assistant System, 2010. ,
, Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles". Ed. by Shu-Chuan Tseng. Language and Lingusitics Monograph Series: Linguistic Patterns in Spontaneous Speech, pp.213-239, 2009.
URL : https://hal.archives-ouvertes.fr/hal-01194276
Generative and Discriminative Methods using Morphological Information for Sentence Segmentation of Turkish, IEEE Transactions on Audio, Speech and Language Processing, vol.17, pp.895-903, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194275
Speech segmentation and spoken document processing, Signal Processing Magazine, vol.25, issue.3, pp.59-69, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194291
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation, International Journal on Semantic Computing, vol.1, pp.335-346, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-00444099
, International peer-reviewed conferences
SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis, SemEval@ NAACL-HLT, pp.202-208, 2016. ,
Evaluation of Semantic Role Labeling and Dependency Parsing of Automatic Speech Recognition Output, IEEE International Conference in Acoustics, Speech and Signal Processing, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01194270
Multiple-View Constrained Clustering For Unsupervised Face Identification, TV-Broadcast". In ICASSP2014 -Image, Video, and Multidimensional Signal Processing, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194240
Multimodal Embedding Fusion for Robust Speaker Role Recognition in Video Broadcast, IEEE ASRU, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01475413
Speaker diarization through speaker embeddings, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194233
Correcting prepositional phrase attachments using multimodal corpora, Proceedings of the 15th International Conference on Parsing Technologies, pp.72-77, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01693292
A Scalable Global Model for Summarization, NAACL/HLT 2009 Workshop on Integer Linear Programming for Natural Language Processing, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194274
Efficient Sentence Segmentation Using Syntactic Features, Spoken Languge Technologies (SLT), 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194286
MACAON: an NLP tool suite for processing word lattices, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations, pp.86-91, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00702442
Joint syntactic and semantic analysis with a multitask Deep Learning Framework for Spoken Language Understanding, Interspeech, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01454929
Asr Error Segment Localization for Spoken Recovery Strategy, IEEE International Conference in Acoustics, Speech and Signal Processing, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01194252
Integrating Prosodic Features in Extractive Meeting Summarization, ASRU, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194278
Beyond utterance extraction: summary recombination for speech summarization, Interspeech, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01454927
Lexical embedding adaptation for open-domain spoken language understanding, NIPS Workshop on Spoken Language Understanding (SLUNIPS), 2015. ,
Speaker adaptation of DNN-based ASR with i-vectors: Does it actually adapt models to speakers?, In Interspeech, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194245
Typological Features for Multilingual Delexicalised Dependency, NAACL, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02278897
Robust Named Entity Extraction from Spoken Archives, Proceedings of HLT-EMNLP'05, 2005. ,
URL : https://hal.archives-ouvertes.fr/hal-01194299
Mining Broadcast News data: Robust Information Extraction from Word Lattices, Proceeding of Eurospeech'05 ,
Multimodal understanding for person recognition in video broadcasts, Interspeech, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194244
PERCOLI: a person identification system for the 2013 REPERE challenge, First Workshop on Speech, Language and Audio in Multimedia (SLAM), pp.55-60, 2013. ,
PERCOLATTE: A Multimodal Person Discovery System in TV Broadcast for the Medieval 2015 Evaluation Campaign, 2015. ,
, Multimedia Benchmark Workshop
Call Centre Conversation Summarization: A Pilot Task at Multiling, Sigdial, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194231
Automatic Human Utility Evaluation of ASR Systems: Does WER Really Predict Performance?, In Interspeech, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01194248
Replicating Speech Rate Convergence Experiments on the Switchboard Corpus, 4REAL workshop at LREC, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01807796
Adding Syntactic Annotations to Flickr30k Entities Corpus for Multimodal Ambiguous Prepositional-Phrase Attachment Resolution, LREC, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01832930
Visual Disambiguation of Prepositional Phrase Attachments: Multimodal Machine Learning for Syntactic Analysis Correction, International Work-Conference on Artificial Neural Networks, pp.632-643, 2019. ,
Robust Semantic Parsing with Adversarial Learning for Domain Generalization, NAACL, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02298402
Veyn at PARSEME Shared Task 2018: Recurrent neural networks for VMWE identification, LAW-MWE-CxG Workshop at COLING, 2018. ,
Evaluation of word embeddings against cognitive processes: primed reaction times in lexical decision and naming tasks, Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, pp.21-26, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01773220
A Document Repository for Social Media and Speech Conversations, Language Resources and Evaluation Conference (LREC), 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01454924
Word embedding evaluation and combination, Language Resources and Evaluation Conference (LREC), 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01433185
Summarizing Behaviours: An Experiment on the Annotation of Call-Centre Conversations, Language Resources and Evaluation Conference (LREC), 2016. ,
Investigation of Speaker Embeddings for Cross-show Speaker Diarization, International Conference on Acoustics, Speech and Signal Processing, 2016. ,
CallAn: A Tool to Analyze Call Center Conversations, International Workshop on Spoken Dialogue Systems (IWSDS), 2016. ,
Speech Input for Live Performance: An Impromptu Dialogue Between the Computer and the Artist, International Workshop on Spoken Dialogue Systems (IWSDS), 2016. ,
Concept-based Summarization using Integer Linear Programming: From Concept Pruning to Multiple Optimal Solutions, Conference on Empirical Methods in Natural Language Processing, p.2015, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01203750
Speech is silver, but silence is golden: improving speech-to-speech translation performance by slashing users input, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194229
Adapting lexical representation and OOV handling from written to spoken language with word embedding, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194228
MultiLing 2015: Multilingual Summarization of Single and Multi-Documents, On-line Fora, and Call-center Conversations, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194230
Rapid FrameNet annotation of spoken conversation transcripts, Joint ACL-ISO Workshop on Interoperable Semantic Annotation, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01194232
Joint Decoding of Complementary Utterances, Spoken Languge Technologies (SLT), 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194247
Adapting dependency parsing to spontaneous speech for open domain spoken language understanding, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194246
Scene understanding for identifying persons in TV shows: beyond face authentication, 12th International Workshop on Content-Based Multimedia Indexing (CBMI), 2014. ,
A Repository of State of the Art and Competitive Baseline Summaries for Generic News Summarization, LREC, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194243
Automatically enriching spoken corpora with syntactic information for linguistic studies, LREC, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194241
Retrieving the syntactic structure of erroneous ASR transcriptions for open-domain Spoken Language Understanding, ICASSP2014 -Speech and Language Processing, p.2014, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194236
Reranked aligners for interactive transcript correction, ICASSP2014 -Speech and Language Processing, p.2014, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194237
Unsupervised Face Identification in TV Content using Audio-Visual Sources, 11th International Workshop on Content-Based Multimedia Indexing (CBMI), 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00812334
Searching Segments of Interest in Single Story Web-Videos, Workshop on Image and Audio Analysis for Multimedia Interactive Services (WIAMIS), 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01194250
Can You Give Me Another Word for Hyperbaric?: Improving Speech Translation using Targeted Clarification Questions, IEEE International Conference in Acoustics, Speech and Signal Processing (ICASSP), 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01194251
Generative Constituent Parsing and Discriminative Dependency Reranking: Experiments on English and French, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00702499
Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PortMedia corpora, LREC'12, 2012. ,
Detecting Person Presence in TV Shows with Linguistic and Structural Features, IEEE International Conference in Acoustics, Speech and Signal Processing, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01194256
Applying Multiclass Bandit algorithms to call-type classification, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU'11), 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01194262
Semi-supervised Part-of-speech Tagging in Speech Applications, Interspeech, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01194268
Any Questions? Automatic Question Detection in Meetings, ASRU, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194279
Leveraging Sentence Weights in Concept-based Optimization Framework for Extractive Meeting Summarization, Interspeech, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194281
ClusterRank: A Graph Based Method for Meeting Summarization, Interspeech, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194284
Phrase-level and Word-level Strategies for Detecting Appositions in Speech, In Interspeech, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194285
Combined low level and high level features for Out-Of-Vocabulary Word detection, Interspeech, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194283
Syntactically-informed Models for Comma Prediction, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194282
A Global Optimization Framework for Meeting Summarization, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194280
Jing Tien, Dimitra Vergyri, and Fan Yang, The CALO Meeting Speech Recognition and Understanding System, 2008. ,
A Keyphrase Based Approach to Interactive Meeting Summarization, Spoken Languge Technologies (SLT), 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194292
Packing the Meeting Summarization Knapsack, In Interspeech ,
Punctuating Speech for Information Extraction, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194289
An Interactive Timeline for Speech Database Browsing, Interspeech, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01194296
An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings, Proceedings of SIGIR 2007, Searching Spontaneous Conversational Speech (SSCS) workshop, pp.37-43, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01194294
Information retrieval on mixed written and spoken documents, RIAO, pp.826-835, 2004. ,
URL : https://hal.archives-ouvertes.fr/hal-01194302
, 4 National peer-reviewed conferences
Evaluation automatique de la satisfaction client à partir de conversations de type « chat » par réseaux de neurones récurrents avec mécanisme d'attention, TALN, 2018. ,
Détection d'erreurs dans des transcriptions OCR de documents historiques par réseaux de neurones récurrents multi-niveau, TALN, 2018. ,
Modèles génératif et discriminant en analyse syntaxique : expériences sur le corpus arboré de Paris 7, TALN'11, 2011. ,
Correction automatique d'attachements prépositionnels par utilisation de traits visuels, TALN, 2018. ,
Fusion multimodale image/texte par réseaux de neurones profonds pour la classification de documents imprimés, CORIA, 2018. ,
Apprentissage d'agents conversationnels pour la gestion de relations clients, 24e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01773218
Détection de coréférences de bout en bout en français, 24e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), p.52, 2017. ,
Fusion d'espaces de représentations multimodaux pour la reconnaissance du rôle du locuteur dans des documents télévisuels, Actes de la conférence JEP 2016, 2016. ,
Détection de concepts pertinents pour le résumé automatique de conversations par recombinaison de patrons, Actes de la conférence TALN 2016, 2016. ,
Identification de personnes dans des flux multimédia, CORIA, 2015. ,
Détection et caractérisation d'erreurs dans des transcriptions automatiques pour des systèmes de traduction parole-parole, Actes de la conférence JEP, 2014. ,
Correction interactive de transcriptions de parole par fusion de phrases, Actes de la conférence JEP, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01194238
Percol0 -un système multimodal de détection de personnes dans des documents vidéo (Percol0 -A multimodal person detection system in video documents), Actes de la conférence conjointe JEP-TALN-RECITAL 2012, vol.1, pp.553-560, 2012. ,
Robustesse et portabilités multilingue et multi-domaines des systèmes de compréhension de la parole : les corpus du projet PortMedia (Robustness and portability of spoken language understanding systems among languages and domains : the PORTMEDIA project), Actes de la conférence conjointe JEP-TALN-RECITAL 2012, vol.1, pp.779-786, 2012. ,
Accès aux connaissances orales par le résumé automatique, EGC'06, 2006. ,
Recherche d'information dans un mélange de documents écrits et parlés, JEP), 2004. ,
The ICSI/UTD Summarization System at TAC, Proc. of the Text Analysis Conference workshop, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194277
LIF at TAC Multiling: Towards a Truly Language Independent Summarizer, Proc. of the Text Analysis Conference workshop, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01194263
The ICSI Summarization System at TAC, Proc. of the Text Analysis Conference workshop, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01194287
The LIA summarization system at DUC-2007, Proceedings of the Document Understanding Workshop (DUC), 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01194295
The LIA-Thales summarization system at DUC-2006, Document Understanding Conference Workshop, HLT-NAACL'06, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-01194297
The UMUS system for named entity generation at GREC, International Natural Language Generation Conference (INLG), 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01194267
ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields, ACL-IJCNLP, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01194273
TALEP @ DEFT'15 : Le plus cooool des systèmes d'analyse de sentiment, Actes de la 11e Défi Fouille de Texte, pp.97-103, 2015. ,
TAC2011 MultiLing Pilot Overview, Proc. of the Text Analysis Conference workshop, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01194265
Speech onset latencies as an online measure of regularity extraction, Poster at Implicit Learning Seminar, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01454925
Automatic Summarization of Call-Centre Conversations, IEEE ASRU Demo, 2015. ,
StuMaBa: from deep representation to surface, Proceedings of the 13th European workshop on natural language generation, pp.232-235, 2011. ,
Résumé automatique de parole pour un accès efficace aux bases de données audio, 2007. ,
, Indexation multimédia : caractérisation du déséquilibre entre les modalités texte et parole, 2003.
OpenFst: A general and efficient weighted finite-state transducer library, International Conference on Implementation and Application of Automata, pp.11-23, 2007. ,
Many Languages, One Parser, Transactions of the Association of Computational Linguistics, vol.4, issue.1, pp.431-444, 2016. ,
Massively multilingual word embeddings, 2016. ,
Deep speech 2: End-to-end speech recognition in english and mandarin, International Conference on Machine Learning, pp.173-182, 2016. ,
Learning bilingual word embeddings with (almost) no bilingual data, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.451-462, 2017. ,
Neural conditional random fields, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp.177-184, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-01291978
Part of speech tagging and chunking with hmm and crf, Proceedings of NLP Association of India (NLPAI) Machine Learning Contest, 2006. ,
Neural machine translation by jointly learning to align and translate, 2014. ,
End-to-end attention-based large vocabulary speech recognition, Acoustics, Speech and Signal Processing, pp.4945-4949, 2016. ,
METEOR: An automatic metric for MT evaluation with improved correlation with human judgments, Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, pp.65-72, 2005. ,
Measuring abstract reasoning in neural networks, International Conference on Machine Learning, pp.511-520, 2018. ,
Is ATIS too shallow to go deeper for benchmarking Spoken Language Understanding models?, In InterSpeech, 2018. ,
Lessons from the Netflix prize challenge, SiGKDD Explorations, vol.9, issue.2, pp.75-79, 2007. ,
Statistical language model adaptation: review and perspectives, Speech communication, vol.42, issue.1, pp.93-108, 2004. ,
A neural probabilistic language model, Journal of machine learning research, vol.3, pp.1137-1155, 2003. ,
The netflix prize, Proceedings of KDD cup and workshop, p.35, 2007. ,
Localsolver 1. x: a black-box local-search solver for 0-1 programming, 4OR, vol.9, issue.3, p.299, 2011. ,
, Cognitive bias codex, 2017.
A maximum entropy approach to natural language processing, Computational linguistics, vol.22, issue.1, pp.39-71, 1996. ,
CONDOR, a new parallel, constrained extension of Powell's UOBYQA algorithm: Experimental results and comparison with the DFO algorithm, Journal of computational and applied mathematics, vol.181, issue.1, pp.157-175, 2005. ,
Latent dirichlet allocation, Journal of machine Learning research, vol.3, pp.993-1022, 2003. ,
Is the end of supervised parsing in sight, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.400-407, 2007. ,
Very high accuracy and fast dependency parsing is not a contradiction, Proceedings of the 23rd international conference on computational linguistics, pp.89-97, 2010. ,
Enriching word vectors with subword information, 2016. ,
Freebase: a collaboratively created graph database for structuring human knowledge, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp.1247-1250, 2008. ,
Quasi-Recurrent Neural Networks, 2016. ,
Complete counterbalancing of immediate sequential effects in a Latin square design, Journal of the American Statistical Association, vol.53, issue.282, pp.525-528, 1958. ,
Is writing style predictive of scientific fraud, Proceedings of the Workshop on Stylistic Variation, pp.37-42, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-02373823
A simple rule-based part of speech tagger, Proceedings of the third conference on Applied natural language processing, pp.152-155, 1992. ,
Fast and Accurate Neural Word Segmentation for Chinese, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol.2, pp.608-615, 2017. ,
Findings of the 2011 workshop on statistical machine translation, Proceedings of the Sixth Workshop on Statistical Machine Translation, pp.22-64, 2011. ,
The use of MMR, diversity-based reranking for reordering documents and producing summaries, Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, pp.335-336, 1998. ,
A fast and accurate dependency parser using neural networks, Proceedings of the 2014 conference on empirical methods in natural language processing, pp.740-750, 2014. ,
Gaussian Mixture Embeddings for Multiple Word Prototypes, 2015. ,
The Minimalist Program. Current studies in linguistics series, 1995. ,
Lip reading sentences in the wild, 2016. ,
Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014. ,
Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms, Proceedings of the ACL-02 conference on Empirical methods in natural language processing, vol.10, pp.1-8, 2002. ,
Log-Linear Models, 2005. ,
Wav2letter: an end-to-end convnetbased speech recognition system, 2016. ,
Natural language processing (almost) from scratch, Journal of Machine Learning Research, vol.12, pp.2493-2537, 2011. ,
The world atlas of language structures, vol.1, 2005. ,
Word translation without parallel data, 2017. ,
Support-vector networks, Machine learning, vol.20, issue.3, pp.273-297, 1995. ,
On the origins of the. 05 level of statistical significance, American Psychologist, vol.37, issue.5, p.553, 1982. ,
Shai Shalev-Shwartz, and Yoram Singer, Journal of Machine Learning Research, vol.7, pp.551-585, 2006. ,
Language, 2018. ,
Overview of the TAC 2008 Update Summarization Task, 2008. ,
Search-based structured prediction, Machine learning, vol.75, issue.3, pp.297-325, 2009. ,
Language Modeling with Gated Convolutional Networks, 2016. ,
Indexing by latent semantic analysis, Journal of the American society for information science, vol.41, issue.6, p.391, 1990. ,
Delexicalized word embeddings for cross-lingual dependency parsing, Proceedings of the 15th Conference of the European Chapter, vol.1, pp.241-250, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01590639
, Deep Learning in Natural Language Processing, 2018.
Stanford's Graph-based Neural Dependency Parser at the CoNLL 2017 Shared Task, Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp.20-30, 2017. ,
The hitchhiker's guide to testing statistical significance in natural language processing, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1383-1392, 2018. ,
Investigation of spontaneous speech characterization applied to speaker role recognition, Twelfth Annual Conference of the International Speech Communication Association, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-01433512
Multilingual training of crosslingual word embeddings, Proceedings of the 15th Conference of the European Chapter, vol.1, pp.894-904, 2017. ,
Transition-Based Dependency Parsing with Stack Long Short-Term Memory, Proceedings of the 53rd, 2015. ,
, Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.334-343
The jackknife estimate of variance, The Annals of Statistics, pp.586-596, 1981. ,
Tree automata, mu-calculus and determinacy, Foundations of Computer Science, 1991. Proceedings., 32nd Annual Symposium on, pp.368-377, 1991. ,
Lexrank: Graph-based lexical centrality as salience in text summarization, Journal of Artificial Intelligence Research, vol.22, pp.457-479, 2004. ,
, VSE++: Improved Visual-Semantic Embeddings, 2017.
, , 2007.
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks, 2017. ,
Weighted tree automata and tree transducers, Handbook of Weighted Automata, pp.313-403, 2009. ,
Structured and Extended Named Entity Evaluation in Automatic Speech Transcriptions, IJC-NLP, pp.518-526, 2011. ,
Quadratic knapsack problems, Combinatorial optimization, pp.132-149, 1980. ,
Reinforcement learning from imperfect demonstrations, 2018. ,
Multilingual Language Processing From Bytes, Proceedings of NAACL-HLT, pp.1296-1306, 2016. ,
A systematic exploration of diversity in machine translation, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, vol.3, 2013. ,
A primer on neural network models for natural language processing, Journal of Artificial Intelligence Research, vol.57, pp.345-420, 2016. ,
TIRA: Configuring, Executing, and Disseminating Information Retrieval Experiments, 9th International Workshop on Textbased Information Retrieval (TIR 12) at DEXA, pp.151-155, 2012. ,
Natural Language Inference over Interaction Space, 2017. ,
Generic text summarization using relevance measure and latent semantic analysis, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp.19-25, 2001. ,
Generative adversarial nets, Advances in neural information processing systems, pp.2672-2680, 2014. ,
Hybrid speech recognition with deep bidirectional LSTM, Automatic Speech Recognition and Understanding (ASRU), pp.273-278, 2013. ,
Universals of Human Language, 1963. ,
Improved training of wasserstein gans, Advances in Neural Information Processing Systems, pp.5769-5779, 2017. ,
Beyond ASR 1-best: Using word confusion networks in spoken language understanding, Computer Speech & Language, vol.20, issue.4, pp.495-514, 2006. ,
Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1489-1501, 2016. ,
Deep speech: Scaling up end-to-end speech recognition, 2014. ,
Distributional structure, vol.10, pp.146-162, 1954. ,
Peer-review fraud-hacking the scientific publication process, New England Journal of Medicine, vol.373, issue.25, pp.2393-2395, 2015. ,
Deep Residual Learning for Image Recognition, 2015. ,
Tandem connectionist feature extraction for conventional HMM systems, icassp, pp.1635-1638, 2000. ,
Universal Language Model Fine-tuning for Text Classification, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.328-339, 2018. ,
Densely Connected Convolutional Networks, 2016. ,
Hidden Markov models for speech recognition, 1990. ,
Reinforcement learning with unsupervised auxiliary tasks, 2016. ,
How to evaluate ASR output for named entity recognition, Sixteenth Annual Conference of the International Speech Communication Association, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01251370
Statistical methods for speech recognition, 1997. ,
Unsupervised neural dependency parsing, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp.763-771, 2016. ,
Bag of tricks for efficient text classification, 2016. ,
Exploring the limits of language modeling, 2016. ,
Exploring the Limits of Language Modeling, 2016. ,
Reducibility among combinatorial problems, Complexity of computer computations, pp.85-103, 1972. ,
The budgeted maximum coverage problem, Information Processing Letters, vol.70, issue.1, pp.39-45, 1999. ,
Structured attention networks, 2017. ,
Morphological Modeling for Machine Translation of English-Iraqi Arabic Spoken Dialogs, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.995-1000, 2015. ,
Unsupervised multilingual sentence boundary detection, Computational Linguistics, vol.32, issue.4, pp.485-525, 2006. ,
Opennmt: Open-source toolkit for neural machine translation, 2017. ,
Statistical machine translation, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01433972
Moses: Open source toolkit for statistical machine translation, Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions, pp.177-180, 2007. ,
Segment representations in named entity recognition, International Conference on Text, Speech, and Dialogue, pp.61-70, 2015. ,
ImageNet Classification with Deep Convolutional Neural, Neural Information Processing Systems, pp.1-9, 2014. ,
Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012. ,
Robust part-of-speech tagging using a hidden Markov model, Computer Speech & Language, vol.6, issue.3, pp.225-242, 1992. ,
A trainable document summarizer, Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, pp.68-73, 1995. ,
Adversarial examples in the physical world, 2016. ,
Conditional random fields: Probabilistic models for segmenting and labeling sequence data, 2001. ,
Syntactic parsing and compound recognition via dual decomposition: application to French, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp.1875-1885, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01074298
An unsupervised web-based topic language model adaptation method, Acoustics, Speech and Signal Processing, pp.5081-5084, 2008. ,
Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998. ,
, Five ways to fix statistics, 2017.
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer speech & language, vol.9, issue.2, pp.171-185, 1995. ,
Dependency-based word embeddings, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.2, pp.302-308, 2014. ,
Do Multi-Sense Embeddings Improve Natural Language Understanding?, In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.1722-1732, 2015. ,
Word Embedding for Understanding Natural Language: A Survey, Guide to Big Data Applications, pp.83-104, 2018. ,
MLComp: a free website for objectively comparing machine learning programs, 2010. ,
Rouge: A package for automatic evaluation of summaries, 2004. ,
A class of submodular functions for document summarization, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.510-520, 2011. ,
, The mythos of model interpretability, 2016.
Progressive neural architecture search, Proceedings of the European Conference on Computer Vision (ECCV), pp.19-34, 2018. ,
On the limited memory BFGS method for large scale optimization, Mathematical programming, vol.45, issue.1-3, pp.503-528, 1989. ,
Gradient Episodic Memory for Continual Learning, Advances in Neural Information Processing Systems, pp.6470-6479, 2017. ,
Effective approaches to attentionbased neural machine translation, 2015. ,
The Centrality of Language in Human Cognition, Language Learning, vol.66, issue.3, pp.516-553, 2016. ,
Cross-lingual transfer parsing for low-resourced languages: An Irish case study, Proceedings of the First Celtic Language Technology Workshop, pp.41-49, 2014. ,
End-to-end sequence labeling via bi-directional lstm-cnns-crf, 2016. ,
Results of the WMT14 metrics shared task, Proceedings of the Ninth Workshop on Statistical Machine Translation, pp.293-301, 2014. ,
Part-of-speech tagging from 97% to 100%: is it time for some linguistics?, In International conference on intelligent text processing and computational linguistics, pp.171-189, 2011. ,
Foundations of statistical natural language processing, 1999. ,
Linguistic obfuscation in fraudulent science, Journal of Language and Social Psychology, vol.35, issue.4, pp.435-445, 2016. ,
Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition, 2009. ,
Effect of speech transformation on impostor acceptance, Proceedings. 2006 IEEE International Conference on, vol.1, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-01318472
, The Natural Language Decathlon: Multitask Learning as Question Answering, 2018.
A study of global inference algorithms in multi-document summarization, European Conference on Information Retrieval, pp.557-564, 2007. ,
Online large-margin training of dependency parsers, Proceedings of the 43rd annual meeting on association for computational linguistics, pp.91-98, 2005. ,
Non-projective dependency parsing using spanning tree algorithms, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp.523-530, 2005. ,
Vocal and gestural communication in nonhuman primates and the question of the origin of language, 2008. ,
Deeper syntax for better semantic parsing, Coling 2016 -26th International Conference on Computational Linguistics, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01391678
Distributed representations of words and phrases and their compositionality, Advances in neural information processing systems, pp.3111-3119, 2013. ,
Context dependent recurrent neural network language model, SLT, vol.12, pp.234-239, 2012. ,
Semi-supervised dependency parsing using lexical affinities, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol.1, pp.777-785, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00702486
Extrinsic summarization evaluation: A decision audit task, ACM Transactions on Speech and Language Processing, vol.6, issue.2, p.2, 2009. ,
Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning, pp.807-814, 2010. ,
Sequence-to-Sequence RNNs for Text Summarization, 2016. ,
Using universal linguistic knowledge to guide grammar induction, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp.1234-1244, 2010. ,
Word sense disambiguation: A survey, ACM Computing Surveys (CSUR), vol.41, issue.2, p.10, 2009. ,
Universal Dependencies v1: A Multilingual Treebank Collection, LREC, 2016. ,
Computer-intensive methods for testing hypotheses, 1989. ,
WaveNet: A Generative Model for Raw Audio, 2016. ,
Modified Wilcoxon signedrank test, Open Journal of Statistics, vol.2, issue.02, p.172, 2012. ,
BLEU: a method for automatic evaluation of machine translation, Proceedings of the 40th annual meeting on association for computational linguistics, pp.311-318, 2002. ,
GloVe: Global Vectors for Word Representation, Empirical Methods in Natural Language Processing (EMNLP), pp.1532-1543, 2014. ,
Deep Contextualized Word Representations, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.2227-2237, 2018. ,
End-to-end Audiovisual Speech Recognition, 2018. ,
Improved inference for unlexicalized parsing, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pp.404-411, 2007. ,
Significance tests which may be applied to samples from any populations, Journal of the Royal Statistical Society, vol.4, issue.1, pp.119-130, 1937. ,
Vocal tract normalization equals linear transformation in cepstral space, IEEE Transactions on Speech and Audio Processing, vol.13, issue.5, pp.930-944, 2005. ,
Evaluation of spoken language systems: The ATIS domain, Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990. ,
Context-Dependent Sense Embedding, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp.183-191, 2016. ,
Co-learning of Word Representations and Morpheme Representations, COLING, pp.141-150, 2014. ,
Hidden conditional random fields, IEEE Transactions on Pattern Analysis & Machine Intelligence, issue.10, pp.1848-1852, 2007. ,
Dependency treelet translation: Syntactically informed phrasal SMT, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp.271-279, 2005. ,
Language models are unsupervised multitask learners, 2019. ,
Do CIFAR-10 Classifiers Generalize to CIFAR-10, 2018. ,
YOLO9000: better, faster, stronger, 2016. ,
On some pitfalls in automatic evaluation and significance testing for MT, Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization, pp.57-64, 2005. ,
Principles of structure building in music, language and animal song, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.370, p.20140097, 1664. ,
A neural attention model for abstractive sentence summarization, 2015. ,
Framework of automatic text summarization using reinforcement learning, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp.256-265, 2012. ,
A survey of unsupervised grammar induction, 2010. ,
BoosTexter: A boosting-based system for text categorization, Machine learning, vol.39, issue.2-3, pp.135-168, 2000. ,
, Improved boosting algorithms using confidence-rated predictions, Machine learning, vol.37, issue.3, pp.297-336, 1999.
Connectionist language modeling for large vocabulary continuous speech recognition, Acoustics, Speech, and Signal Processing, vol.1, p.765, 2002. ,
URL : https://hal.archives-ouvertes.fr/hal-01434616
Get To The Point: Summarization with Pointer-Generator Networks, 2017. ,
Active learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, vol.6, issue.1, pp.1-114, 2012. ,
Just post it: The lesson from two cases of fabricated data detected by statistics alone, Psychological science, vol.24, issue.10, pp.1875-1888, 2013. ,
Composite Objective Optimization and Learning for Massive Datasets, 2010. ,
A study of translation error rate with targeted human annotation. Rapport technique LAMP-TR-126, 2005. ,
A study of translation edit rate with targeted human annotation, Proceedings of association for machine translation in the Americas, vol.200, 2006. ,
What's in a p-value in NLP?, In Proceedings of the eighteenth conference on computational natural language learning, pp.1-10, 2014. ,
A lognormal tied mixture model of pitch for prosody based speaker recognition, Fifth European Conference on Speech Communication and Technology, 1997. ,
A shared task on multimodal machine translation and crosslingual image description, Proceedings of the First Conference on Machine Translation, vol.2, pp.543-553, 2016. ,
Training very deep networks, Advances in neural information processing systems, pp.2377-2385, 2015. ,
Highway Networks, CoRR abs/1505.00387, 2015. ,
What has the Loebner contest told us about conversant systems, Cambridge Center for Behavioral Studies, 2004. ,
Explicit word error minimization in n-best list rescoring, Eurospeech, vol.97, pp.163-166, 1997. ,
End-to-end memory networks, Advances in neural information processing systems, pp.2440-2448, 2015. ,
Sequence to sequence learning with neural networks, Advances in neural information processing systems, pp.3104-3112, 2014. ,
Feature-rich partof-speech tagging with a cyclic dependency network, Proceedings of the 2003 Conference of the North American Chapter, vol.1, pp.173-180, 2003. ,
A survey of hybrid ANN/HMM models for automatic speech recognition, Neurocomputing, vol.37, issue.1-4, pp.91-126, 2001. ,
Improving spoken language understanding using word confusion networks, Seventh International Conference on Spoken Language Processing, 2002. ,
Attention Is All You Need, 2017. ,
Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion, Journal of machine learning research, vol.11, pp.3371-3408, 2010. ,
Sentiment analysis and opinion mining: a survey, International Journal, vol.2, issue.6, pp.282-292, 2012. ,
, A neural conversational model, 2015.
Phoneme recognition using time-delay neural networks, Readings in speech recognition, pp.393-404, 1990. ,
Residual phase cepstrum coefficients with application to cross-lingual speaker verification, Thirteenth Annual Conference of the International Speech Communication Association, 2012. ,
Computer power and human reason: From judgment to calculation, 1976. ,
Individual comparisons by ranking methods, Biometrics bulletin, vol.1, issue.6, pp.80-83, 1945. ,
Can semantic role labeling improve SMT, Proceedings of the 13th Annual Conference of the EAMT, pp.218-225, 2009. ,
SAS: A speaker verification spoofing database containing diverse attacks, Acoustics, Speech and Signal Processing, pp.4440-4444, 2015. ,
Show, attend and tell: Neural image caption generation with visual attention, International Conference on Machine Learning, pp.2048-2057, 2015. ,
Multi-Scale Context Aggregation by Dilated Convolutions, 2015. ,
Efficient summarization with read-again and copy mechanism, 2016. ,
On efficient coupling of ASR and SMT for speech translation, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07, vol.4, p.101, 2007. ,
Ensembling neural networks: Many could be better than all, Artificial Intelligence, vol.137, issue.1, pp.239-263, 2002. ,
Ensembling neural networks: many could be better than all, Artificial intelligence, vol.137, issue.1-2, pp.239-263 ,
Differentiable lower bound for expected BLEU score, 2017. ,
Neural architecture search with reinforcement learning, 2016. ,