Publications & Talks

2018

AbuRa'ed A, Chiruzzo L, Saggion H. Experiments in Detection of Implicit Citations. Calzolari N, et al. eds. Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018. 1 ed. European Language Resources Association; 2018.

Accuosto P, Saggion H. Improving the accessibility of biomedical texts by semantic enrichment and definition expansion. Procesamiento del lenguaje natural 2018; 61: 57-64

Aldezabal I., Artola X., Diaz De Ilarraza A., Gonzalez-Dios I., Labaka G., Rigau G. and Urizar R. Basque e-lexicographic resources: linguistic basis, development, and future perspectives. Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence. Galway, Ireland. 2018.

Rodrigo Agerri and German Rigau (2018). Simple Language Independent Sequence Labelling for the Annotation of Disabilities in Medical Texts. InProceedings of the Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018),Diann Track, Sevilla, Spain.

Rodrigo Agerri, Núria Bel, German Rigau, Horacio Saggion (2018). TUNER: Multifaceted Domain Adaptation for Advanced Textual Semantic Processing. First Results Available. In Proceedings of the 34th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN 2018), Sevilla, Spain.

Agerri R., Gómez-Guinovart X., Rigau G. and Solla-Portela M.Developing New Linguistic Resources and Tools for the Galician Language. Proceedings of the 11th Language Resources and Evaluation Conference (LREC'18).Miyazaki, Japan 2018.

Agerri R., Chung Y., Aldabe I., Aranberri N., Labaka G. and Rigau G.Building Named Entity Recognition Taggers via Parallel Corpora. Proceedings of the 11th Language Resources and Evaluation Conference (LREC'18). Miyazaki, Japan 2018. WP5

Eneko Agirre, Oier Lopez de Lacalle and Aitor Soroa. 2018. The risk of sub-optimal use of Open Source NLP Software: UKB is inadvertently state-of-the-art in knowledge-based WSD. NLP-OSS workshop at ACL. Melbourne, Australia. 2018

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta. SentiTegi: building a semantic oriented Basque lexicon. 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2018). Hanoi, Vietnam. 2018.

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta (2018).Saying no but meaning yes: negation and sentiment analysis in Basque.Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA 2018) in conjunction with the EMNLP 2018Conference. Brussels, Belgium. 2018.

Álvez J. and Rigau G.Towards Cross-checking WordNet and SUMO Using Meronymy. In Proceedings of the 9th Global WordNet Conference (GWC 2018). Singapore. 2018.

Álvez J., Gonzalez-Dios I. and Rigau G.Cross-checking WordNet and SUMO Using Meronymy. Proceedings of the 11th Language Resources and Evaluation Conference (LREC'18). Miyazaki.

Álvez J. Hermo M., Lucio P. and Rigau G. Automatic White-Box Testing of First-Order Logic Ontologies. Journal of Logic and Computation. ISSN 0955-792Xhttps://doi.org/10.1093/logcom/exz001. 2019.

Álvez J., Lucio P. and Rigau G. A Framework for the Evaluation of SUMO-based Ontologies Using WordNet. IEEE access.https://doi.org/10.1109/ACCESS.2019.2904835ISSN:2169-3536. 2019. 

Mikel Artetxe, Gorka Labaka, Iñigo Lopez-Gazpio, Eneko Agirre. Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation. The SIGNLL Conference on Computational Natural Language Learning CONLL 2018. Brussels, Belgium. 2018

Mikel Artetxe, Gorka Labaka, Eneko Agirre. Unsupervised Statistical Machine TranslationEMNLP 2018. Brussels, Belgium. 2018

Mikel Artetxe, Gorka Labaka, Eneko Agirre A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 789–798. Melbourne, Australia, July 15 - 20, 2018. c 2018 Association for Computational Linguistics. 2018.

Mikel Artetxe, Gorka Labaka, Eneko Agirre, Kyunghyun Cho Unsupervised Neural Machine Translation. Sixth International Conference on Learning Representations (ICLR 2018). Vancouver, Canada. 2018.

Mikel Artetxe, Gorka Labaka, Eneko Agirre. Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations. Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18) pages 5012-5019. New Orleans, Louisiana, USA. 2018.

Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre (2018) IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approachCLEF 2018 Online Working Notes. CEUR-WS.

Barbieri F, Marujo L, Karuturi P, Brendel W, Saggion H. Exploring emoji usage and prediction through a temporal variation lens. CEUR Workshop Proceedings 2018; 2130.

Barbieri F, Ballesteros M, Ronzano F, Saggion H. Multimodal Emoji Prediction. AA.VV.. The 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 1 ed. ACL; 2018. p. 679-686.

Ander Barrena, Aitor Soroa, Eneko Agirre. Learning text representations for 500K classification tasks on Named Entity Disambiguation. The SIGNLL Conference on Computational Natural Language Learning CONLL 2018. Brussels, Belgium. 2018

Bel, Núria; Pocostales, Joel. Can Domain Adaptation be Handled as Analogies?. In: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T. Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018. 1 ed. European Language Resources Association; 2018. p. 2559-2565.

Elisabet Comelles, Jordi Atserias. VERTa: a linguistic approach to automatic machine translation evaluation. Language Resources and Evaluation. First Online 15 October 2018, 1-30. (ISSN: 1574-020X (Print) 1574-0218 (Online)). 2018.

Montse Cuadros, Naiara Perez-Miguel, Iker Montoya and Aitor García-Pablos Pablos, A. G. (2018). Vicomtech at BARR2: Detecting biomedical abbreviations with ML methods and dictionary-based heuristics. In Proceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018) co-located with 34th Conference of the Spanish Society for Natural Language Processing (SEPLN 2018)

Estarrona A. and Aldezabal I. (2018): Towards a Spatial Annotation Scheme for Basque based on ISO-Space. Proceedings of the Second Workshop on Corpus-Based Research in the Humanities (CRH-2), 75-84. 25-26 January, Vienna, Austria. (ISBN: 978-3-901716-43-0). 2018. WP2

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Arantza Casillas, Arantza Díaz de Ilarraza, Nerea Ezeiza, Maite Oronoz, Alicia Pérez-Ramírez, Olatz Perez de Viñaspre (2018) Automatic Misogyny Identification Using Neural NetworksProceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018). ISSN 1613-073. Vol-2150. Pages 249-254.

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Arantza Casillas, Arantza Díaz de Ilarraza, Nerea Ezeiza, Maite Oronoz, Alicia Pérez, Olatz Perez de Viñaspre (2018) A Hybrid Approach For Automatic Disability AnnotationProceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018).

Goularte FB, Nassar SM, Fileto R, Saggion H. A text summarization method based on fuzzy rules and applicable to automated assessment. Expert systems with applications 2019; 115: 264-275.  

Gómez Guinovart, Xavier, Miguel Anxo Solla Portela and Xosé María Gómez Clemente (2018): O procesamento de terminoloxía no WordNet do galego. In Manuel Núñez Singala (ed.), Terminoloxía e normalización: Actas XII Xornada Científica Realiter, 131-147. Universidade de Santiago de Compostela, Santiago de Compostela. ISBN 978-84-16954-79-7.

Gonzalez-Dios I., Álvez J.and Rigau G. Exploiting Metonymy from Available Knowledge Resources. To appear in the 20th International Conference on Intelligent Text Processing and Computational Linguistics, CICLing. France. 2019.

Noelia Migueles-Abraira, Rodrigo Agerri and Arantza Diaz de Ilarraza (2018).Annotating Abstract Meaning Representations for Spanish. InProceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), 7-12 May, 2018, Miyazaki, Japan.

Alicia Pérez-Ramírez, Aitziber Atutxa, Arantza Casillas, Koldo Gojenola, Álvaro Sellart (2018) Inferred joint multigram models for medical term normalization according to ICDInternational Journal of Medical Informatics. Volume 110, ISSN: 1386-5056. February 2018, Pages 111–117.

Perez-Miguel N., Cuadros M. and Rigau G.Biomedical term normalization of EHRs with UMLS. In Proceedings of the 11th Language Resources and Evaluation Conference (LREC'18). Miyazaki, Japan 2018.

Álvaro Rodrigo, Jesús Herrera, Anselmo Peñas. The effect of answer validation on the performance of Question-Answering systems.Expert Systems with Applications 116, pages 351-363. 2019 (accepted and available online in september 2018). ISSN: 0957-4174. https://doi.org/10.1016/j.eswa.2018.09.014

Álvaro Rodrigo, Anselmo Peñas, Yusuke Miyao, Noriko Kando: Do systems pass university entrance exams?Information Processing and Management 54(4), pages 564-575. ISSN: 0306-457. 2018. https://doi.org/10.1016/j.ipm.2018.03.002.

Simões, Alberto and Xavier Gómez Guinovart (2018): Extending the Galician Wordnet Using a Multilingual Bible Through Lexical Alignment and Semantic Annotation. In Pedro Rangel Henriques, José Paulo Leal, António Menezes Leitão and Xavier Gómez Guinovart (eds.): 7th Symposium on Languages, Applications and Technologies (SLATE 2018) (ISBN: 978-3-95977-072-9), Schloss Dagstuhl/Leibniz-Zentrum fuer Informatik, Dagstuhl (Germany), pp. 14:1-14:13. DOI: http://dx.doi.org/10.4230/OASIcs.SLATE.2018.14.

Štajner, S., Saggion, H., Ponzetto, S.P. Improving lexical coverage of text simplification systems for Spanish. Expert systems with applications 2019; 118: 80-91.  

Mark Stevenson, Eneko Agirre (2018). Word Sense DisambiguationThe Oxford Handbook of Computational Linguistics 2nd edition (2 ed.) Edited by Ruslan Mitkov. Oxford. ISBN: 9780199573691. DOI of the chapter:10.1093/oxfordhb/9780199573691.013.28

2017

 

  • Ahmed AbuRa'ed, Horacio Saggion and Luis Chiruzzo. What Sentence are you Referring to and Why? Identifying Cited Sentences in Scientific Literature. Recent Advances in Natural Language Processing, RANLP 2017.  (accepted)

  • Pablo Accuosto, Francesco Ronzano, Daniel Ferrés, Horacio Saggion. 2017.
    Multi-level mining and visualization of scientific text collections. In Pro-
    ceedings of The 6st International Workshop on Mining Scientific Publications.
    Joint Conference on Digital Libraries, Toronto, Canada, June 2017 (JCDL’17)

  • Francesco Barbieri, Miguel Ballesteros, Horacio Saggion. Are Emojis Predictable? Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 105–111, Valencia, Spain, April 3-7, 2017. Association for Computational Linguistics. 2017.

  • Francesco Barbieri, Luis Espinosa-Anke, Miguel Ballesteros, Juan Soler-Company, Horacio Saggion.  Towards the Understanding of Gaming Audiences by Modeling Twitch Emotes. The 3rd Workshop on Noisy User-generated Text (W-NUT), September 7th, Copenhagen at EMNLP 2017. (accepted)
  • Núria Bel, Jorge Diz-Pico, Montserrat Marimon, Joel Pocostales (2017) Classifying short texts for a Social Media monitoring system, Procesamiento del Lenguaje Natural, Revista nº 59, 57-64, ISSN: 1135-5948.
  • Kepa Bengoetxea, Aitziber Atutxa, Mikel Iruskieta (2017). Un detector de la unidad central de un texto basado en técnicas de aprendizaje automático en textos cientı́ficos para el euskera. Procesamiento del Lenguaje Natural 58: 37-44

  • Comelles, E., & Atserias, J. (2016). Through the Eyes of VERTa. Procesamiento del Lenguaje Natural, 57, 181-184.

  • Espinosa-Anke, L., Oramas S., Saggion H., & Serra X. (2017).  ELMDist: A vector space model with words and MusicBrainz entities. Workshop on Semantic Deep Learning (SemDeep), collocated with ESWC 2017.

  • Daniel Ferrés, Ahmed AbuRa'ed, Horacio Saggion. Spanish Morphological Generation with Wide-Coverage Lexicons and Decision Trees. Procesamiento del Lenguaje Natural 58: 109-116.  2017.

  • Daniel Ferrés, Horacio Saggion, and Xavier Gómez Guinovart. An Adaptable Lexical Simplification Architecture for Major Ibero-Romance Languages.  Proceedings of the Building Linguistically Generalizable NLP Systems Workshop of EMNLP 2017 (accepted), 2017

  • Lopez-Gazpio I., Maritxalar M., Gonzalez-Agirre A., Rigau G., Uria L., Agirre E., Interpretable Semantic Textual Similarity: Finding and explaining differences between sentences. Knowledge-Based Systems, Volume 119, Pages 186-199, ISSN 0950-7051,http://dx.doi.org/10.1016/j.knosys.2016.12.013. 2017.

  • Marimon, Montserrat, Vivaldi, Jorge;  Bel, Núria (2017) Annotation of negation in the IULA Spanish Clinical Record Corpus. Proceedings of the Workshop Computational Semantics Beyond Events and Roles. SemBEaR 2017.

  • Horacio Saggion, Daniel Ferrés, Leen Sevens, Ineke Schuurman, Marta Ripolles, and Olga Rodriguez (2017). Able to Read My Mail: An Accessible e-Mail Client with Assistive Technology. In: Web For All (W4A) 2017 – The Future of Accessible Work, Perth (Australia), April 2-4. Best Communication Paper Award

  • Horacio Saggion, Francesco Ronzano, Pablo Accuosto and Daniel Ferrés. MultiScien: a Multilingual Natural Language Processing System for Mining and Enrichment of Scientific Collections. In Proceedings of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2017). (accepted)

  • Perez, Naiara, and Montse Cuadros. "Multilingual CALL Framework for Automatic Language Exercise Generation from Free Text." EACL 2017 (2017): 49.

 

2016