Elena Álvarez Mellado

Assistant professor

Department of Linguistics, UAM

¡Hola!

I am Elena. I am an Assistant Professor (profesora ayudante doctora) in the Department of Linguistics at Universidad Autónoma de Madrid, where I specialize in computational linguistics and natural language processing. My research interests lie at the intersection of language and technology, specifically using computational methods to study language contact, lexical borrowing and linguistic change over time. As part of this work, I developed Observatorio Lázaro, a pipeline that monitors the Spanish press daily and has cataloged 2 million instances of anglicisms since 2020.

I hold a MS in Computational Linguistics from Brandeis University and a PhD in NLP from UNED NLP&IR research group, where I focused on lexical borrowing identification under the supervision of Julio Gonzalo and Constantine Lignos. Prior to that, I worked for a decade as a language technology specialist at institutions such as the Information Sciences Institute at the University of Southern California, Fundéu, Molino de Ideas and UNED Digital Humanities Lab.

My research has been recognized and supported by several institutions. I am the recipient of the Adam Kilgarriff Prize (2022), the Generation Google Scholarship for Women in Computer Science (2021), the Premio HDH (2021), and a LaCaixa Scholarship (2018). I am also an honorary member (socia de honor) of ASETRAD.

In addition to my academic research, I am actively involved in public outreach and science communication. I write a regular language column for the Spanish newspaper elDiario.es, for which I received the Miguel Delibes National Journalism Award (Premio Nacional de Periodismo Miguel Delibes) in 2017. I also serve on the editorial board of the linguistics magazine Archiletras, where I am a regular contributor.

Interests

Computational Linguistics
Natural Language Processing
Contact Linguistics

Education

PhD in Natural Language Processing

UNED
MS in Computational Linguistics

Brandeis University
BA in Linguistics

Universidad Complutense de Madrid (UCM)

Selected Publications

🎓 View my full publications history on Google Scholar

Towards a Diagnostic and Predictive Evaluation Methodology for Sequence Labeling Tasks

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

Elena Álvarez Mellado, Julio Gonzalo

PDF

Overview of ADoBo at IberLEF 2025: Automatic Detection of Anglicisms in Spanish

Procesamiento del Lenguaje Natural (SEPLN 2025): Vol. 75

Elena Álvarez Mellado, Jordi Porta Zamorano, Constantine Lignos, Julio Gonzalo Arroyo

PDF

Evaluating Sequence Labeling on the basis of Information Theory

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025): Long Papers

Enrique Amigó, Elena Álvarez Mellado, Julio Gonzalo, Jorge Carrillo-de-Albornoz

PDF

ConLoan: A Contrastive Multilingual Dataset for Evaluating Loanwords.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025): Long Papers

Sina Ahmadi, Micha David Hess, Elena Álvarez-Mellado, Alessia Battisti, Cui Ding, Anne Göhring, Yingqiang Gao, Zifan Jiang, Andrianos Michail, Peshmerge Morad, Joel Niklaus, Maria Christina Panagiotopoulou, Stefano Perrella, Juri Opitz, Anastassia Shaitarova, Rico Sennrich

PDF Code

Lexical borrowing detection as a sequence labeling task. Data, modeling and evaluation methods for anglicism retrieval in Spanish

PhD dissertation, School of Computer Science, UNED

Elena Álvarez-Mellado

PDF Slides

Detecting Unassimilated Borrowings in Spanish: An Annotated Corpus and Approaches to Modeling

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022): Long Papers

Elena Álvarez Mellado, Constantine Lignos

PDF Code Slides Video

Projects

🛠️ View projects on GitHub

Observatorio Lázaro

An observatory of anglicism usage in the Spanish press. 2 million anglicisms automatically retrieved since 2020.

ADoBo

A shared task on automatic detection of borrowings at IberLEF 2025 and 2021. Organized with Luis Espinosa Anke, Julio Gonzalo, Constantine Lignos and Jordi Porta.

pylazaro

A Python library that automatically detects lexical borrowings (or loanwords) in Spanish

COALAS 🐨

COrpus of AngLicisms in the SpAnish PresS. With Constantine Lignos

@LazaroBot

A Twitter bot that tweets new anglicisms found in the Spanish press.

Corpus of political speeches

Analysis and visualizations in Python of a corpus of Spanish political speeches from 1937 to 2019.

Aracne

A corpus linguistics project supported by Fundeu on the evolution of the Spanish language on the media during the 20th century. With Leticia Martín-Fuertes and Molino de Ideas.

AZRAEL

A rule-based automatic language detector based on the syllable structure of words. Current supported languages: Spanish, French, Italian, Portuguese, Catalan, Latin and Basque.

Writing & Public Engagement

📰 View all columns

I am also actively involved in public outreach and science communication to make linguistics accessible to non-specialized audiences. I write for the Spanish national newspaper elDiario.es and for linguistics magazine Archiletras. I am also the author of the popular science book Anatomía de la Lengua (Larousse, 2016). From 2012 to 2015, I served as a weekly linguistics contributor on Spanish National Radio (Radio Nacional de España), with a section about linguistics. Some of my early essays are available on my old blog.

Selected Columns at elDiario.es

I have published more than 70 columns since 2017 for elDiario.es, covering topics on language change, linguistic purism, grammar, and the lexicon. My column Metáforas peligrosas. El cáncer como lucha, which explores how metaphors shape medical discourse and public perception of illness, was awarded the Miguel Delibes National Journalism Award (Premio Nacional de Periodismo Miguel Delibes) in 2017. Additionally, my piece ¿El covid o la covid? was selected as text for the commentary section at Catalonia’s university entrance exams (EvAU) in 2021.

Here is a selection of columns I am particularly proud of:

‘Sólo’ y la tilde de la nostalgia (2017)
Nadie hablará de nosotras cuando nos eliminen del diccionario (2017)
Un cerdo a la izquierda (2020)
Lenguaje inclusivo: algunas claves lingüísticas (2021)
¿Para qué sirven las lenguas? (2021)
Todos decimos ‘mamá’ (2022)
La biblioteca está en llamas y los sabios no han llegado (2024)

Editorial work & interviews at Archiletras

I serve on the editorial board of the linguistics magazine Archiletras, where I write pieces challenging language purism and debunking linguistics myths. I have also interviewed prominent figures in the field, including Noam Chomsky, Ignacio Bosque, Gretchen McCulloch, Felisa Verdejo, Juan Carlos Moreno Cabrera, and Olimpia Andrés.

Talks & media appearances

More Talks

Radio interview at Cadena SER

Radio interview on language and linguistics at Cadena SER

Apr 14, 2025 12:00 AM Serendipias

Video

Lenguaje: entre computación y cognición

Encuentro interdisciplinar organizado por la Sociedad para el Estudio Multidisciplinar y Fundamental

Feb 24, 2025 6:00 PM Real Academia de Ciencias Exactas Físicas Naturales

Video

Digitalización e inteligencia artificial

Round table organized by Spanish newspaper elDiario.es on artificial intelligence

May 16, 2024 12:00 AM Jornada sobre Fondos europeos de elDiario.es

Video

Adam Kilgarriff Lecture at eLex 2023

Keynote on Lázaro Observatory and automatic detection of anglicisms at eLex 2023. Recipient of the Adam Kilgarriff Prize.

Jun 27, 2023 12:00 AM Electronic Lexicography in the 21st Century

Video

Honorary member of Asetrad: acceptance speech

Asetrad (Asociación Española de Traductores, Correctores e Intérpretes)

May 13, 2023 12:00 AM Congreso de Asetrad

Slides Video

Experience

📎 See full cv

2026 – Present

Madrid, Spain

Assistant Professor

Department of Linguistics, Universidad Autónoma de Madrid

2025 – 2026

Madrid, Spain

Postdoctoral Researcher

UNED University

2021 – 2025

Madrid, Spain

PhD in Natural Language Processing

UNED University

Dissertation: “Lexical borrowing detection as a sequence labeling task. Data, modeling and evaluation methods for anglicism retrieval in Spanish”. Supervised by Julio Gonzalo and Constantine Lignos.

2020 – 2021

NLP Research Programmer

Information Sciences Institute, University of Southern California

Developed language technology frameworks and applied NLP pipelines at USC ISI Center for Vision, Image, Speech and Text Analytics (VISTA).

2018 – 2020

Massachusetts, US

MS in Computational Linguistics

Brandeis University

Master’s thesis: “Lázaro: An Extractor of Anglicisms in Spanish Newswire”. Advisor: Constantine Lignos.

2009 – 2019

Paris, Boston, Madrid

Language Technology Specialist

Industry & Research Labs

A decade of industry experience across various language-focused institutions, including Fundéu, Molino de Ideas, Eptica Lingway, McLean Hospital, and the UNED Digital Humanities Lab.

2005 – 2010

Madrid, Spain

BA in Linguistics

Universidad Complutense de Madrid (UCM)

Erasmus exchange year abroad at Université Paris 7 Denis Diderot. Final project: “AZRAEL: A-Z Reconocedor Automático de Español”.

Honors & awards

2026

Outstanding thesis award (Premio extraordinario de doctorado)

UNED

Best dissertation of the year (awarded to the top 10% doctoral students).

2025

Announcement

Honorable Mention (Accésit), Best Doctoral Dissertation

Instituto de Investigación de Tecnologías Lingüísticas Multilingües, Universidad de Málaga

Awarded by the University of Málaga’s Research Institute for Linguistic and Multilingual Technologies (IUITLM) for the best doctoral dissertation of the year in language technology.

2023

Announcement

Honorary member of Asetrad

Asociación Española de Traductores, Correctores e Intérpretes

Honorary member of the Spanish Association of Translators, Copy-editors and Interpreters.

2022

Announcement

Adam Kilgarriff Prize

Lexical Computing, Adam Kilgarriff Endowment Fund

Awarded every two years to a researcher under 40 for projects in the fields of corpus linguistics, computational linguistics and lexicography.

2022

Acceptance speech video

Premio Archiletras de la Lengua de investigación

Revista Archiletras

2022

Generation Google Scholarship for Women in Computer Science

Google

30 female graduate students in computer science selected in Europe, Middle East and Africa.

2021

List of recipients

Premio HDH 2021 (Hispanic Digital Humanities Award)

Asociación de Humanidades Digitales Hispánicas

2020

List of recipients

Outstanding Corpus Thesis Award (MS level)

Institute for Corpus Research, Incheon National University

2020

Karen Spärck Jones Award for Outstanding Achievement in Natural Language Processing

Brandeis University

2017

Acceptance speech video

Premio Nacional de Periodismo Miguel Delibes

Asociación de Prensa de Valladolid

2017

LaCaixa Scholarship for graduate studies in the US

LaCaixa Foundation

2009

First award of the Arquímedes National Contest for Young Researchers

Ministry of Science and Education of Spain

Contact

ealvarezmellado [at] gmail.com
Facultad de Filosofía y Letras, Despacho 301, módulo IV bis, c/ Francisco Tomás y Valiente, 1, 28049 Madrid
@lirondos
LinkedIn
GitHub
My CV in PDF