Elena Álvarez Mellado

Postdoctoral researcher

UNED NLP&IR group

¡Hola!

I’m Elena. I’m a computational linguist: I’m interested in Linguistics, technology and the intersection between them. I currently work as a postdoc researcher at the NLP&IR research group at UNED University, where I also did my PhD on lexical borrowing identification under the supervision of Julio Gonzalo and Constantine Lignos. I’m particularly interested in studying how we can use technology to understand language contact and language change. My research has led to the creation of Observatorio Lázaro, an observatory that automatically monitors anglicism usage in the Spanish press.

Prior to that, I spent a decade working on different language technology projects at various organizations, such as the Information Sciences Institute at the University of Southern California, Fundéu, Molino de Ideas, McLean Hospital and the UNED Digital Humanities Lab.

I am also highly involved in dissemination activities that bridge the gap between Linguistics and the general public: I write a column about language for the Spanish newspaper elDiario.es, which was awarded the Miguel Delibes National Journalism Award in 2017. I sometimes write for the linguistics magazine Archiletras, where I also serve as an editorial board member. In 2016 I wrote the pop linguistics book Anatomía de la Lengua.

Interests

Computational Linguistics
Natural Language Processing
Corpus Linguistics
Contact Linguistics

Education

PhD in Natural Language Processing

UNED

MS in Computational Linguistics

Brandeis University

BA in Linguistics

Universidad Complutense de Madrid (UCM)

Talks & media appearances

Radio interview at Cadena SER

Radio interview on language and linguistics at Cadena SER

Apr 14, 2025 12:00 AM Serendipias

Video

Lenguaje: entre computación y cognición

Interdisciplinary meeting organized by the Sociedad para el Estudio Multidisciplinar y Fundamental

Feb 24, 2025 6:00 PM Real Academia de Ciencias Exactas, Físicas y Naturales

Video

Digitalización e inteligencia artificial

Round table organized by Spanish newspaper elDiario.es on artificial intelligence

May 16, 2024 12:00 AM Jornada sobre Fondos europeos de elDiario.es

Video

Adam Kilgarriff Lecture at eLex 2023

Keynote on Lázaro Observatory and automatic detection of anglicisms at eLex 2023. Recipient of the Adam Kilgarriff Prize.

Jun 27, 2023 12:00 AM Electronic Lexicography in the 21st Century

Video

Socia de honor de Asetrad: acceptance speech

Honorary member (socia de honor) of Asetrad (Asociación Española de Traductores, Correctores e Intérpretes)

May 13, 2023 12:00 AM Congreso de Asetrad

Slides Video

Un tema al día: ¿Solo o sólo?

Interview for the daily podcast from elDiario.es Un tema al día with Juanlu Sánchez.

Mar 7, 2023 12:00 AM Podcast Un tema al día

Video

Radio interview at Noosfera

Radio interview at Noosfera on Linguistics and Computational Linguistics

Jan 23, 2023 12:00 AM Noosfera

Video

Experience

July 2021 – Present

Madrid, Spain

Research Staff

NLP & IR group, UNED

Researcher (first as a PhD student, now as a postdoc researcher) at the Natural Language Processing and Information Retrieval group at the School of Computer Science at UNED University.

June 2020 – June 2021

Research Programmer

Information Sciences Institute, University of Southern California

Programmer in Natural Language Processing at the USC ISI Center for Vision, Image, Speech and Text Analytics (VISTA).

June 2019 – December 2019

Massachusetts, USA

Research Data Analyst

McLean Hospital, Harvard Medical School

Applied Machine Learning and NLP techniques to electronic health records to predict early readmission risk.

June 2017 – June 2018

Madrid, Spain

Research Assistant

Digital Humanities Lab, UNED

Research Assistant at the Digital Humanities Lab of the School of Computer Science at UNED University.

December 2014 – December 2015

Madrid, Spain

Linguistic Data Analyst

Fundación del Español Urgente

Conducted a Corpus Linguistics project on the evolution of the Spanish language in the media during the 20th century.

October 2010 – January 2016

Madrid, Spain

Analytical Linguist

Molino de Ideas

Developed, annotated and evaluated linguistic resources for the Spanish language.

Publications

Elena Álvarez Mellado, Jordi Porta Zamorano, Constantine Lignos, Julio Gonzalo Arroyo (2025). Overview of ADoBo at IberLEF 2025: Automatic Detection of Anglicisms in Spanish. Procesamiento del Lenguaje Natural: Vol. 75.

PDF

Enrique Amigó, Elena Álvarez Mellado, Julio Gonzalo, Jorge Carrillo-de-Albornoz (2025). Evaluating Sequence Labeling on the basis of Information Theory. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025): Long Papers.

PDF

Sina Ahmadi, Micha David Hess, Elena Álvarez-Mellado, Alessia Battisti, Cui Ding, Anne Göhring, Yingqiang Gao, Zifan Jiang, Andrianos Michail, Peshmerge Morad, Joel Niklaus, Maria Christina Panagiotopoulou, Stefano Perrella, Juri Opitz, Anastassia Shaitarova, Rico Sennrich (2025). ConLoan: A Contrastive Multilingual Dataset for Evaluating Loanwords. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025): Long Papers.

PDF Code

Elena Álvarez-Mellado (2025). Lexical borrowing detection as a sequence labeling task. Data, modeling and evaluation methods for anglicism retrieval in Spanish. PhD dissertation, School of Computer Science, UNED.

PDF Slides

Elena Álvarez-Mellado, Julio Gonzalo (2024). Characterizing Spans for Sequence Labeling: A Case on Anglicism Detection. Procesamiento del Lenguaje Natural (SEPLN 2024): Vol. 73, pp. 235-246.

PDF

Andrew Rueda, Elena Álvarez Mellado, Constantine Lignos (2024). CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024).

PDF

Projects

ADoBo

A shared task on automatic detection of borrowings at IberLEF 2025 and 2021. Organized with Luis Espinosa Anke, Julio Gonzalo, Constantine Lignos and Jordi Porta.

pylazaro

A Python library that automatically detects lexical borrowings (or loanwords) in Spanish.

COALAS 🐨

COrpus of AngLicisms in the SpAnish PresS. With Constantine Lignos.

Observatorio Lázaro

An observatory of anglicism usage in the Spanish press.

@LazaroBot

A Twitter bot that tweets new anglicisms found in the Spanish press.

Corpus of political speeches

Analysis and visualizations in Python of a corpus of Spanish political speeches from 1937 to 2019.

Subtitles Corpus

A corpus of Spanish subtitles from LOTR, Star Wars, OITNB, GoT, HIMYM, etc.

Aracne

A corpus linguistics project supported by Fundéu on the evolution of the Spanish language in the media during the 20th century. With Leticia Martín-Fuertes and Molino de Ideas.

AZRAEL

A rule-based automatic language detector based on the syllable structure of words. Currently supported languages: Spanish, French, Italian, Portuguese, Catalan, Latin and Basque.

Writing & dissemination

I occasionally write a column about language for the Spanish newspaper elDiario.es, which was awarded the Miguel Delibes National Journalism Award (Premio Nacional de Periodismo Miguel Delibes) in 2017 for an article about conceptual metaphor and cancer (Metáforas peligrosas. El cáncer como lucha).

I also write for Archiletras, a pop Linguistics magazine where I’m also a member of the editorial board.

In 2016 I wrote the pop linguistics book Anatomía de la Lengua.

From 2012 to 2015 I was a radio contributor at Spanish National Radio (RNE), with a weekly section about language and Linguistics.

Some of my personal writing can be read on my old blog (in Spanish).

These are the columns and other journalistic contributions I have written so far:

Réquiem por un simio elDiario.es
¿Para qué estudiar sintaxis? elDiario.es
Nuevas palabras en el diccionario: hablemos de los cómos elDiario.es
La biblioteca está en llamas y los sabios no han llegado elDiario.es
Por sus ejemplos los conoceréis elDiario.es
Nosotros, las personas elDiario.es
Novedades en el diccionario: cinco claves lingüísticas elDiario.es
Evidencial, mi querido Watson elDiario.es
Yolanders, ayusers, errejoners elDiario.es
‘Monomarental’: del activismo al BOE elDiario.es
‘Sólo’, la tilde que se resiste a morir elDiario.es
Pulpos, loros y sistemas conversacionales elDiario.es
Ego elDiario.es
«La gramática es un gigantesco rompecabezas formado por un número relativamente pequeño de piezas». Entrevista a Ignacio Bosque en Archiletras
Donde habita el lenguaje elDiario.es
Todos decimos ‘mamá’ elDiario.es
Bizarro: «Las palabras no somos estancas, evolucionamos» elDiario.es
La fantasía de la España monolingüe elDiario.es
‘Verdul’ que te quiero ‘verdul’ elDiario.es
«Durante mucho tiempo se ha creído que sí, pero ser bilingüe no tiene ninguna desventaja». Entrevista a Esti Blanco Elorrieta en Archiletras
El arte de escribir titulares relevantes elDiario.es
Lenguaje inclusivo: algunas claves lingüísticas elDiario.es
Donde nacen las preposiciones elDiario.es
600 formas de mirar una pandemia elDiario.es
Las Humanidades según Vesalio Observatorio de Humanidades y Tecnología
¿Para qué sirven las lenguas? elDiario.es
Réquiem por un ‘cuyo’ elDiario.es
Radiografía del anglicismo en la prensa española Archiletras
Instrucciones para hacer una pregunta trampa elDiario.es
Hamburguesa de espinacas, leche de soja elDiario.es
La belleza de decir ‘los Rolling’ elDiario.es
Sobre la ortografía elDiario.es
Un cerdo a la izquierda elDiario.es
¿El covid o la covid? elDiario.es
Palabras monógamas elDiario.es
«El objetivo en ciencia es descubrir la teoría más simple para el dominio en cuestión». Entrevista a Noam Chomsky en Archiletras
«Los cambios del idioma no son para preocuparse». Entrevista a Gretchen McCulloch en Archiletras
La irresistible agramaticalidad del teclado predictivo Archiletras
La falacia etimológica Archiletras
Siete mitos sobre lengua que la Lingüística desmiente Archiletras
‘Marrona’ y los adjetivos ‘wannabe’ elDiario.es
Baia baia. La irreverencia ortográfica del meme elDiario.es
De ser un grammar nazi también se sale Archiletras
‘Preveyó’ y los cantos de sirena elDiario.es
Más allá de la economía del lenguaje elDiario.es
Las trabajadoras y la cooperación elDiario.es
El ‘consejo de ministras’ y el no de la RAE elDiario.es
James Rhodes, ‘ancabuela’ y la mirada extraterrestre elDiario.es
‘Aprovechategui’ y el arte del insulto elDiario.es
Palabras de ida y vuelta: ‘sororidad’ elDiario.es
La RAE y el lujo de definir elDiario.es
La plaga de las palabras siamesas elDiario.es
Sobre las ‘portavozas’ elDiario.es
Una escritura propia Revista Mujeres de eldiario.es
Norma lingüística, ¿para qué?, ¿para quién? Revista Entrelíneas de la UAH [pdf]
Ultras, sí. Pero ¿ultra qué? elDiario.es
El extraño caso de las décadas sin nombre elDiario.es
Nadie hablará de nosotras cuando nos eliminen del diccionario elDiario.es
‘Andó’ y el doloroso camino a la regularidad elDiario.es
De qué hablamos cuando hablamos de ‘consentir’ elDiario.es
Las lenguas como castigo elDiario.es
Metáforas peligrosas: el cáncer como lucha elDiario.es
‘Piolines’: el nacimiento de una palabra elDiario.es
‘Apología’ y las malas compañías elDiario.es
El poder delator de las comillas españolas elDiario.es
Madrid, campos de fútbol y otras formas de medir el mundo elDiario.es
“Iros”: el bueno, el feo y el malo elDiario.es
‘Invent’ y el arte de contar mentiras elDiario.es
Todas, tod@s, todxs, todes: historia de la disidencia gramatical elDiario.es
Elogio de ‘la calor’ elDiario.es
Dejad de pedirle a la RAE que elimine palabras elDiario.es
Comas: el infierno de la puntuación elDiario.es
Cuando “literalmente” ya no es literal elDiario.es
Teoría marica o el insulto como bandera elDiario.es
El mito de las palabras que no están en la RAE elDiario.es
No hablarás con acento andaluz en el telediario de las 9 elDiario.es
Señoría, no le entiendo elDiario.es
El peligroso arte de construir realidades con palabras elDiario.es
“Sólo” y la tilde de la nostalgia elDiario.es