I am Elena. I am an Assistant Professor (profesora ayudante doctora) in the Department of Linguistics at Universidad Autónoma de Madrid. My research interests lie at the intersection of language and technology, specifically using computational linguistics to study language contact, lexical borrowing and linguistic change over time. As part of this work, I developed Observatorio Lázaro, a continuous pipeline that monitors the Spanish press daily and has cataloged over 1.8 million instances of anglicisms since 2020.
My academic background includes an MS in Computational Linguistics from Brandeis University and a PhD in NLP completed within the UNED NLP&IR research group, where I focused on lexical borrowing identification under the supervision of Julio Gonzalo and Constantine Lignos. Prior to that, I worked for a decade as a language technology specialist at institutions such as the Information Sciences Institute at the University of Southern California, Fundéu, Molino de Ideas and UNED Digital Humanities Lab.
My research has been recognized and supported by several institutions. I am the recipient of the Adam Kilgarriff Prize (2022), the Generation Google Scholarship for Women in Computer Science (2021), the Premio HDH (2021), and a LaCaixa Scholarship (2018). I am also an honorary member (socia de honor) of ASETRAD.
In addition to my academic research, I am actively involved in public outreach and science communication. I write a regular language column for the Spanish newspaper elDiario.es, for which I received the Miguel Delibes National Journalism Award (Premio Nacional de Periodismo Miguel Delibes) in 2017. I also serve on the editorial board of the linguistics magazine Archiletras, where I am a regular contributor, and I am the author of the pop-linguistics book Anatomía de la Lengua (2016).
PhD in Natural Language Processing
UNED
MS in Computational Linguistics
Brandeis University
BA in Linguistics
Universidad Complutense de Madrid (UCM)
An observatory of anglicism usage in the Spanish press.
A shared task on automatic detection of borrowings at IberLEF 2025 and 2021. Organized with Luis Espinosa Anke, Julio Gonzalo, Constantine Lignos and Jordi Porta.
A Python library that automatically detects lexical borrowings (or loanwords) in Spanish
COrpus of AngLicisms in the SpAnish PresS. With Constantine Lignos
A Twitter bot that tweets new anglicisms found in the Spanish press.
Analysis and visualizations in Python of a corpus of Spanish political speeches from 1937 to 2019.
A corpus linguistics project supported by Fundeu on the evolution of the Spanish language on the media during the 20th century. With Leticia Martín-Fuertes and Molino de Ideas.
A rule-based automatic language detector based on the syllable structure of words. Current supported languages: Spanish, French, Italian, Portuguese, Catalan, Latin and Basque.
I occasionally write a column about language for Spanish newspaper elDiario.es, a column that was awarded with the Miguel Delibes National Journalism Award (Premio Nacional de Periodismo Miguel Delibes) in 2017 for an article about conceptual metaphor and cancer (Metáforas peligrosas. El cáncer como lucha).
I am also a member of the editorial board and a contributor for Archiletras, a pop Linguistics magazine where I have written several columns and interviewed leading figures in Linguistics, such as Ignacio Bosque, Juan Carlos Moreno Cabrera, Felisa Verdejo, Olimpia Andrés, Gretchen McCulloch and Noam Chomsky.
I am the author of the pop linguistics book Anatomía de la Lengua (2016).
From 2012 to 2015 I was a radio contributor at Spanish National Radio (RNE) on a weekly section about language and Linguistics.
Some of my personal writing can be read in my old blog (in Spanish).
These are the columns and other journalistic contributions I have written so far: