A Python library that automatically detects lexical borrowings (or loanwords) in Spanish
COrpus of AngLicisms in the SpAnish PresS. With Constantine Lignos
An observatory of anglicism usage in the Spanish press.
A Twitter bot that tweets new anglicisms found in the Spanish press.
A PyTorch model that classifies Spanish text as being easy to read (plain language) or not.
Analysis and visualizations in Python of a corpus of Spanish political speeches from 1937 to 2019.
Named Entity Recognition for podcast transcripts. With Julian Fernandez, Kristen Sheets and Linxuan Yang.
A corpus of Spanish subtitles from LOTR, Star Wars, OITNB, GoT, HIMYM, etc.
A corpus linguistics project supported by Fundeu on the evolution of the Spanish language on the media during the 20th century. With Leticia Martín-Fuertes and Molino de Ideas.
A rule-based automatic language detector based on the syllable structure of words. Current supported languages: Spanish, French, Italian, Portuguese, Catalan, Latin and Basque.