Demo

pylazaro

A Python library that automatically detects lexical borrowings (or loanwords) in Spanish.

COALAS 🐨

COrpus of AngLicisms in the SpAnish PresS. With Constantine Lignos.

Observatorio Lázaro

An observatory of anglicism usage in the Spanish press.

@LazaroBot

A Twitter bot that tweets new anglicisms found in the Spanish press.

Caravaggio

A PyTorch model that classifies Spanish text as being easy to read (plain language) or not.

Corpus of political speeches

Analysis and visualizations in Python of a corpus of Spanish political speeches from 1937 to 2019.

NER4Podcasts

Named Entity Recognition for podcast transcripts. With Julian Fernandez, Kristen Sheets and Linxuan Yang.

Subtitles Corpus

A corpus of Spanish subtitles from LOTR, Star Wars, OITNB, GoT, HIMYM, etc.

Aracne

A corpus linguistics project, supported by Fundeu, on the evolution of the Spanish language in the media during the 20th century. With Leticia Martín-Fuertes and Molino de Ideas.

AZRAEL

A rule-based automatic language detector based on the syllable structure of words. Currently supported languages: Spanish, French, Italian, Portuguese, Catalan, Latin and Basque.