Corpus Paisà

A large (250 million tokens) corpus of authentic Italian contemporary texts from the web, freely available and freely distributable, fully annotated in CoNNL format, and openly accessible and searchable through an advanced, learner-oriented interface.


ContactObjectsActivitiesTechniquesDisciplines
Simonetta MontemagniDigital HumanitiesDisseminationLinked open data > Enrichment-Annotation; Dissemination-PublishingLinguistics