JANES - Resources, Tools and Methods for the Research of Nonstandard Internet Slovene
Viri, orodja in metode za raziskovanje nestandardne spletne slovenščine

No. of contract:

from 01.07.2014 to 30.06.2017


Standard Slovene is well researched and supported with linguistic resources and tools. But there are no representative corpora for studies of nonstandard language, no tools for its analysis and processing, and characteristics of nonstandard language are hardly ever included in language descriptions, textbooks or school curricula.
The project aims to overcome this gap by developing an infrastructure and methodology for the analysis of user-generated content in (mostly nonstandard) Slovene. The project uses a combination of state-of-the-art methods from corpus and computational linguistics to enable a comprehensive study into a segment of the Slovene language, which is changing rapidly, gaining increasing importance in all our activities but has been, so far, ignored for various reasons.