No. of contract:
Standard Slovene is well researched and well supported by linguistic resources and tools. However, there are no representative corpora for the study of nonstandard language and no tools for its analysis and processing, and the characteristics of nonstandard language are hardly ever included in language descriptions, textbooks or school curricula.
The project aims to close this gap by developing an infrastructure and methodology for the analysis of user-generated content in (mostly nonstandard) Slovene. It combines state-of-the-art methods from corpus and computational linguistics to enable a comprehensive study of a segment of the Slovene language that is changing rapidly and gaining importance in all our activities, but that has so far been ignored for various reasons.