Some Details from the Text Corpus Nova beseda

Authors

  • Primož Jakopin

DOI:

https://doi.org/10.3986/jz.v9i2.2594

Abstract

This paper presents some interesting data on the distribution of letters, words and sentences in the text corpus at the Corpus Laboratory of the Fran Ramovš Institute of the Slovenian Language. The corpus, Nova beseda (http://bos.zrc-sazu.si), is the main freely accessible online source for quantitative research on the Slovenian language and currently consists of 100 million running words, mainly from newspaper texts and fiction.

Published

2015-08-12

How to Cite

Jakopin, P. (2015). Some Details from the Text Corpus Nova beseda. Jezikoslovni Zapiski, 9(2). https://doi.org/10.3986/jz.v9i2.2594