Some Details from the Text Corpus Nova beseda
DOI:
https://doi.org/10.3986/jz.v9i2.2594Abstract
In the paper some interesting data about the distribution of letters, words and sentences from the text corpus at the Corpus Laboratory of the Fran Ramovš Institute of the Slovenian Language are revealed. The corpus, Nova beseda, is the main freely accessible online source for the quantitative research of Slovenian language (http://bos.zrc-sazu.si) and currently consists of 100 million running words, mainly newspaper texts and fiction.Downloads
Download data is not yet available.
Downloads
Published
2015-08-12
How to Cite
Jakopin, P. (2015). Some Details from the Text Corpus Nova beseda. Jezikoslovni Zapiski, 9(2). https://doi.org/10.3986/jz.v9i2.2594
Issue
Section
Articles
License
Authors guarantee that the work is their own original creation and does not infringe any statutory or common-law copyright or any proprietary right of any third party. In case of claims by third parties, authors commit their self to defend the interests of the publisher, and shall cover any potential costs.
More in: Submission chapter