The Share of Minimal Pairs for Word Forms and Lemmas

Authors

  • Primož Jakopin Inštitut za slovenski jezik Frana Ramovša ZRC SAZU

DOI:

https://doi.org/10.3986/jz.v15i1-2.2422

Abstract

Minimal pairs differ by only a single phoneme (e.g., pear/bear). This article uses words from the index of the text corpus Nova beseda (New Word; 240 million running words) and lemmas from the web resource Besede slovenskega jezika (Slovenian Words; 356,000 entries) to calculate the share of minimal pairs with regard to near-minimal pairs in which words differ by two letters, and among all possible word pairs of equal length. The share increases with word length and is also significantly greater for word forms than for lemmas.

Downloads

Download data is not yet available.

References

Gložančev idr. 2009 = Alenka Gložančev idr. 2009, Novejša slovenska leksika (v povezavi s spletnimi jezikovnimi viri), Ljubljana: Založba ZRC, 2009.

Jakopin 1995 = Primož Jakopin, EVA – a Textual Data Processing Tool, TELRI Newsletter 2, December 1995, 13.

Jakopin 2001 = Primož Jakopin, Words and nonwords as basic units of a newspaper text corpus, COMPLEX 2001 / 6th Conference on Computational Lexicography and Corpus Research »Computational Lexicography and New EU Languages«, University of Birmingham, 49–65

Jakopin – Michelizza 2009 = Primož Jakopin – Mija Michelizza, Besedilni korpus Nova beseda, Mostovi 41 (2007/08), št. 1–2, 165–176.

Orešnik 2008 = Janez Orešnik, Natural syntax: English reported speech, Studia Anglica Posnaniensia 44 (2008), 218–252.

Sinclair 1991 = John Sinclair, Corpus, Concordance, Collocation, Oxford: Oxford University Press, 1991.

SSKJ 1 = Slovar slovenskega knjižnega jezika 1, Ljubljana: DZS, 1970.

Published

2015-07-29

How to Cite

Jakopin, P. (2015). The Share of Minimal Pairs for Word Forms and Lemmas. Jezikoslovni Zapiski, 15(1-2). https://doi.org/10.3986/jz.v15i1-2.2422