The Share of Minimal Pairs for Word Forms and Lemmas
DOI:
https://doi.org/10.3986/jz.v15i1-2.2422Abstract
Minimal pairs differ by only a single phoneme (e.g., pear/bear). This article uses words from the index of the text corpus Nova beseda (New Word; 240 million running words) and lemmas from the web resource Besede slovenskega jezika (Slovenian Words; 356,000 entries) to calculate the share of minimal pairs with regard to near-minimal pairs in which words differ by two letters, and among all possible word pairs of equal length. The share increases with word length and is also significantly greater for word forms than for lemmas.Downloads
References
Gložančev idr. 2009 = Alenka Gložančev idr. 2009, Novejša slovenska leksika (v povezavi s spletnimi jezikovnimi viri), Ljubljana: Založba ZRC, 2009.
Jakopin 1995 = Primož Jakopin, EVA – a Textual Data Processing Tool, TELRI Newsletter 2, December 1995, 13.
Jakopin 2001 = Primož Jakopin, Words and nonwords as basic units of a newspaper text corpus, COMPLEX 2001 / 6th Conference on Computational Lexicography and Corpus Research »Computational Lexicography and New EU Languages«, University of Birmingham, 49–65
Jakopin – Michelizza 2009 = Primož Jakopin – Mija Michelizza, Besedilni korpus Nova beseda, Mostovi 41 (2007/08), št. 1–2, 165–176.
Orešnik 2008 = Janez Orešnik, Natural syntax: English reported speech, Studia Anglica Posnaniensia 44 (2008), 218–252.
Sinclair 1991 = John Sinclair, Corpus, Concordance, Collocation, Oxford: Oxford University Press, 1991.
SSKJ 1 = Slovar slovenskega knjižnega jezika 1, Ljubljana: DZS, 1970.
Downloads
Published
How to Cite
Issue
Section
License
Authors guarantee that the work is their own original creation and does not infringe any statutory or common-law copyright or any proprietary right of any third party. In case of claims by third parties, authors commit their self to defend the interests of the publisher, and shall cover any potential costs.
More in: Submission chapter