Syllable Corpus
The Syllable Corpus is a distinctive corpus that provides syllable tagging for Turkish. The corpus includes 5,714,422 unique words. Each word is hyphenated (split into its syllables), and each syllable is tagged with a special tag set developed by TS Corpus.
The main purpose of the corpus is to calculate syllable frequencies and to build an index of valid Turkish syllables.
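As a rough illustration of the frequency and index idea, the following minimal sketch counts syllables across a handful of hyphenated words. It assumes that syllables are separated by a plain hyphen ("-"); the actual file format and tag set of the corpus are not described here, so the sample words and the function name syllable_frequencies are hypothetical and not part of the corpus itself.

```python
from collections import Counter


def syllable_frequencies(hyphenated_words):
    """Count how often each syllable occurs in an iterable of hyphenated words.

    Assumption: syllables are separated by "-" (e.g. "ki-tap").
    The real corpus format may differ.
    """
    counts = Counter()
    for word in hyphenated_words:
        # Skip empty pieces that stray or doubled hyphens would produce.
        counts.update(s for s in word.split("-") if s)
    return counts


# Hypothetical sample input, not taken from the corpus.
sample = ["ki-tap", "ka-lem", "ki-lim", "ka-pı"]

freq = syllable_frequencies(sample)

# A simple "index of valid syllables": the sorted set of observed syllables.
syllable_index = sorted(freq)

for syllable, count in freq.most_common():
    print(syllable, count)
```

Run over the full word list, the same counter would give corpus-wide syllable frequencies, and its keys would form the syllable index.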