Syllable Corpus

The Syllable Corpus is a peculiar corpus that features syllable tagging for Turkish. The corpus includes 5 million 714 thousand and 422 unique words. Each word has hyphenated and each syllable is tagged with a special tag set developed by TS corpus.
The main idea behind the corpus is calculating syllable frequency and building an index of valid syllables of Turkish.

 

0
Million Tokens
0
Syllables

If you have registered to TS Corpus

Login Now

If you haven’t registered you can sign up now

Sign Up Now