Syllable Corpus

The Syllable Corpus is a distinctive corpus that provides syllable tagging for Turkish. It contains 5,714,422 unique words. Each word is hyphenated into its syllables, and each syllable is tagged with a special tag set developed by TS Corpus.
The main aim of the corpus is to calculate syllable frequencies and to build an index of the valid syllables of Turkish.
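
As a rough illustration of that aim, the sketch below counts syllable frequencies and derives a syllable index from a few hyphenated words. It assumes that syllables are separated by a plain hyphen, which is only a guess at the corpus format. The function name `syllable_frequencies` and the sample words are hypothetical, and the TS Corpus tag set is not modelled here.

```python
from collections import Counter


def syllable_frequencies(hyphenated_words):
    """Count how often each syllable occurs across hyphenated words.

    Assumption: syllables are separated by "-" (e.g. "ki-tap").
    """
    counts = Counter()
    for word in hyphenated_words:
        # Skip empty pieces produced by stray or doubled hyphens.
        counts.update(s for s in word.split("-") if s)
    return counts


# Hypothetical sample input, not taken from the corpus itself.
sample = ["ki-tap", "ki-tap-lar", "a-ra-ba", "ka-lem-ler"]

freq = syllable_frequencies(sample)

# The index is simply the sorted set of distinct syllables observed.
syllable_index = sorted(freq)

for syllable, count in freq.most_common(3):
    print(syllable, count)
```

A `Counter` keeps the frequency table and the index in one structure: the keys form the index and the values give the frequencies. Real Turkish data would also need locale-aware case folding (for example `I`/`ı` and `İ`/`i`), which this sketch leaves out.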
