TS Columns Corpus

A collection of 25,000 columns from the Turkish press, distributed equally between female and male authors. The corpus covers a 10-year period and allows users to restrict queries by the author's gender, the date, and the source.
Four pre-prepared sub-corpora are available to users.

The corpus, like other corpora released by the project, features part-of-speech tagging and morphological annotation.

Million Tokens
Word Types
