764 Million Corpus

This is a general-purpose corpus that contains over 764 million tokens harvested from online sources. The data is part of a larger corpus that we are still working on. The corpus is still in beta and is therefore not open to public access.
Please contact the administrator to request access.

