764 Million Corpus
This is a “general purpose” corpus, that contains over 764 million tokens harvested from on-line sources. The data is a part of a larger corpus that we are still working on. The corpus is still in beta version therefore not open to public access.
Please contact with administrator for access request.