Representative Corpora
American English
The Open American National Corpus
A massive electronic collection of American English, including texts of all genres and transcripts of spoken data produced from 1990 onward. All data and annotations are fully open and unrestricted for any use.
Available Data and Annotations
OANC: 15 million words of contemporary American English with automatically produced annotations for a variety of linguistic phenomena.
MASC: 500,000 words of OANC data equally distributed over 19 genres of American English, with manually produced or validated annotations for several layers of linguistic phenomena.
English
Chinese
German
Swedish