Representative Corpora
The Open American National Corpus (OANC) is a massive electronic collection of American English, including texts of all genres and transcripts of spoken data produced from 1990 onward. All data and annotations are fully open and unrestricted for any use.
Available Data and Annotations
OANC: 15 million words of contemporary American English with automatically produced annotations for a variety of linguistic phenomena.
MASC: 500,000 words of OANC data equally distributed over 19 genres of American English, with manually produced or validated annotations for several layers of linguistic phenomena.