The American National Corpus (ANC) is more or less like the British one, both work in a very similar way.
American National Corpus (ANC) project creates a huge electronic collection of American English,that include texts of all types of genres and transcripts of spoken American English from 1990 to the actual date. Anyone can contribute to the system adding text and transcripts. The American National Corpus is created to provide a more comprehensive picture of the American English, and to serve as a resource for students, linguistic and lexicographic research, and technology development.
It is a text corpus of American English currently containing 22 million words written and spoken data produced since 1990. The ANC includes a range of genres comparable to the British National Corpus and is annotated for part of speech and lemma, shallow parse, and named entities. The ANC will contain a core corpus of at least 100 million words, including both written and spoken data comparable across genres to the BNC.
Its First Release was published in 2003, which includes over 11 million words. Nevertheless, it is not a balanced corpus. The Second Release contains over 22,000,000 words with annotated for lemma, part of speech, noun and verb chunks.
Randi Reppen, professor at the University of Arizona, is the project manager. he is helped by a group of nine advisors and a Steering Committee.The Technical Director is Nancy Ide and the Research Associate is Keith Suderman.

Source:
- ANC. American National Corpus (2002-2009) from http://www.anc.org/index.html
- Wikipedia, La Enciclopedia libre from http://en.wikipedia.org/wiki/American_National_Corpus
Comentarios recientes