Review:
Corpora Such as the Corpus of Historical American English (COHA)
Overall review score: 4.5 out of 5
⭐⭐⭐⭐⭐
The Corpus of Historical American English (COHA) is a large and comprehensive digital collection of written texts that span from the 1810s to the 2000s. It provides researchers with access to a diachronic corpus of American English, enabling detailed linguistic, cultural, and historical analyses of language change over nearly two centuries.
Key Features
- Extensive temporal coverage from the 1810s to the 2000s
- Contains over 400 million words across a range of genres including fiction, newspapers, magazines, and non-fiction books
- Structured into decade-based sub-corpora for decade-by-decade or period-specific analysis
- Facilitates research in diachronic linguistics, lexicography, cultural studies, and more
- Publicly accessible via online interfaces and data repositories for academic use
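To illustrate the kind of analysis the decade-based structure supports, here is a minimal Python sketch that groups texts by decade and computes a word's frequency per million tokens. It does not use COHA's actual file format or query interface: the function name `per_million_by_decade`, the `(year, text)` input shape, and the toy sentences are all assumptions made for illustration, and the whitespace tokenizer is deliberately crude.

```python
from collections import Counter

def per_million_by_decade(docs, target):
    """Return {decade: occurrences of `target` per million tokens}.

    `docs` is an iterable of (year, text) pairs. This input shape is
    hypothetical; it is not the format COHA data is distributed in.
    """
    hits = Counter()    # occurrences of the target word per decade
    totals = Counter()  # total token count per decade
    target = target.lower()
    for year, text in docs:
        decade = (year // 10) * 10  # e.g. 1815 -> 1810
        tokens = text.lower().split()  # crude whitespace tokenization
        totals[decade] += len(tokens)
        # Strip common punctuation before comparing to the target word.
        hits[decade] += sum(
            1 for tok in tokens if tok.strip('.,;:!?"\'()') == target
        )
    # Normalize so decades of different sizes are comparable.
    return {
        d: hits[d] / totals[d] * 1_000_000
        for d in sorted(totals)
        if totals[d]
    }

# Toy data, invented for this example only.
sample = [
    (1815, "The carriage was late."),
    (1818, "A carriage and a horse."),
    (1995, "The car was late again."),
]
rates = per_million_by_decade(sample, "carriage")
```

Normalizing per million tokens matters because the decade sub-corpora differ in size, so raw counts alone would overstate usage in the larger decades.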
Pros
- Provides invaluable insights into historical language usage and evolution
- Rich, diverse datasets support detailed linguistic research
- User-friendly interface for accessing and querying data
- Supports multiple scholarly disciplines beyond linguistics
Cons
- Limited to written texts, with no spoken or conversational data
- Requires some familiarity with corpus linguistics to maximize utility
- Potential gaps in the earliest periods, where fewer source texts are available
- Access may be limited by institutional subscriptions or data availability