Text mining, also known as computational text analysis, is a method where a researcher uses computational tools to analyze a large set of texts (a text corpus). Text mining can be used to discover patterns or deviations in a set of texts, examine relationships between documents or ideas, analyze sentiment, or track changes in texts over time.
To see text mining in action, check out America's Public Bible or Mining the Dispatch.
Completing a text mining project can be broken down into three overarching steps. This is just an overview; the steps themselves are broken down further on the Processing Text page.