Would a corpus like wikipedia or Project Gutenberg be appropriate for you purpose ? Both are freely and easily accessible. <http://dumps.wikimedia.org/backup-index.html> and <http://www.gutenberg.org/wiki/Gutenberg:Feeds#The_Complete_Project_Gutenberg_Catalog>. Eric.