Saturday, 17 March 2012

MONK in the Library

Abstract: The MONK Project (Metadata Offer New Knowledge) has developed a collection of literary texts in English from about 1600 to 1900, in a variety of genres, drawn from both commercial and public domain sources, totaling 150 million words. The texts have been brought into a uniform XML format, part-of-speech tagged, and ingested into a database, with a user interface that facilitates statistical analysis of texts and sub-collections. In the current phase of the project, MONK is being brought up as a

