A couple of days ago, I came across a note by Tim O’Reilly concerning the Open Text Mining Interface (OTMI). O’Reilly described it as a “copyright hack.” It seems this initiative was started by Timo Hannay, who has also blogged about it on the website of his employer, Nature magazine. The initiative itself is an attempt to respond positively to requests from indexers and data-miners for full-text versions of articles, but without at the same time making human-readable versions of the articles readily available free to non-subscribers. OTMI, an XML format, consists of “word vectors” plus “snippets” which amount, more or less, to all of the sentences in the article arranged alphabetically instead of in their original order. Links to samples are available in Hannay’s posting.