Sip is a project mainly written in Ruby, it's free.
hadoop ruby/streaming statistically improbable phrases
on the train project to attempt to copy amazons statistically improbable phrase calculations data from project gutenberg, runs using hadoop streaming with ruby map/reduce functions
see project page at http://matpalm.com/sip
bash>
coming soon: running in the cloud, when does hadoop become worth it...