Computational-Linguistics is a project mainly written in JAVA and RUBY, based on the View license.
Collocation extraction from text corpora
== ComputationalLinguistics
Aim of this project is to extract collocations from Brown text corpus.
Steps to do this:
ToDo: 1 (possible). build a hash from NN_NN, JJ_NN, VB_NN pairs. the key is the first word in a collocation (head) second is array of tails. 2+. split line into words. 3+. split word into 4+. iterate through the corpus.
Real steps done:
Fixes:
You should document your project here.
TODO: