Saturday, January 27, 2007

Natural Language Processing - resources...

I offered to give a talk on "open source" and "natural language processing" in Linux Asia, and was doing a scan to find what resources are available. Here are some gems I found.

Projects in NLP

http://www.cs.mu.oz.au/research/lt/student-projects.html


Toolkits, etc
  • nltk.sourceforget.net - comprehensive NL toolkit including various datasets, etc.
  • BOW toolkit: http://www.cs.cmu.edu/~mccallum/bow/ [A Toolkit for Statistical Language Modeling, Text Retrieval, Classification and Clustering]
  • CLUTO - a clustering toolkit. www-users.cs.umn.edu/~karypis/cluto

No comments: