jasonbaldridge edited this page Jan 24, 2013 · 22 revisions

Books

Natural Language Processing

Machine Learning

Scala

Visualization

Other

Scala tutorials

Applications and Toolkits

Note: These lists are certainly not exhaustive. Suggestions for additions are welcome!

Natural language processing and machine learning

  • Chalk: my evolving Scalafication of OpenNLP (see below)
  • Nak: a Java/Scala machine learning toolkit (forked from OpenNLP and used by Chalk)
  • OpenNLP: a Java toolkit for NLP
  • Breeze: a Scala toolkit for NLP and machine learning
  • Mallet: a Java toolkit for NLP that includes many classifiers and topic modeling
  • Junto: a Java/Scala toolkit for label propagation

Other useful and relevant software

  • Hadoop: an open-source MapReduce implementation and related tools
  • Scoobi: a Scala wrapper for Hadoop
  • Spark: a Scala system for distributed computation (similar to Hadoop, but better for iterative algorithms)
  • Storm: a real-time, scalable, distributed processing system
  • Akka: a Java/Scala system for concurrent, distributed computing

Interesting and relevant programming blog posts

Interesting and relevant media
