Skip to content
Jason Baldridge edited this page Jan 10, 2013 · 22 revisions

Books

Natural Language Processing

Machine Learning

  • Bishop, Pattern Recognition and Machine Learning
  • Murphy, Machine Learning

Scala

  • Horstmann, Scala for the Impatient. (Free PDF of the first nine chapters is available for free from Typesafe)
  • Odersky et al, Programming in Scala (2nd edition) is excellent, though assumes one knows Java already. (Odersky is the creator of the Scala language.)
  • Wampler and Payne, Programming Scala is available for free online.
  • Pollack, Beginning Scala
  • For other options, see this list of books for learning Scala.

Other

  • White, Hadoop: The Definitive Guide
  • Manning et al, Introduction to Information Retrieval

Scala tutorials

Applications and Toolkits

Note: These lists are not meant to be exhaustive.

Natural language processing and machine learning

  • Chalk: my evolving Scalafication of OpenNLP (see below)
  • OpenNLP: a Java toolkit for NLP
  • Breeze: a Scala toolkit for NLP and learning
  • Mallet: a Java toolkit for NLP that includes many classifiers and topic modeling
  • Junto: a Java/Scala toolkit for label propagation
Clone this wiki locally