summary

This is a translation of the WordCount example from the Apache Hadoop Map/Reduce Tutorial to scala. I ran into a few snags making this work myself so I thought I'd bundle up a working example and hopefully save other people some trouble.

I've tried to follow the java example as closely as possible in the scala version, so I haven't tried to impose any higher level of abstraction on the code, even though you can imagine building something much more expressive on top of this with scala.

I chopped out the extra argument parsing logic that was in the java example because I think it just obscures the point of the example. Adding it back to the scala version is left as an exercise for the reader.

Note: this example requires Scala 2.8.

running the scala WordCount example

install hadoop and make sure the hadoop script is on your path
install scala and make sure scalac is on your path
copy the hadoop-core jar from the root directory of the hadoop distribution to the directory in which you've checked out this tutorial
copy the commons-logging jar from the lib directory of the hadoop distribution to the directory in which you've checked out this tutorial
copy the commons-cli jar from the lib directory of the hadoop distribution to the directory in which you've checked out this tutorial
copy the scala-library.jar jar from the lib directory of the scala distribution to the directory in which you've checked out this tutorial
run the scala version of WordCount with the run.sh script included here

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
WordCount.scala		WordCount.scala
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

summary

running the scala WordCount example

About

Releases

Packages

Languages

milesegan/scala-hadoop-example

Folders and files

Latest commit

History

Repository files navigation

summary

running the scala WordCount example

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages