emailarchive
emailarchive copied to clipboard
Hadoop for archiving email
trafficstars
To run the sample, take the following steps:
- Put sample emails from data folder into HDFS
- Run hadoop job:
hadoop jar convertsearch.jar ConvertEmailsToSequence
- The sample data contains small set of .msg files (all copies) and the results in your /tmp dir should be identical to this