emailarchive

Hadoop for archiving email

To run the sample, take the following steps:

  1. Put the sample emails from the data folder into HDFS.
  2. Run the two Hadoop jobs:
       hadoop jar convertsearch.jar ConvertEmailsToSequence
       hadoop jar convertsearch.jar SearchEmail
  3. The sample data contains a small set of .msg files (all copies), and the results in your /tmp dir should be identical to this
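
As a rough sketch, steps 1 and 2 could be scripted as below. The two `hadoop jar` lines are taken from the steps above; everything else is an assumption. In particular, the HDFS target directory (`emails`) is a hypothetical placeholder, since this README does not say where the jobs expect their input, and the script assumes it is run from the directory containing `convertsearch.jar` and the `data` folder. By default it only prints the commands; set `DRY_RUN=0` to actually execute them.

```shell
#!/bin/sh
# Sketch only: wraps the steps from this README in one function.
# LOCAL_DATA and HDFS_DIR are assumptions, not paths confirmed by the project.
LOCAL_DATA="${LOCAL_DATA:-data}"   # local folder holding the sample .msg files
HDFS_DIR="${HDFS_DIR:-emails}"     # hypothetical HDFS destination
DRY_RUN="${DRY_RUN:-1}"            # 1 = print commands, 0 = execute them

# Print the command in dry-run mode, otherwise run it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

run_sample() {
  # Step 1: copy the sample emails into HDFS.
  run hadoop fs -put "$LOCAL_DATA" "$HDFS_DIR" &&
  # Step 2: run the two jobs, in the order they are listed.
  run hadoop jar convertsearch.jar ConvertEmailsToSequence &&
  run hadoop jar convertsearch.jar SearchEmail
}

run_sample
```

The `&&` chaining stops the script at the first failing command, so the search job is not started if the upload or the conversion job fails.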