Download Email Zoning Dataset

This site contains information and resources related to Andrew Lampert's email text segmentation and classification research.

Zebra Image by arnolouise, licensed under Creative Commons

Annotated Email Dataset

Please fill in your details in the following form to download our dataset of zone-annotated email text.

Intended Use

Your details will be kept confidential, and only be used for us to gauge the level of interest in our work or to contact you with updates regarding the Zebra dataset or Zebra system code.

Creative Commons LicenseOur annotated data is licensed under a Creative Commons Attribution-Noncommercial 2.0 Generic License.

If you make use of any of these resources, please cite the following paper:

Andrew Lampert, Robert Dale and Cécile Paris (2009) - Segmenting Email Message Text into Zones, In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2009), pp. 919-928, Singapore, August 6-7.