This site contains information and resources related to Andrew Lampert's email text segmentation and classification research.
Please fill in your details in the following form to download our dataset of zone-annotated email text.
Your details will be kept confidential, and only be used for us to gauge the level of interest in our work or to contact you with updates regarding the Zebra dataset or Zebra system code.
Our annotated data is licensed under a Creative Commons Attribution-Noncommercial 2.0 Generic License.
If you make use of any of these resources, please cite the following paper:
Andrew Lampert, Robert Dale and Cécile Paris (2009) - Segmenting Email Message Text into Zones, In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2009), pp. 919-928, Singapore, August 6-7.