Towards Building a Standard Dataset for Arabic Keyphrase Extraction Evaluation
- Submitted by: Muhammad Helmy
- Last updated: 30 November 2016 - 4:11am
- Document Type: Presentation Slides
- Document Year: 2016
- Presenters: Muhammad Helmy
- Paper Code: 102
Keyphrases are short phrases that best represent a document's content. They can be useful in a variety of applications, including document summarization and retrieval models. In this paper, we introduce the first dataset of keyphrases for an Arabic document collection, obtained by means of crowdsourcing. We experimentally evaluate different strategies for aggregating crowdsourced answers and validate their performance against expert annotations to assess the quality of our dataset. We report our experimental results, the dataset features, some lessons learned, and ideas for future work.
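As a rough illustration of what aggregating crowdsourced answers and validating them against expert annotations can look like, the sketch below uses a simple vote-threshold aggregation and scores the result with precision, recall, and F1. The abstract does not say which aggregation strategies were evaluated, so the threshold rule, the function names (`aggregate_by_votes`, `precision_recall_f1`), the `min_votes` parameter, and the toy data are all assumptions made for this example, not the authors' method.

```python
from collections import Counter


def aggregate_by_votes(worker_answers, min_votes=2):
    """Keep keyphrases proposed by at least `min_votes` workers.

    Hypothetical strategy for illustration only; the paper's own
    aggregation strategies are not described in the abstract.
    """
    counts = Counter()
    for answers in worker_answers:
        # Normalize and de-duplicate so each worker votes once per phrase.
        counts.update({a.strip().lower() for a in answers})
    return {phrase for phrase, n in counts.items() if n >= min_votes}


def precision_recall_f1(predicted, expert):
    """Score an aggregated keyphrase set against an expert-annotated set."""
    true_positives = len(predicted & expert)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(expert) if expert else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data (invented): three workers annotate the same document.
workers = [
    ["machine learning", "arabic text"],
    ["Machine Learning", "crowdsourcing"],
    ["arabic text", "machine learning", "dataset"],
]
expert = {"machine learning", "arabic text", "keyphrase extraction"}

aggregated = aggregate_by_votes(workers, min_votes=2)
precision, recall, f1 = precision_recall_f1(aggregated, expert)
print(sorted(aggregated), precision, recall, f1)
```

Raising `min_votes` generally trades recall for precision, which is one reason to compare several aggregation strategies against expert annotations rather than fixing a single rule in advance.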