Documents
Presentation Slides
Leveraging Arabic Morphology and Syntax for Achieving Better Keyphrase Extraction
- Citation Author(s):
- Submitted by:
- Muhammad Helmy
- Last updated:
- 30 November 2016 - 4:12am
- Document Type:
- Presentation Slides
- Document Year:
- 2016
- Event:
- Presenters:
- Muhammad Helmy
- Paper Code:
- 101
- Categories:
- Keywords:
- Log in to post comments
Arabic is one of the fastest growing languages on the Web, with an increasing amount of user generated content being published by both native and non-native speakers all over the world. Despite the great linguistic differences between Arabic and western languages such as English, most Arabic keyphrase extraction systems rely on approaches designed for western languages, thus ignoring its rich morphology and syntax. In this paper we present a new approach leveraging the Arabic morphology and syntax to generate a restricted set of meaningful candidates among which keyphrases are selected. Though employing a small set of well-known features to select the final keyphrases, our system consistently outperforms the well-known and established systems.