Skip to main content
Presentation
Readability Classification for German using lexical, syntactic, and morphological features
Proceedings of COLING 2012: Technical Papers (2012)
  • Julia Hancke, Universität Tübingen
  • Sowmya Vajjala, Universität Tübingen
  • Detmar Meurers, Universität Tübingen
Abstract
We investigate the problem of reading level assessment for German texts on a newly compiled corpus of freely available easy and difficult articles, targeted at adult and child readers respectively. We adapt a wide range of syntactic, lexical and language model features from previous research on English and combined them with new features that make use of the rich morphology of German. We show that readability classification for German based on these features is highly successful, reaching 89.7% accuracy, with the new morphological features making an important contribution.
Publication Date
December, 2012
Location
Mumbai, India
Comments
Copyright 2012 The Authors
Citation Information
Julia Hancke, Sowmya Vajjala and Detmar Meurers. "Readability Classification for German using lexical, syntactic, and morphological features" Proceedings of COLING 2012: Technical Papers (2012)
Available at: http://works.bepress.com/sowmya-vajjala/6/