Syntactic Annotation and Text Classification: A Study Using the Penn-Helsinki Parsed Corpus of Middle English

Chapter Summary

This article explores whether the syntactic annotations of the Penn-Helsinki Parsed Corpus of Middle English (Phase 1) can be used to classify the corpus texts by genre. Texts of the period tend to be classified by region, period, text genre, or foreign influence, and the corpus itself assigns its texts to 19 text types. The choice of these variables is, however, external to the texts and arbitrary. The corpus contains 52 types of syntactic tagging, which can be used in research on distinguishing texts. This study shows that text-internal information, such as the syntactic features of a text, can be of use to linguistic studies of Middle English, and that the syntactic properties of a text can reflect the stylistic varieties of its genre. A cluster analysis illustrates the correlation between text genre and syntactic structure. In that analysis, texts assigned to one genre in the corpus are grouped together with texts assigned to another, which indicates that classification based on external criteria is not satisfactory. The study also shows that the decisive factors in distinguishing texts are nouns, verbs, conjunctions and clause types.
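The kind of cluster analysis described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the summary does not say which clustering algorithm, distance measure, or normalization the study uses, so the average-linkage agglomerative clustering and Euclidean distance below are stand-ins. The text names, feature names, and frequencies are invented placeholders, not data from the corpus.

```python
from itertools import combinations
from math import sqrt

# Hypothetical tag frequencies per 1,000 tags. These numbers are invented
# for illustration only and are NOT taken from the corpus or the study.
TOY_PROFILES = {
    "text_A": {"noun": 220.0, "verb": 110.0, "conjunction": 60.0, "relative_clause": 12.0},
    "text_B": {"noun": 215.0, "verb": 115.0, "conjunction": 58.0, "relative_clause": 14.0},
    "text_C": {"noun": 150.0, "verb": 160.0, "conjunction": 95.0, "relative_clause": 30.0},
    "text_D": {"noun": 155.0, "verb": 150.0, "conjunction": 90.0, "relative_clause": 28.0},
}


def distance(p, q, features):
    """Euclidean distance between two frequency profiles (an assumed metric)."""
    return sqrt(sum((p[f] - q[f]) ** 2 for f in features))


def average_linkage(c1, c2, profiles, features):
    """Mean pairwise distance between the members of two clusters."""
    pairs = [(a, b) for a in c1 for b in c2]
    total = sum(distance(profiles[a], profiles[b], features) for a, b in pairs)
    return total / len(pairs)


def cluster(profiles, k):
    """Merge the two closest clusters repeatedly until only k clusters remain."""
    features = sorted(next(iter(profiles.values())))
    clusters = [frozenset([name]) for name in sorted(profiles)]
    while len(clusters) > k:
        a, b = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], profiles, features),
        )
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters


if __name__ == "__main__":
    for group in cluster(TOY_PROFILES, 2):
        print(sorted(group))
```

Comparing the resulting groups with the genre labels assigned in the corpus would show whether texts from different genres end up in the same cluster, which is the kind of mismatch the summary reports.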

DOI: 10.1163/9789004334205_016
In: English Corpus Linguistics in Japan