Using the MF/MD method for automatic text classification

Chapter Summary

In corpus linguistics, as well as in computational linguistics and information retrieval, there is an increasing demand for the automatic classification of large amounts of text. In his research, Biber uses the Multi-Feature/Multi-Dimension (MF/MD) method to obtain a classification of English texts. A major disadvantage of his approach is its heavy reliance on frequency counts of complex grammatical features, which are hard to retrieve automatically. In this paper, we investigate whether Biber’s MF/MD method can be used for automatic text classification. For this purpose, the MF/MD method is applied to the ICE-GB corpus, using three different sets of linguistic features. The results indicate that automatic text classification is indeed feasible when word class tags are used as input for the MF/MD method.
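
To make the idea of using word class tags as input more concrete, the following is a minimal sketch of the kind of preprocessing an MF/MD-style analysis typically starts from: counting tags per text, normalizing the counts to a common text length, and standardizing them across texts. In Biber's method, standardized feature frequencies of this kind are then submitted to a factor analysis to derive the dimensions; that step is not shown here. The function names, the toy texts, and the tag labels are all hypothetical and do not reproduce the ICE-GB tagset or the feature sets used in the chapter.

```python
from collections import Counter
from statistics import mean, pstdev


def tag_frequencies(tagged_text, per=1000):
    """Return word class tag counts normalized per `per` tokens."""
    counts = Counter(tag for _, tag in tagged_text)
    total = len(tagged_text)
    return {tag: per * n / total for tag, n in counts.items()}


def standardize(profiles):
    """Convert per-text tag frequencies to z-scores across all texts."""
    tags = sorted({tag for profile in profiles.values() for tag in profile})
    z = {name: {} for name in profiles}
    for tag in tags:
        values = [profile.get(tag, 0.0) for profile in profiles.values()]
        mu, sigma = mean(values), pstdev(values)
        for name, profile in profiles.items():
            # A tag with no variation across texts carries no information.
            z[name][tag] = (profile.get(tag, 0.0) - mu) / sigma if sigma else 0.0
    return z


# Toy (word, tag) sequences; the tag labels are illustrative assumptions.
texts = {
    "conversation": [("I", "PRON"), ("think", "V"), ("you", "PRON"), ("know", "V")],
    "report": [("The", "ART"), ("results", "N"), ("indicate", "V"), ("feasibility", "N")],
}

profiles = {name: tag_frequencies(tokens) for name, tokens in texts.items()}
z_scores = standardize(profiles)
```

Because word class tags can be assigned by an automatic tagger, a profile like `z_scores` can be computed without manual annotation, which is the property the summary above points to.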

DOI: 10.1163/9789042029248_004
In: Extending the scope of corpus-based research