Cookies Policy

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.

I accept this policy

Find out more here

Developing a text-based corpus of the language of Japanese comics (manga)

Brill’s MyBook program is exclusively available on BrillOnline Books and Journals. Students and scholars affiliated with an institution that has purchased a Brill E-Book on the BrillOnline platform automatically have access to the MyBook option for the title(s) acquired by the Library. Brill MyBook is a print-on-demand paperback copy which is sold at a favorably uniform low price.

Access this chapter

+ Tax (if applicable)

Chapter Summary

While demands for corpora from media which mix visual and linguistic elements have increased in recent years with developments in corpus-based linguistics research, the actual creation and design of such corpora present many unique problems. Most centrally, there remains much to be considered in terms of how to isolate and meaningfully represent their linguistic data. In line with these trends, in this paper I introduce a 687,654 character (55,415 entries) corpus of the language from Japanese comics (manga). Many of the issues encountered in its design are found with other media – newspaper stories, advertisements, political cartoons – which mix the visual with the linguistic. In addition to describing how such unusual text could be of interest to other researchers, the approaches taken here may help others with similar projects.



Can't access your account?
  • Tools

  • Add to Favorites
  • Printable version
  • Email this page
  • Recommend to your library

    You must fill out fields marked with: *

    Librarian details
    Your details
    Why are you recommending this title?
    Select reason:
    Corpus-based Studies in Language Use, Language Learning, and Language Documentation — Recommend this title to your library
  • Export citations
  • Key

  • Full access
  • Open Access
  • Partial/No accessInformation