Cookies Policy

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.

I accept this policy

Find out more here

A Dutch Chunker as a Basis for the Extraction of Linguistic Knowledge

Brill’s MyBook program is exclusively available on BrillOnline Books and Journals. Students and scholars affiliated with an institution that has purchased a Brill E-Book on the BrillOnline platform automatically have access to the MyBook option for the title(s) acquired by the Library. Brill MyBook is a print-on-demand paperback copy which is sold at a favorably uniform low price.

Access this chapter

+ Tax (if applicable)

Chapter Summary

We have developed a fully automatic recursive chunker for unrestricted Dutch text to be used as a basis for the extraction of linguistic and terminological information. The chunker is based on the approach adopted for the analysis of German in the YAC-chunker. Our tool builds up flat annotations of (maximal) syntactic constituents, using a multi-pass algorithm.We describe the chunking procedure and the coverage of the chunker with examples, e.g. PPs/NPs with prenominal modification, tegen de uit ioniserende stralingen voortspruitende gevaren or de te fuseren vennootschappen. We also illustrate its use in term candidate extraction from about 20 million words of social security documents from Flanders.



Can't access your account?
  • Tools

  • Add to Favorites
  • Printable version
  • Email this page
  • Recommend to your library

    You must fill out fields marked with: *

    Librarian details
    Your details
    Why are you recommending this title?
    Select reason:
    Computational Linguistics in the Netherlands 2002 — Recommend this title to your library
  • Export citations
  • Key

  • Full access
  • Open Access
  • Partial/No accessInformation