Restructuring Morphological Output for Best Practices
Presented by:          Hans Nelson, Jarren Bodily and Deryle Lonsdale,
BYU Linguistics and English Language
 
Project / Software Title :          
Project / Software URL:    
Access / Availability:          
Description:    

Restructuring PC-Kimmo output
  • Austonesian language engines: Tagalog, Cebuano, Indonesian
  • XML output in KOML format
Kimmo to XML
  • KimXM (Visual Basic .NET 2003)
    • Purpose: to structure the recognition results of PC-Kimmo into an XML data structure.
      • Used string matching and replace functions
      • Regular expressions
      • XML special character checking functions
  • Benefits
    • This system offers a suitable output mechanism assuring compatibility across linguistic research and search engine platforms along with an encoding method that makes such sharability possible.
  • Kimmo Output Markup Language (KOML)
    • Constrained using a DTD (koml v1.dtd)


References:
ANTWORTH, EVAN L. 1990. PC-KIMMO: a two-level processor for morphological analysis. Occasional Publications in Academic Computing No. 16. Dallas, TX: Summer Institute of Linguistics. ISBN 0-88312-639-7, 273 pages, paperbound.
NELSON, HANS. (2004). A Two-level Engine for Tagalog Morphology and a Structured XML Output for PC-Kimmo. Master's Thesis Project, Brigham Young University.


Program
Papers and Handouts
Participants
Instructions for Participants
Working Groups
Registration
Local Arrangements
E-MELD 2001 E-MELD 2002 E-MELD 2003 E-MELD Homepage