Show simple item record

dc.contributor.authorDilukshana, C.
dc.contributor.authorSarveswaran, K
dc.date.accessioned2025-12-11T09:47:21Z
dc.date.accessioned2025-12-11T10:22:03Z
dc.date.available2025-12-11T09:47:21Z
dc.date.available2025-12-11T10:22:03Z
dc.date.issued2025-11
dc.identifier.urihttps://ir.kdu.ac.lk/handle/345/8980
dc.description.abstractTreebanks are critical resources in Natural Language Processing (NLP), supporting parser development, linguistic research, and the evaluation of large language models. While Tamil has seen progress in Universal Dependencies (UD) treebanking, existing corpora have been restricted to prose texts, leaving its vast poetic tradition underrepresented. This paper presents the first effort to be made to construct a syntactic treebank for Tamil poetry, specifically focussing on the ThirukkuRaḷ, which is composed in kuRaḷ veṇpā form. A central challenge in this work is posed by the treatment of multiword tokens (MWTs) and elliptical constructions, both of which are observed to occur frequently in Tamil verse due to its agglutinative morphology and metrical constraints. An annotation strategy is proposed within the Enhanced UD (EUD) framework to systematically address five major types of ellipsis—casal, verbal, adjectival, comparative/simile, and cumulative—alongside complex MWT patterns. These annotations not only enhance the representation of Tamil poetic syntax but also broaden the applicability of UD guidelines to underrepresented genres. The contribution is shown to underscore the linguistic and computational importance of capturing the structural specificities of Tamil poetry, while establishing a foundation for future cross-linguistic and literary treebanking effortsen_US
dc.language.isoenen_US
dc.subjectElliptical Compounds, Multiword Tokenization, Poetic Treebank, Universal Dependencies, Tamil Languageen_US
dc.titleExtending Universal Dependencies to Tamil Poetics:Multiword Tokenisation and Ellipsis in the ThirukkuRaḷ Treebanken_US
dc.typeJournal articleen_US
dc.identifier.facultyFGSen_US
dc.identifier.journalKJMSen_US
dc.identifier.issue02en_US
dc.identifier.volume07en_US
dc.identifier.pgnos207-216en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record