Sense through Time

dc.bibliographicCitation.firstPage1431
dc.bibliographicCitation.issue2
dc.bibliographicCitation.lastPage1465
dc.bibliographicCitation.volume59
dc.contributor.authorSchlechtweg, Dominik
dc.contributor.authorZamora-Reina, Frank D.
dc.contributor.authorBravo-Marquez, Felipe
dc.contributor.authorArefyev, Nikolay
dc.date.accessioned2025-08-25T11:11:16Z
dc.date.available2025-08-25T11:11:16Z
dc.date.issued2024
dc.date.updated2025-07-02T04:01:52Z
dc.description.abstractThere has been extensive work on human word sense annotation, i.e., manually labeling word uses in natural texts according to their senses. Such labels were primarily created for the tasks of Word Sense Disambiguation (WSD) and Word Sense Induction (WSI). However, almost all datasets annotated with word senses are synchronic datasets, i.e., contain texts created in a relatively short period of time and often do not provide the creation date of the texts. This ignores possible applications in diachronic-historic settings, where the aim is to induce or disambiguate historical word senses or changes in senses across time. To facilitate investigations into historical WSD and WSI and to establish connections with the task of Lexical Semantic Change Detection (LSCD), there is a crucial need for historical word sense-annotated data. Hence, we created a new reliable diachronic WSD/WSI dataset ‘DWUG DE Sense’. We describe the preparation and annotation and analyze central statistics. We then describe a thorough evaluation of different prediction systems for jointly solving both WSI and LSCD tasks. All our systems are based on a state-of-the-art architecture that combines Word-in-Context models and graph clustering techniques with different hyperparameter settings. Our findings reveal that using the WSI task as optimization criterion yields better results for both tasks even when the LSCD task is the focal point of optimization. This underscores that although both tasks are related, WSI seems to be more general and able to incorporate the LSCD task.
dc.description.sponsorshipOpen Access funding enabled and organized by Projekt DEAL.
dc.description.sponsorshipVetenskapsrådethttp://dx.doi.org/10.13039/501100004359
dc.description.sponsorshipRiksbankens Jubileumsfondhttp://dx.doi.org/10.13039/501100004472
dc.description.sponsorshipAgencia Nacional de Investigación y Desarrollohttp://dx.doi.org/10.13039/501100020884
dc.description.sponsorshipEuropean Union’s Horizon Europe
dc.description.sponsorshipUniversität Stuttgart (1023)
dc.identifier.doi10.1007/s10579-024-09771-7
dc.identifier.urihttp://resolver.sub.uni-goettingen.de/purl?fidaac-11858/3639
dc.language.isoeng
dc.relation.issn1574-020X
dc.relation.journalLanguage Resources and Evaluation
dc.rightsCC BY 4.0
dc.subject.ddcddc:400
dc.subject.ddcddc:420
dc.subject.fieldlinguistics
dc.subject.fieldenglishstudies
dc.subject.fielddigitalhumanities
dc.titleSense through Time
dc.title.alternativeDiachronic Word Sense Annotations for Word Sense Induction and Lexical Semantic Change Detection
dc.typearticle
dc.type.versionpublishedVersion
dspace.entity.typePublication

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10579_2024_Article_9771.pdf
Size:
2.38 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
5.84 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections