Morphosyntactic Variation in Italian and Romansh Dialects: The Manzini & Savoia (2005) Corpus Within Project CHANGES

dc.contributor.authorMazzaggio, Gretaen_US
dc.contributor.authorZoli, Carloen_US
dc.contributor.authorBinazzi, Nerien_US
dc.contributor.authorLudovico, Luca Andreaen_US
dc.contributor.authorVena, Mael Vittorioen_US
dc.contributor.authorManzini, M. Ritaen_US
dc.contributor.authorSavoia, Leonardo Mariaen_US
dc.contributor.editorCampana, Stefanoen_US
dc.contributor.editorFerdani, Danieleen_US
dc.contributor.editorGraf, Holgeren_US
dc.contributor.editorGuidi, Gabrieleen_US
dc.contributor.editorHegarty, Zackaryen_US
dc.contributor.editorPescarin, Sofiaen_US
dc.contributor.editorRemondino, Fabioen_US
dc.date.accessioned2025-09-05T20:57:10Z
dc.date.available2025-09-05T20:57:10Z
dc.date.issued2025
dc.description.abstractThis paper presents the digitization and online dissemination of the Manzini & Savoia (2005) corpus, one of the most comprehensive resources on morphosyntactic variation in Italian and Romansh dialects. Developed within Project CHANGES (Cultural Heritage Active Innovation for Sustainable Society), the initiative responds to the urgent need for systematic documentation and open-access preservation of linguistic diversity. The new platform integrates a relational PostgreSQL database, a Strapi-based backend, and an interactive web interface, offering multiple modes of exploration-including map-based navigation, morphosyntactic query, and access to original fieldwork notebooks. The entire dataset (64,472 examples with IPA transcription and metadata) is openly available on Zenodo for independent research and reuse. The project also explores experimental applications of Large Language Models (LLMs) for automatic annotation, demonstrating the potential for computational approaches in dialectology. This work provides a replicable model for sustainable digital archiving and fosters interdisciplinary research across linguistic, computational, and cultural heritage domains.en_US
dc.description.sectionheadersDigital Technologies for CHANGES (CHANGES SESSION) - Part 2
dc.description.seriesinformationDigital Heritage
dc.identifier.doi10.2312/dh.20253066
dc.identifier.isbn978-3-03868-277-6
dc.identifier.pages8 pages
dc.identifier.urihttps://doi.org/10.2312/dh.20253066
dc.identifier.urihttps://diglib.eg.org/handle/10.2312/dh20253066
dc.publisherThe Eurographics Associationen_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Human-centered computing → Web-based interaction; Information systems → Relational database model; Applied computing → Language translation; Social and professional topics → Cultural characteristics
dc.subjectHuman centered computing → Web
dc.subjectbased interaction
dc.subjectInformation systems → Relational database model
dc.subjectApplied computing → Language translation
dc.subjectSocial and professional topics → Cultural characteristics
dc.titleMorphosyntactic Variation in Italian and Romansh Dialects: The Manzini & Savoia (2005) Corpus Within Project CHANGESen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
dh20253066.pdf
Size:
21.31 MB
Format:
Adobe Portable Document Format