Small Codes: a platform for digital resources and tools for minority languages and dialects

dc.contributor.author: Zoli, Carlo (en_US)
dc.contributor.author: Mazzaggio, Greta (en_US)
dc.contributor.author: Binazzi, Neri (en_US)
dc.contributor.editor: Campana, Stefano (en_US)
dc.contributor.editor: Ferdani, Daniele (en_US)
dc.contributor.editor: Graf, Holger (en_US)
dc.contributor.editor: Guidi, Gabriele (en_US)
dc.contributor.editor: Hegarty, Zackary (en_US)
dc.contributor.editor: Pescarin, Sofia (en_US)
dc.contributor.editor: Remondino, Fabio (en_US)
dc.date.accessioned: 2025-09-05T20:57:40Z
dc.date.available: 2025-09-05T20:57:40Z
dc.date.issued: 2025
dc.description.abstract: Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules (including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives) tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology. (en_US)
dc.description.sectionheaders: Digital Technologies for CHANGES (CHANGES SESSION) - Part 2
dc.description.seriesinformation: Digital Heritage
dc.identifier.doi: 10.2312/dh.20253331
dc.identifier.isbn: 978-3-03868-277-6
dc.identifier.pages: 9 pages
dc.identifier.uri: https://doi.org/10.2312/dh.20253331
dc.identifier.uri: https://diglib.eg.org/handle/10.2312/dh20253331
dc.publisher: The Eurographics Association (en_US)
dc.rights: Attribution 4.0 International License
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: CCS Concepts: Human-centered computing → Web-based interaction; Information systems → Relational database model; Applied computing → Language translation; Social and professional topics → Cultural characteristics
dc.subject: Human-centered computing → Web-based interaction
dc.subject: Information systems → Relational database model
dc.subject: Applied computing → Language translation
dc.subject: Social and professional topics → Cultural characteristics
dc.title: Small Codes: a platform for digital resources and tools for minority languages and dialects (en_US)
Files
Original bundle
Name: dh20253331.pdf
Size: 979.58 KB
Format: Adobe Portable Document Format