Small Codes: a platform for digital resources and tools for minority languages and dialects

Loading...
Thumbnail Image
Date
2025
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association
Abstract
Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules-including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives-tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology.
Description

CCS Concepts: Human-centered computing → Web-based interaction; Information systems → Relational database model; Applied computing → Language translation; Social and professional topics → Cultural characteristics

        
@inproceedings{
10.2312:dh.20253331
, booktitle = {
Digital Heritage
}, editor = {
Campana, Stefano
and
Ferdani, Daniele
and
Graf, Holger
and
Guidi, Gabriele
and
Hegarty, Zackary
and
Pescarin, Sofia
and
Remondino, Fabio
}, title = {{
Small Codes: a platform for digital resources and tools for minority languages and dialects
}}, author = {
Zoli, Carlo
and
Mazzaggio, Greta
and
Binazzi, Neri
}, year = {
2025
}, publisher = {
The Eurographics Association
}, ISBN = {
978-3-03868-277-6
}, DOI = {
10.2312/dh.20253331
} }
Citation