Small Codes: a platform for digital resources and tools for minority languages and dialects
Date
2025
Publisher
The Eurographics Association
Abstract
Small Codes is an open digital infrastructure designed to support the preservation and revitalization of minority languages through scalable, interoperable and user-friendly tools. The platform combines linguistic data management with web-based technologies, offering an integrated suite of software modules (including online dictionaries, spell-checkers, corpus alignment systems, linguistic maps, and multimedia archives) tailored for under-resourced and dialectally fragmented languages. Unlike standard language technology pipelines designed for dominant languages, Small Codes supports linguistically diverse input and community-led data models. It operates through a federated, semi-industrial development model, balancing long-term sustainability with flexibility for academic and institutional partners. This paper outlines the system architecture and core functionalities of Small Codes, presents selected implementation scenarios, and discusses its contribution to digital heritage and computational dialectology.
Description
CCS Concepts: Human-centered computing → Web-based interaction; Information systems → Relational database model; Applied computing → Language translation; Social and professional topics → Cultural characteristics
@inproceedings{10.2312:dh.20253331,
booktitle = {Digital Heritage},
editor = {Campana, Stefano and Ferdani, Daniele and Graf, Holger and Guidi, Gabriele and Hegarty, Zackary and Pescarin, Sofia and Remondino, Fabio},
title = {{Small Codes: a platform for digital resources and tools for minority languages and dialects}},
author = {Zoli, Carlo and Mazzaggio, Greta and Binazzi, Neri},
year = {2025},
publisher = {The Eurographics Association},
ISBN = {978-3-03868-277-6},
DOI = {10.2312/dh.20253331}
}