| Title: | LUAGRAPHEME: A LUA LIBRARY FOR UNICODE TEXT SEGMENTATION |
| Author(s): | GABRIEL VIANNA SOLDANI |
| Contributor(s): | ROBERTO IERUSALIMSCHY - Advisor; LUIS FERNANDO TEIXEIRA BICALHO - Advisor |
| Cataloged: | 25/MAR/2026 | Language(s): | PORTUGUESE - BRAZIL |
| Type: | TEXT | Subtype: | SENIOR PROJECT |
| Notes: | All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents conform to the systems of the administration of PUC-Rio. |
| Reference(s): | [pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=75806@1 [en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/TFCs/consultas/conteudo.php?strSecao=resultado&nrSeq=75806@2 |
| DOI: | https://doi.org/10.17771/PUCRio.acad.75806 |
| Abstract: |
luagrapheme is a library for Unicode text segmentation in the Lua
programming language. Its goal is to accurately identify textual units such as
grapheme clusters (the visual characters perceived by users), words,
sentences, and line break opportunities. The project consists of bindings for
libgrapheme, a minimalist C library that faithfully implements the algorithms
defined by the Unicode Standard. luagrapheme provides integration with the
LPeg library, enabling the definition and recognition of patterns and
grammars based on Parsing Expression Grammars (PEGs) that incorporate
Unicode segmentation algorithms. The methodology combined development
in Lua and C, supported by automated testing and continuous integration.
The result is a small, predictable, and portable library that enhances Unicode
support in the Lua ecosystem and makes multilingual text processing more
accessible to developers.
|