Introducing STAF: The Saarbrücken Treebank of Albanian Fiction

The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF foc...

Full description

Saved in:
Bibliographic Details
Main Author: Luigi Talamo
Format: Article
Language:English
Published: Ubiquity Press 2025-01-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors.
ISSN:2059-481X