Introducing STAF: The Saarbrücken Treebank of Albanian Fiction

The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF foc...

Full description

Saved in:
Bibliographic Details
Main Author: Luigi Talamo
Format: Article
Language:English
Published: Ubiquity Press 2025-01-01
Series:Journal of Open Humanities Data
Subjects:
Online Access:https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823859316223901696
author Luigi Talamo
author_facet Luigi Talamo
author_sort Luigi Talamo
collection DOAJ
description The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors.
format Article
id doaj-art-477f2a814dd0488192d5b6a00eee1db8
institution Kabale University
issn 2059-481X
language English
publishDate 2025-01-01
publisher Ubiquity Press
record_format Article
series Journal of Open Humanities Data
spelling doaj-art-477f2a814dd0488192d5b6a00eee1db82025-02-11T05:37:28ZengUbiquity PressJournal of Open Humanities Data2059-481X2025-01-01113310.5334/johd.285285Introducing STAF: The Saarbrücken Treebank of Albanian FictionLuigi Talamo0https://orcid.org/0009-0009-4640-3052Language Science and Technology, Saarland University, SaarbrückenThe present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors.https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285albaniantreebankuniversal dependenciesfiction
spellingShingle Luigi Talamo
Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
Journal of Open Humanities Data
albanian
treebank
universal dependencies
fiction
title Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
title_full Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
title_fullStr Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
title_full_unstemmed Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
title_short Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
title_sort introducing staf the saarbrucken treebank of albanian fiction
topic albanian
treebank
universal dependencies
fiction
url https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285
work_keys_str_mv AT luigitalamo introducingstafthesaarbruckentreebankofalbanianfiction