Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF foc...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2025-01-01
|
Series: | Journal of Open Humanities Data |
Subjects: | |
Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823859316223901696 |
---|---|
author | Luigi Talamo |
author_facet | Luigi Talamo |
author_sort | Luigi Talamo |
collection | DOAJ |
description | The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors. |
format | Article |
id | doaj-art-477f2a814dd0488192d5b6a00eee1db8 |
institution | Kabale University |
issn | 2059-481X |
language | English |
publishDate | 2025-01-01 |
publisher | Ubiquity Press |
record_format | Article |
series | Journal of Open Humanities Data |
spelling | doaj-art-477f2a814dd0488192d5b6a00eee1db82025-02-11T05:37:28ZengUbiquity PressJournal of Open Humanities Data2059-481X2025-01-01113310.5334/johd.285285Introducing STAF: The Saarbrücken Treebank of Albanian FictionLuigi Talamo0https://orcid.org/0009-0009-4640-3052Language Science and Technology, Saarland University, SaarbrückenThe present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors.https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285albaniantreebankuniversal dependenciesfiction |
spellingShingle | Luigi Talamo Introducing STAF: The Saarbrücken Treebank of Albanian Fiction Journal of Open Humanities Data albanian treebank universal dependencies fiction |
title | Introducing STAF: The Saarbrücken Treebank of Albanian Fiction |
title_full | Introducing STAF: The Saarbrücken Treebank of Albanian Fiction |
title_fullStr | Introducing STAF: The Saarbrücken Treebank of Albanian Fiction |
title_full_unstemmed | Introducing STAF: The Saarbrücken Treebank of Albanian Fiction |
title_short | Introducing STAF: The Saarbrücken Treebank of Albanian Fiction |
title_sort | introducing staf the saarbrucken treebank of albanian fiction |
topic | albanian treebank universal dependencies fiction |
url | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285 |
work_keys_str_mv | AT luigitalamo introducingstafthesaarbruckentreebankofalbanianfiction |