Introducing STAF: The Saarbrücken Treebank of Albanian Fiction
The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF foc...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Ubiquity Press
2025-01-01
|
Series: | Journal of Open Humanities Data |
Subjects: | |
Online Access: | https://account.openhumanitiesdata.metajnl.com/index.php/up-j-johd/article/view/285 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The present paper describes the building of STAF, a Universal Dependencies treebank for Albanian. STAF was bootstrapped using a Stanza model trained on previously unreleased data and then manually corrected by three Albanian speakers supervised by the author, who also revised all sentences. STAF focuses on the fiction genre, featuring 200 sentences selected from nine literary texts written by Albanian contemporary authors. |
---|---|
ISSN: | 2059-481X |