Clustering OKU Timur Script Images using VGG Feature extraction and K-Means

This study focuses on the utilization of clustering models to group manuscript images from the OKU Timur region based on specific characteristics. OKU Timur is rich in cultural heritage, including a unique writing system known as the OKU Timur script. The development of intelligent systems technolog...

Full description

Saved in:

Bibliographic Details
Main Authors:	Liu Toriko, Susan Dian Purnamasari, Yesi Novaria Kunang, Ilman Zuhri Yadi, Andri Andri
Format:	Article
Language:	English
Published:	LPPM ISB Atma Luhur 2024-12-01
Series:	Jurnal Sisfokom
Subjects:	oku timur script vgg16 model clustering k-means manuscript images
Online Access:	https://jurnal.atmaluhur.ac.id/index.php/sisfokom/article/view/2292
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1823856512590675968
author	Liu Toriko Susan Dian Purnamasari Yesi Novaria Kunang Ilman Zuhri Yadi Andri Andri
author_facet	Liu Toriko Susan Dian Purnamasari Yesi Novaria Kunang Ilman Zuhri Yadi Andri Andri
author_sort	Liu Toriko
collection	DOAJ
description	This study focuses on the utilization of clustering models to group manuscript images from the OKU Timur region based on specific characteristics. OKU Timur is rich in cultural heritage, including a unique writing system known as the OKU Timur script. The development of intelligent systems technology can be employed to recognize the OKU Timur script. For this purpose, a dataset of OKU Timur script is needed, which will later be used for classifying script images. One of the challenges in preparing the dataset is grouping a large number of script image samples according to the number of characters. A proposed solution in this research is to automatically group script images by applying the K-Means algorithm. The dataset comprises 2,280 images, representing 19 characters and 228 variations with different diacritics. Features are extracted using the VGG16 model, which are then clustered with the K-Means algorithm. Clustering performance is evaluated based on the percentage of correctly grouped characters. For 19 groups (character count), the model achieves an accuracy of 82.6%. For 228 groups (variations and diacritics), it correctly groups 48.16% of characters. Despite the challenges, the results demonstrate the model’s potential for further refinement. This study’s contribution lies in introducing an efficient clustering approach for cultural manuscripts, supporting digital preservation, and advancing automatic recognition of the OKU Timur script. These efforts aim to preserve the script for future generations.
format	Article
id	doaj-art-8f0373a26fba43efb429f7219a9d3828
institution	Kabale University
issn	2301-7988 2581-0588
language	English
publishDate	2024-12-01
publisher	LPPM ISB Atma Luhur
record_format	Article
series	Jurnal Sisfokom
spelling	doaj-art-8f0373a26fba43efb429f7219a9d38282025-02-12T07:27:39ZengLPPM ISB Atma LuhurJurnal Sisfokom2301-79882581-05882024-12-01141152210.32736/sisfokom.v14i1.22921959Clustering OKU Timur Script Images using VGG Feature extraction and K-MeansLiu Toriko0Susan Dian Purnamasari1Yesi Novaria Kunang2Ilman Zuhri Yadi3Andri Andri4Intelligent Systems Research Group, Faculty of Science Technology, Bina Darma UniversityIntelligent Systems Research Group, Faculty of Science Technology, Bina Darma UniversityIntelligent Systems Research Group, Faculty of Science Technology, Bina Darma UniversityIntelligent Systems Research Group, Faculty of Science Technology, Bina Darma UniversityIntelligent Systems Research Group, Faculty of Science Technology, Bina Darma UniversityThis study focuses on the utilization of clustering models to group manuscript images from the OKU Timur region based on specific characteristics. OKU Timur is rich in cultural heritage, including a unique writing system known as the OKU Timur script. The development of intelligent systems technology can be employed to recognize the OKU Timur script. For this purpose, a dataset of OKU Timur script is needed, which will later be used for classifying script images. One of the challenges in preparing the dataset is grouping a large number of script image samples according to the number of characters. A proposed solution in this research is to automatically group script images by applying the K-Means algorithm. The dataset comprises 2,280 images, representing 19 characters and 228 variations with different diacritics. Features are extracted using the VGG16 model, which are then clustered with the K-Means algorithm. Clustering performance is evaluated based on the percentage of correctly grouped characters. For 19 groups (character count), the model achieves an accuracy of 82.6%. For 228 groups (variations and diacritics), it correctly groups 48.16% of characters. Despite the challenges, the results demonstrate the model’s potential for further refinement. This study’s contribution lies in introducing an efficient clustering approach for cultural manuscripts, supporting digital preservation, and advancing automatic recognition of the OKU Timur script. These efforts aim to preserve the script for future generations.https://jurnal.atmaluhur.ac.id/index.php/sisfokom/article/view/2292oku timur scriptvgg16 modelclusteringk-meansmanuscript images
spellingShingle	Liu Toriko Susan Dian Purnamasari Yesi Novaria Kunang Ilman Zuhri Yadi Andri Andri Clustering OKU Timur Script Images using VGG Feature extraction and K-Means Jurnal Sisfokom oku timur script vgg16 model clustering k-means manuscript images
title	Clustering OKU Timur Script Images using VGG Feature extraction and K-Means
title_full	Clustering OKU Timur Script Images using VGG Feature extraction and K-Means
title_fullStr	Clustering OKU Timur Script Images using VGG Feature extraction and K-Means
title_full_unstemmed	Clustering OKU Timur Script Images using VGG Feature extraction and K-Means
title_short	Clustering OKU Timur Script Images using VGG Feature extraction and K-Means
title_sort	clustering oku timur script images using vgg feature extraction and k means
topic	oku timur script vgg16 model clustering k-means manuscript images
url	https://jurnal.atmaluhur.ac.id/index.php/sisfokom/article/view/2292
work_keys_str_mv	AT liutoriko clusteringokutimurscriptimagesusingvggfeatureextractionandkmeans AT susandianpurnamasari clusteringokutimurscriptimagesusingvggfeatureextractionandkmeans AT yesinovariakunang clusteringokutimurscriptimagesusingvggfeatureextractionandkmeans AT ilmanzuhriyadi clusteringokutimurscriptimagesusingvggfeatureextractionandkmeans AT andriandri clusteringokutimurscriptimagesusingvggfeatureextractionandkmeans

Clustering OKU Timur Script Images using VGG Feature extraction and K-Means

Similar Items