MoNetViT: an efficient fusion of CNN and transformer technologies for visual navigation assistance with multi query attention

Aruco markers are crucial for navigation in complex indoor environments, especially for those with visual impairments. Traditional CNNs handle image segmentation well, but transformers excel at capturing long-range dependencies, essential for machine vision tasks. Our study introduces MoNetViT (Mini...

Full description

Saved in:

Bibliographic Details
Main Authors:	Liliek Triyono, Rahmat Gernowo, Prayitno
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-02-01
Series:	Frontiers in Computer Science
Subjects:	indoor navigation computer vision markers assistive technology mobile devices
Online Access:	https://www.frontiersin.org/articles/10.3389/fcomp.2025.1510252/full
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.frontiersin.org/articles/10.3389/fcomp.2025.1510252/full

MoNetViT: an efficient fusion of CNN and transformer technologies for visual navigation assistance with multi query attention

Internet

Similar Items