Efficient k-mer based curation of raw sequence data: application in Drosophila suzukii

Several studies have highlighted the presence of contaminated entries in public sequence repositories, calling for special attention to the associated metadata. Here, we propose and evaluate a fast and efficient k–mer-based approach to assess the degree of mislabeling or contamination. We applied it...

Full description

Saved in:
Bibliographic Details
Main Author: Gautier, Mathieu
Format: Article
Language:English
Published: Peer Community In 2023-09-01
Series:Peer Community Journal
Online Access:https://peercommunityjournal.org/articles/10.24072/pcjournal.309/
Tags: Add Tag
No Tags, Be the first to tag this record!