Printed Gujarati Text Detection and Recognition (PGTDR)
2023 6th International Conference on Advances in Science and Technology (ICAST), 2023
Abstract
India is a land of many historic manuscripts and parchments. According to statistics from the National Mission for Manuscripts, more than one-third of Indian manuscripts are written in Gujarati. Since handwritten data on paper is difficult to maintain, digitizing these manuscripts and storing them in a database helps preserve them. This research uses machine learning models trained on a custom dataset to recognize Gujarati text and convert it into a machine-readable format. It presents an effective, robust and high-precision system for conveniently digitizing these manuscripts, using YOLOv8 for character detection with 98% precision and a DenseNet model for text recognition with 99.2% accuracy. The system can also detect text blocks in newspaper articles, and this proof of concept is expected to be highly efficient for detecting text in Gujarati manuscripts.
Keywords
Gujarati, YOLO, handwritten, digitize, EfficientNet, text detection
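The abstract describes a two-stage design: a detector locates characters, and a separate classifier recognizes each one. The sketch below illustrates, under stated assumptions, how such stages could be chained into machine-readable text. It is not the authors' implementation. The names `Box`, `group_lines`, and `transcribe` are hypothetical, the detector and classifier are passed in as plain callables standing in for YOLOv8 and DenseNet, and the line-grouping rule (vertical centres within half a box height) is an assumption made only for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence


@dataclass(frozen=True)
class Box:
    """Axis-aligned character box (hypothetical type, pixel units)."""
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def group_lines(boxes: Sequence[Box], line_tol: float = 0.5) -> List[List[Box]]:
    """Group boxes into text lines, each sorted left to right.

    Assumption: a box joins the current line when its vertical centre is
    within line_tol * height of the first box of that line.
    """
    lines: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b.y + b.h / 2):
        centre = box.y + box.h / 2
        if lines:
            ref = lines[-1][0]
            if abs(centre - (ref.y + ref.h / 2)) <= line_tol * ref.h:
                lines[-1].append(box)
                continue
        lines.append([box])
    return [sorted(line, key=lambda b: b.x) for line in lines]


def transcribe(
    image: Any,
    detect: Callable[[Any], Sequence[Box]],
    classify: Callable[[Any, Box], str],
) -> str:
    """Run detection, then classify each box in reading order."""
    lines = group_lines(detect(image))
    return "\n".join("".join(classify(image, b) for b in line) for line in lines)


# Stub stages standing in for the trained models (illustration only).
demo_boxes = [Box(30, 0, 10, 10), Box(0, 1, 10, 10), Box(0, 20, 10, 10)]
demo_labels = {demo_boxes[0]: "ખ", demo_boxes[1]: "ક", demo_boxes[2]: "ગ"}
demo_text = transcribe(
    None,
    lambda img: demo_boxes,
    lambda img, box: demo_labels[box],
)
```

Keeping the two stages behind plain callables means either model could be swapped or evaluated separately, which matches the abstract's separate reporting of detection precision and recognition accuracy.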