An OCR System for Baybayin Scripts using SVM
In this paper, we aim to distinguish the Baybayin script, a pre-colonial writing system used in the Philippines, from the Latin script at the character level. The proposed algorithm uses four main Support Vector Machine (SVM) classifiers, one for each of the following tasks: Baybayin versus Latin script, Baybayin characters, Latin characters, and Baybayin diacritical marks. The method emphasizes the recognition of Baybayin characters, so we tested the algorithm on the set of images in (1) that satisfy our system assumptions. We also include the code used to generate these classifiers from the datasets in (2), (3), and (4) for Baybayin characters, Latin characters, and Baybayin diacritics, respectively. Finally, we discuss the strengths and limitations of the system, its experimental results, and recommendations for further research.
Dataset URLs:
(1) https://www.kaggle.com/jamesnogra/baybayn-baybayin-handwritten-images
(2) https://www.kaggle.com/rodneypino/baybayin-and-latin-binary-images-in-mat-format?select=Baybayin
(3) https://www.kaggle.com/rodneypino/baybayin-and-latin-binary-images-in-mat-format?select=Latin
(4) https://www.kaggle.com/rodneypino/baybayin-and-latin-binary-images-in-mat-format?select=Baybayin+Diacritics
You can check the full paper here: https://peerj.com/articles/cs-360/.
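As a rough illustration of how the four classifiers named in the abstract might be chained, the Python sketch below routes a character through a script classifier first and then through the character classifier for that script. The routing order, the function name `classify_glyph`, the label strings, and the way the diacritic classifier is attached are all assumptions made for illustration; the repository's MATLAB code and the paper describe the actual method. The classifiers are plain callables standing in for trained SVMs.

```python
from typing import Callable, Optional, Sequence, Tuple

# A "classifier" here is any callable that maps a feature vector to a label.
# In a real system these would be trained SVM models; this is a stand-in.
Classifier = Callable[[Sequence[float]], str]


def classify_glyph(
    features: Sequence[float],
    script_clf: Classifier,
    baybayin_clf: Classifier,
    latin_clf: Classifier,
    diacritic_clf: Optional[Classifier] = None,
    mark_features: Optional[Sequence[float]] = None,
) -> Tuple[str, str, Optional[str]]:
    """Route one character through a two-stage decision (assumed order).

    Stage 1 picks the script. Stage 2 applies the character classifier
    for that script. If the script is Baybayin and features for an
    accompanying mark are supplied, the diacritic classifier labels it.
    Returns (script, character, diacritic_or_None).
    """
    script = script_clf(features)
    if script == "latin":
        return script, latin_clf(features), None
    if script == "baybayin":
        mark = None
        if diacritic_clf is not None and mark_features is not None:
            mark = diacritic_clf(mark_features)
        return script, baybayin_clf(features), mark
    raise ValueError(f"unexpected script label: {script!r}")


# Hypothetical stub classifiers, used only to exercise the routing logic.
def stub_script(f: Sequence[float]) -> str:
    return "baybayin" if f[0] > 0.5 else "latin"


def stub_baybayin(f: Sequence[float]) -> str:
    return "ba"


def stub_latin(f: Sequence[float]) -> str:
    return "b"


def stub_diacritic(f: Sequence[float]) -> str:
    return "kudlit-above"


result_baybayin = classify_glyph(
    [0.9], stub_script, stub_baybayin, stub_latin,
    diacritic_clf=stub_diacritic, mark_features=[0.1],
)
result_latin = classify_glyph([0.2], stub_script, stub_baybayin, stub_latin)
```

Keeping each classifier behind a simple callable means any trained model that exposes a predict-style function could be dropped in without changing the routing.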
Cite As
Rodney Pino (2024). An OCR System for Baybayin Scripts using SVM (https://github.com/rbp0803/An-OCR-System-for-Baybayin-Scripts-using-SVM), GitHub. Retrieved .
Platform Compatibility
Windows macOS Linux
Version | Published | Release Notes
---|---|---
1.0.3 | | Uploaded the .m files to the main branch. Apologies for the mismanaged upload.
1.0.2 | | See release notes for this release on GitHub: https://github.com/rbp0803/An-OCR-System-for-Baybayin-Scripts-using-SVM/releases/tag/v1.0.2
1.0.1 | | Title change (minor): changed the first word of the title from "Proposed" to "An".
1.0.0 | |