Abstract
Text detection is one of the most challenging and commonly dealt applications in computer vision. Detecting text regions is the first step of the text recognition systems called Optical Character Recognition. This process requires the separation of text region from non-text region. In this paper, we utilize Maximally Stable Extremal Regions to acquire very first text region candidates. Then these possible regions are reduced in quantity by using geometric and stroke width properties. Candidate regions are joined to obtain text groups. Finally, Tesseract Optical Character Recognition engine is utilized as the last step to eliminate non-text groups. We evaluated the proposed system on KAIST and ICDAR datasets for both natural images and computer-generated images. For natural images 82.7% precision and 52.0% f-accuracy; for computer-generated images 64.0% precision and 65.2% f-accuracy is achieved.
Original language | English |
---|---|
Title of host publication | 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1-4 |
Number of pages | 4 |
ISBN (Electronic) | 9781538615010 |
DOIs | |
Publication status | Published - 5 Jul 2018 |
Event | 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 - Izmir, Turkey Duration: 2 May 2018 → 5 May 2018 |
Publication series
Name | 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 |
---|
Conference
Conference | 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 |
---|---|
Country/Territory | Turkey |
City | Izmir |
Period | 2/05/18 → 5/05/18 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Geometric and stroke width properties
- Maximally stable extremal regions
- Non-text region elimination
- Text detection