Abstract
In this work, we propose a method for detecting numbers spelled out in words within noisy text and efficiently converting them into their numerical representations. We design a greedy algorithm that detects the spans of number expressions and use a fast, memory-efficient data structure based on word graphs for digit conversion. The proposed data structure scored 88.2% on a synthetic dataset, and the overall system achieved 70.3% on a real dataset.
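The two stages named in the abstract, greedy span detection followed by conversion to digits, can be illustrated with a minimal sketch. It is not the authors' implementation: it uses English number words rather than Turkish, replaces the word-graph structure with plain dictionary lookups, and handles noise only by lowercasing and stripping punctuation. All names (`UNITS`, `TENS`, `SCALES`, `normalize`, `words_to_int`, `convert_numbers`) are hypothetical.

```python
# Illustrative sketch only: English vocabulary and dictionary lookups are
# assumptions; the paper's word-graph data structure is not reproduced here.
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"hundred": 100, "thousand": 1_000, "million": 1_000_000}
NUMBER_WORDS = set(UNITS) | set(TENS) | set(SCALES)


def normalize(token: str) -> str:
    """Lowercase a token and strip surrounding punctuation."""
    return token.lower().strip(".,;:!?")


def words_to_int(words: list[str]) -> int:
    """Convert a sequence of normalized number words to an integer."""
    total, current = 0, 0
    for word in words:
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            # A bare "hundred" is read as "one hundred".
            current = max(current, 1) * 100
        else:
            # "thousand" or "million": close the current group.
            total += max(current, 1) * SCALES[word]
            current = 0
    return total + current


def convert_numbers(text: str) -> str:
    """Greedily replace each maximal run of number words with digits."""
    tokens = text.split()
    output = []
    i = 0
    while i < len(tokens):
        # Extend the span as far as consecutive tokens are number words.
        j = i
        while j < len(tokens) and normalize(tokens[j]) in NUMBER_WORDS:
            j += 1
        if j > i:
            span = [normalize(t) for t in tokens[i:j]]
            # Punctuation attached to tokens inside the span is dropped.
            output.append(str(words_to_int(span)))
            i = j
        else:
            output.append(tokens[i])
            i += 1
    return " ".join(output)


print(convert_numbers("I paid Two Hundred forty five dollars"))
```

In this sketch the greedy step always takes the longest run of consecutive number words, so adjacent but separate numbers (for example "one two") would be merged into a single value; a real system would need additional rules to split such spans.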
Translated title of the contribution | Conversion of number expressions within noisy text into numerical representation |
---|---|
Original language | Turkish |
Title of host publication | 2017 25th Signal Processing and Communications Applications Conference, SIU 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781509064946 |
Publication status | Published - 27 Jun 2017 |
Event | 25th Signal Processing and Communications Applications Conference, SIU 2017 - Antalya, Turkey; Duration: 15 May 2017 → 18 May 2017 |
Publication series
Name | 2017 25th Signal Processing and Communications Applications Conference, SIU 2017 |
---|---|
Conference
Conference | 25th Signal Processing and Communications Applications Conference, SIU 2017 |
---|---|
Country/Territory | Turkey |
City | Antalya |
Period | 15/05/17 → 18/05/17 |
Bibliographical note
Publisher Copyright: © 2017 IEEE.