Abstract
Automatic detection of natural disasters and incidents has become more important as a tool for fast response. There have been many studies to detect incidents using still images and text. However, the number of approaches that exploit temporal information is rather limited. One of the main reasons for this is that a diverse video dataset with various incident types does not exist. To address this need, in this paper we present a video dataset - Video Dataset of Incidents, VIDI - that contains 4,534 video clips corresponding to 43 incident categories. Each incident class has around 100 videos with a duration of ten seconds on average. To increase diversity, the videos have been searched in several languages. To assess the performance of the recent state-of-the-art approaches, Vision Transformer and TimeSformer, as well as to explore the contribution of video-based information for incident classification, we performed benchmark experiments on the VIDI and Incidents Dataset. We have shown that the recent methods improve the incident classification accuracy. We have found that employing video data is very beneficial for the task. By using the video data, the top-1 accuracy is increased to 76.56% from 67.37%, which was obtained using a single frame. VIDI will be made publicly available. Additional materials can be found at the following link: https://github.com/vididataset/VIDI
Original language | English |
---|---|
Title of host publication | IVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781665478229 |
DOIs | |
Publication status | Published - 2022 |
Event | 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 - Nafplio, Greece Duration: 26 Jun 2022 → 29 Jun 2022 |
Publication series
Name | IVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop |
---|
Conference
Conference | 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 |
---|---|
Country/Territory | Greece |
City | Nafplio |
Period | 26/06/22 → 29/06/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Funding
We would like to thank Ferda Ofli for his support and Ethan Weber for sharing test set of the Incidents Dataset. This study was funded by the Scientific and Technological Research Council of Turkey (TUBITAK) Grant No 121E408 and Istanbul Technical University Research Fund, project code MGA-2020-42547. Finally, we would like to thank Lely Turkey Product Development for travel grant for Duygu Sesver. This study was funded by the Scientific and Technological Research Council of Turkey (TUBITAK) Grant No 121E408 and Istanbul Technical University Research Fund, project code MGA- 2020-42547. Finally, we would like to thank Lely Turkey Product Development for travel grant for Duygu Sesver
Funders | Funder number |
---|---|
Duygu Sesver | |
Türkiye Bilimsel ve Teknolojik Araştırma Kurumu | 121E408 |
Istanbul Teknik Üniversitesi | MGA-2020-42547 |
Keywords
- Video processing
- incident classification
- video incidents dataset