VIDI: A Video Dataset of Incidents

Duygu Sesver, Alp Eren Gencoglu, Cagri Emre Yildiz, Zehra Gunindi, Faeze Habibi, Ziya Ata Yazici, Hazim Kemal Ekenel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Automatic detection of natural disasters and incidents has become more important as a tool for fast response. There have been many studies to detect incidents using still images and text. However, the number of approaches that exploit temporal information is rather limited. One of the main reasons for this is that a diverse video dataset with various incident types does not exist. To address this need, in this paper we present a video dataset - Video Dataset of Incidents, VIDI - that contains 4,534 video clips corresponding to 43 incident categories. Each incident class has around 100 videos with a duration of ten seconds on average. To increase diversity, the videos have been searched in several languages. To assess the performance of the recent state-of-the-art approaches, Vision Transformer and TimeSformer, as well as to explore the contribution of video-based information for incident classification, we performed benchmark experiments on the VIDI and Incidents Dataset. We have shown that the recent methods improve the incident classification accuracy. We have found that employing video data is very beneficial for the task. By using the video data, the top-1 accuracy is increased to 76.56% from 67.37%, which was obtained using a single frame. VIDI will be made publicly available. Additional materials can be found at the following link: https://github.com/vididataset/VIDI

Original languageEnglish
Title of host publicationIVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665478229
DOIs
Publication statusPublished - 2022
Event14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 - Nafplio, Greece
Duration: 26 Jun 202229 Jun 2022

Publication series

NameIVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop

Conference

Conference14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022
Country/TerritoryGreece
CityNafplio
Period26/06/2229/06/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Funding

We would like to thank Ferda Ofli for his support and Ethan Weber for sharing test set of the Incidents Dataset. This study was funded by the Scientific and Technological Research Council of Turkey (TUBITAK) Grant No 121E408 and Istanbul Technical University Research Fund, project code MGA-2020-42547. Finally, we would like to thank Lely Turkey Product Development for travel grant for Duygu Sesver. This study was funded by the Scientific and Technological Research Council of Turkey (TUBITAK) Grant No 121E408 and Istanbul Technical University Research Fund, project code MGA- 2020-42547. Finally, we would like to thank Lely Turkey Product Development for travel grant for Duygu Sesver

FundersFunder number
Duygu Sesver
Türkiye Bilimsel ve Teknolojik Araştırma Kurumu121E408
Istanbul Teknik ÜniversitesiMGA-2020-42547

    Keywords

    • Video processing
    • incident classification
    • video incidents dataset

    Fingerprint

    Dive into the research topics of 'VIDI: A Video Dataset of Incidents'. Together they form a unique fingerprint.

    Cite this