Labeling consecutive search query pairs using siamese networks

Nurullah Ateş, Yusuf Yaslan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

As internet users interact with search engines to meet their information needs, a huge amount of search queries are stored. Proper analysis of such query data enhances prediction and understanding of user tasks. User tasks can be used to increase the performance of search engines and recommendations. Query segmentation is an initial step that is commonly performed while analyzing user queries. It determines whether two consecutive query expressions belong to the same sub-Task. Any deficits in query segmentation process is likely to affect all other advanced query based steps and activities like task identification and query suggestion. Recently, some researchers focused on application of algorithms including Recurrent Neural Networks (RNN) to seek for the semantics of queries, and attention based neural networks, but such methodologies are not task-specific. In this paper, we propose a Siamese Convolutional Neural Network (CNN) that models input queries into a more task-specific embedding and a decider network that does the labelling. The proposed method is compared with Context Attention Based Long Short Term Memory (CA-LSTM) and Bi-RNN Gated Retified Unit (GRU) models on Webis Search Mission Corpus 2012 (WSMC12) and Cross-Session Task Extraction (CSTE) datasets. The proposed model performs 95%, implying a 1% improvement over the already existing models and an accuracy of 81% on CSTE dataset implying an improvement classification accuracy of 6% over the previous best results.

Original languageEnglish
Title of host publicationIET Conference Proceedings
PublisherInstitution of Engineering and Technology
Pages206-210
Number of pages5
Volume2021
Edition1
ISBN (Electronic)9781839534300
DOIs
Publication statusPublished - 2021
Event11th International Conference of Pattern Recognition Systems, ICPRS 2021 - Virtual, Online
Duration: 17 Mar 202119 Mar 2021

Conference

Conference11th International Conference of Pattern Recognition Systems, ICPRS 2021
CityVirtual, Online
Period17/03/2119/03/21

Bibliographical note

Publisher Copyright:
© 2021 IET Conference Proceedings. All rights reserved.

Keywords

  • Query Log Data Segmentation
  • Siamese Neural Network
  • User Task Extraction.

Fingerprint

Dive into the research topics of 'Labeling consecutive search query pairs using siamese networks'. Together they form a unique fingerprint.

Cite this