Event Source Page Discovery via Policy-Based RL with Multi-task Neural Sequence Model

Chia Hui Chang, Yu Ching Liao, Ting Yeh

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The problem of finding event announcement pages for any given website is called event source page discovery. In this paper, we present a policy-based deep reinforcement learning (RL) model for the event source page discovery agent. We train the agent in two stages: pre-training and fine-tuning. In the pre-training phase, the model is trained with limited labeled data, where each episode has a fixed number of steps. In the fine-tuning phase, the agent is trained using unlabeled data and a reward system based on an event source page classifier. The agent learns whether to continue or stop exploring through an adaptive threshold. The proposed agent achieves 74% precision with a unit cost of 1.28 (the average number of clicks for each event source page) on the real-world data set.
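The two figures reported in the abstract can be illustrated with a small calculation. The sketch below is only an assumption about how they might be computed: the abstract defines unit cost as the average number of clicks per event source page, but it does not spell out the precision formula, so the standard definition (correct pages divided by reported pages) is assumed here. The `Episode` record, its field names, and the toy numbers are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Episode:
    """One crawl of a single website (hypothetical record, not from the paper)."""
    clicks: int    # links the agent followed before it stopped
    reported: int  # pages the agent reported as event source pages
    correct: int   # reported pages that really are event source pages


def precision(episodes: List[Episode]) -> float:
    """Assumed definition: correct reported pages / all reported pages."""
    reported = sum(e.reported for e in episodes)
    correct = sum(e.correct for e in episodes)
    return correct / reported if reported else 0.0


def unit_cost(episodes: List[Episode]) -> float:
    """Average number of clicks spent per correctly found event source page."""
    clicks = sum(e.clicks for e in episodes)
    correct = sum(e.correct for e in episodes)
    return clicks / correct if correct else float("inf")


# Toy data, made up for illustration only.
episodes = [Episode(3, 2, 2), Episode(2, 2, 1), Episode(4, 4, 3)]
print(precision(episodes))  # 6 correct / 8 reported = 0.75
print(unit_cost(episodes))  # 9 clicks / 6 correct = 1.5
```

Under these assumed definitions, a lower unit cost means the agent spends fewer clicks for each event source page it finds, which is the trade-off the stop-or-continue decision is meant to control.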

Original language: English
Title of host publication: Web Information Systems Engineering – WISE 2022 - 23rd International Conference, Proceedings
Editors: Richard Chbeir, Helen Huang, Fabrizio Silvestri, Yannis Manolopoulos, Yanchun Zhang
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 597-606
Number of pages: 10
ISBN (Print): 9783031208904
State: Published - 2022
Event: 23rd International Conference on Web Information Systems Engineering, WISE 2022 - Biarritz, France
Duration: 1 Nov 2022 – 3 Nov 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13724 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 23rd International Conference on Web Information Systems Engineering, WISE 2022
Country/Territory: France
City: Biarritz
Period: 1/11/22 – 3/11/22

Keywords

  • Event source page discovery
  • Multi-task neural model
  • Reinforcement learning
  • Web mining
