TY - JOUR
T1 - Enabling maps/location searches on mobile devices
T2 - constructing a POI database via focused crawling and information extraction
AU - Chuang, Hsiu Min
AU - Chang, Chia Hui
AU - Kao, Ting Yao
AU - Cheng, Chung Ting
AU - Huang, Ya Yun
AU - Cheong, Kuo Pin
N1 - Publisher Copyright:
© 2016 Taylor & Francis.
PY - 2016/7/2
Y1 - 2016/7/2
N2 - With the popularity of mobile devices and smartphones, we have witnessed rapid growth in mobile applications and services, especially in location-based services (LBS). According to a mobile marketing survey, maps/location searches are among the most utilized services on smartphones. Points of interest (POIs), such as stores, shops, gas stations, parking lots, and bus stops, are particularly important for maps/location searches. Existing map services such as Google Maps and Wikimapia are constructed manually either professionally or with crowd sourcing. However, manual annotation is costly and limited in current POI search services. With the abundance of information on the Web, many store POIs can be extracted from the Web. In this paper, we focus on automatically constructing a POI database to enable store POI map searches. We propose techniques that are required to construct a POI database, including focused crawling, information extraction, and information retrieval techniques. We first crawl Yellow Page web sites to obtain vocabularies of store names. These vocabularies are then investigated with search engines to obtain sentences containing these store names from search snippets in order to train a store name recognition model. To extract POIs scattered across the Web, we propose a query-based crawler to find address-bearing pages that might be used to extract addresses and store names. We crawled 1.25 million distinct POI pairs scattered across the Web and implemented a POI search service via Apache Lucent’s search platform, called Solr. The experimental results demonstrate that the proposed geographical information retrieval model outperforms Wikimapia and a commercial app called ‘What’s the Number?’
AB - With the popularity of mobile devices and smartphones, we have witnessed rapid growth in mobile applications and services, especially in location-based services (LBS). According to a mobile marketing survey, maps/location searches are among the most utilized services on smartphones. Points of interest (POIs), such as stores, shops, gas stations, parking lots, and bus stops, are particularly important for maps/location searches. Existing map services such as Google Maps and Wikimapia are constructed manually either professionally or with crowd sourcing. However, manual annotation is costly and limited in current POI search services. With the abundance of information on the Web, many store POIs can be extracted from the Web. In this paper, we focus on automatically constructing a POI database to enable store POI map searches. We propose techniques that are required to construct a POI database, including focused crawling, information extraction, and information retrieval techniques. We first crawl Yellow Page web sites to obtain vocabularies of store names. These vocabularies are then investigated with search engines to obtain sentences containing these store names from search snippets in order to train a store name recognition model. To extract POIs scattered across the Web, we propose a query-based crawler to find address-bearing pages that might be used to extract addresses and store names. We crawled 1.25 million distinct POI pairs scattered across the Web and implemented a POI search service via Apache Lucent’s search platform, called Solr. The experimental results demonstrate that the proposed geographical information retrieval model outperforms Wikimapia and a commercial app called ‘What’s the Number?’
KW - geographic information retrieval
KW - Location-based service
KW - POI crawling
KW - POI relation pairing
KW - store name recognition
UR - http://www.scopus.com/inward/record.url?scp=84954222299&partnerID=8YFLogxK
U2 - 10.1080/13658816.2015.1133820
DO - 10.1080/13658816.2015.1133820
M3 - 期刊論文
AN - SCOPUS:84954222299
SN - 1365-8816
VL - 30
SP - 1405
EP - 1425
JO - International Journal of Geographical Information Science
JF - International Journal of Geographical Information Science
IS - 7
ER -