Sorry, you need to enable JavaScript to visit this website.

Knowledge and Data Engineering

Improving the Effectiveness of POI Search by Associated Information Summarization


The demand for map services has risen significantly in recent years due to the popularity of mobile devices and wireless networks. Since there are always emerging point-of-interest (POI) in the real world, mining POIs shared by users from the Web has been a challenging problem to enrich existing POI database. However, crawling address-bearing pages and extracting POI relations are only the fundamentals for constructing POI database, the description of POIs, i.e. the services and products that POIs provide are especially essential for POI search.

Paper Details

Authors:
Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng
Submitted On:
22 November 2016 - 9:56am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

IALP_Text Mining Session

(98 downloads)

Keywords

Additional Categories

Subscribe

[1] Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng, "Improving the Effectiveness of POI Search by Associated Information Summarization", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1294. Accessed: Jun. 24, 2017.
@article{1294-16,
url = {http://sigport.org/1294},
author = {Hsiu-Min Chuang; Chia-Hui Chang; Chung-Ting Cheng },
publisher = {IEEE SigPort},
title = {Improving the Effectiveness of POI Search by Associated Information Summarization},
year = {2016} }
TY - EJOUR
T1 - Improving the Effectiveness of POI Search by Associated Information Summarization
AU - Hsiu-Min Chuang; Chia-Hui Chang; Chung-Ting Cheng
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1294
ER -
Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng. (2016). Improving the Effectiveness of POI Search by Associated Information Summarization. IEEE SigPort. http://sigport.org/1294
Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng, 2016. Improving the Effectiveness of POI Search by Associated Information Summarization. Available at: http://sigport.org/1294.
Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng. (2016). "Improving the Effectiveness of POI Search by Associated Information Summarization." Web.
1. Hsiu-Min Chuang, Chia-Hui Chang, Chung-Ting Cheng. Improving the Effectiveness of POI Search by Associated Information Summarization [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1294

Web Content Extraction Based on Maximum Continuous Sum of Text Density


Generally different websites have different web page structures, which would heavily affect the extraction quality when the web content is automatically collected. The maximum continuous sum of text density (MCSTD) method can extract web content from different web pages efficiently and effectively.

Paper Details

Authors:
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu
Submitted On:
21 November 2016 - 9:34pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Kai Sun – IALP 2016.pptx

(0)

Keywords

Subscribe

[1] Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu, "Web Content Extraction Based on Maximum Continuous Sum of Text Density", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1288. Accessed: Jun. 24, 2017.
@article{1288-16,
url = {http://sigport.org/1288},
author = {Kai Sun; Miao Li; Jinhua Du; Lei Chen; Zhengxin Yang; Yi Gao; Sha Fu },
publisher = {IEEE SigPort},
title = {Web Content Extraction Based on Maximum Continuous Sum of Text Density},
year = {2016} }
TY - EJOUR
T1 - Web Content Extraction Based on Maximum Continuous Sum of Text Density
AU - Kai Sun; Miao Li; Jinhua Du; Lei Chen; Zhengxin Yang; Yi Gao; Sha Fu
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1288
ER -
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. (2016). Web Content Extraction Based on Maximum Continuous Sum of Text Density. IEEE SigPort. http://sigport.org/1288
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu, 2016. Web Content Extraction Based on Maximum Continuous Sum of Text Density. Available at: http://sigport.org/1288.
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. (2016). "Web Content Extraction Based on Maximum Continuous Sum of Text Density." Web.
1. Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. Web Content Extraction Based on Maximum Continuous Sum of Text Density [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1288

Web Content Extraction Based on Maximum Continuous Sum of Text Density


Generally different websites have different web page structures, which would heavily affect the extraction quality when the web content is automatically collected. The maximum continuous sum of text density (MCSTD) method can extract web content from different web pages efficiently and effectively.

Paper Details

Authors:
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu
Submitted On:
21 November 2016 - 9:34pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Kai Sun – IALP 2016.pptx

(0)

Keywords

Subscribe

[1] Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu, "Web Content Extraction Based on Maximum Continuous Sum of Text Density", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1287. Accessed: Jun. 24, 2017.
@article{1287-16,
url = {http://sigport.org/1287},
author = {Kai Sun; Miao Li; Jinhua Du; Lei Chen; Zhengxin Yang; Yi Gao; Sha Fu },
publisher = {IEEE SigPort},
title = {Web Content Extraction Based on Maximum Continuous Sum of Text Density},
year = {2016} }
TY - EJOUR
T1 - Web Content Extraction Based on Maximum Continuous Sum of Text Density
AU - Kai Sun; Miao Li; Jinhua Du; Lei Chen; Zhengxin Yang; Yi Gao; Sha Fu
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1287
ER -
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. (2016). Web Content Extraction Based on Maximum Continuous Sum of Text Density. IEEE SigPort. http://sigport.org/1287
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu, 2016. Web Content Extraction Based on Maximum Continuous Sum of Text Density. Available at: http://sigport.org/1287.
Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. (2016). "Web Content Extraction Based on Maximum Continuous Sum of Text Density." Web.
1. Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu. Web Content Extraction Based on Maximum Continuous Sum of Text Density [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1287

Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks

Paper Details

Authors:
Steven Du , Xi Zhang
Submitted On:
21 November 2016 - 6:56pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

IALP2016-114-PDF

(72 downloads)

Keywords

Additional Categories

Subscribe

[1] Steven Du , Xi Zhang, "Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1286. Accessed: Jun. 24, 2017.
@article{1286-16,
url = {http://sigport.org/1286},
author = {Steven Du ; Xi Zhang },
publisher = {IEEE SigPort},
title = {Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks},
year = {2016} }
TY - EJOUR
T1 - Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks
AU - Steven Du ; Xi Zhang
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1286
ER -
Steven Du , Xi Zhang. (2016). Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks. IEEE SigPort. http://sigport.org/1286
Steven Du , Xi Zhang, 2016. Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks. Available at: http://sigport.org/1286.
Steven Du , Xi Zhang. (2016). "Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks." Web.
1. Steven Du , Xi Zhang. Aicyber’s System for IALP 2016 Shared Task:Character-enhanced Word Vectors and Boosted Neural Networks [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1286

Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity


In this paper, we study language models based on recurrent neural networks on three databases in two languages. We implement basic recurrent neural networks (RNN) and refined RNNs with long short-term memory (LSTM) cells. We use the corpora of Penn Tree Bank (PTB) and AMI in English, and the Academia Sinica Balanced Corpus (ASBC) in Chinese. On ASBC, we investigate wordbased and character-based language models. For characterbased language models, we look into the cases where the inter-word space is treated or not treated as a token.

40_RNN.pdf

PDF icon 40_RNN (70 downloads)

Paper Details

Authors:
Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen
Submitted On:
21 November 2016 - 10:21am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

40_RNN

(70 downloads)

Keywords

Additional Categories

Subscribe

[1] Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen, "Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1285. Accessed: Jun. 24, 2017.
@article{1285-16,
url = {http://sigport.org/1285},
author = {Tzu-Hsuan Yang; Tzu-Hsuan Tseng; Chia-Ping Chen },
publisher = {IEEE SigPort},
title = {Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity},
year = {2016} }
TY - EJOUR
T1 - Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity
AU - Tzu-Hsuan Yang; Tzu-Hsuan Tseng; Chia-Ping Chen
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1285
ER -
Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen. (2016). Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity. IEEE SigPort. http://sigport.org/1285
Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen, 2016. Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity. Available at: http://sigport.org/1285.
Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen. (2016). "Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity." Web.
1. Tzu-Hsuan Yang, Tzu-Hsuan Tseng, Chia-Ping Chen. Recurrent Neural Network-based Language Models with Variation in Net Topology, Language, and Granularity [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1285

Verifying the Long-range Dependency of RNN Language Models


It has been argued that recurrent neural network language models are better in capturing long-range dependency than n-gram language models. In this paper, we attempt to verify this claim by investigating the prediction accuracy and the perplexity of these language models as a function of word position, i.e., the position of a word in a sentence. It is expected that as word position increases, the advantage of using recurrent neural network language models over n-gram language models will become more and more evident.

Paper Details

Authors:
Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen
Submitted On:
21 November 2016 - 10:24am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

41_ngram_rnn

(61 downloads)

Keywords

Additional Categories

Subscribe

[1] Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen, "Verifying the Long-range Dependency of RNN Language Models", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1284. Accessed: Jun. 24, 2017.
@article{1284-16,
url = {http://sigport.org/1284},
author = {Tzu-Hsuan Tseng; Tzu-Hsuan Yang; Chia-Ping Chen },
publisher = {IEEE SigPort},
title = {Verifying the Long-range Dependency of RNN Language Models},
year = {2016} }
TY - EJOUR
T1 - Verifying the Long-range Dependency of RNN Language Models
AU - Tzu-Hsuan Tseng; Tzu-Hsuan Yang; Chia-Ping Chen
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1284
ER -
Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen. (2016). Verifying the Long-range Dependency of RNN Language Models. IEEE SigPort. http://sigport.org/1284
Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen, 2016. Verifying the Long-range Dependency of RNN Language Models. Available at: http://sigport.org/1284.
Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen. (2016). "Verifying the Long-range Dependency of RNN Language Models." Web.
1. Tzu-Hsuan Tseng, Tzu-Hsuan Yang, Chia-Ping Chen. Verifying the Long-range Dependency of RNN Language Models [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1284

History Question Classification and Representation for Chinese Gaokao


In this paper, we propose a question representation based on entity labeling and question classification for a automatic question answering system of Chinese Gaokao history question. A CRF model is used for the entity labeling and SVM/CNN/LSTM models are tested for question classification. Our experiments show that CRF model provides a high performance when used to label informative entities out while neural networks has a promising performance for the question classification task.

80.pdf

PDF icon 80.pdf (81 downloads)

Paper Details

Authors:
Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng
Submitted On:
21 November 2016 - 9:27pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

80.pdf

(81 downloads)

Keywords

Subscribe

[1] Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng, "History Question Classification and Representation for Chinese Gaokao", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1283. Accessed: Jun. 24, 2017.
@article{1283-16,
url = {http://sigport.org/1283},
author = {Ke Yu; Qiuzhi Liu; Yuqing Zheng; Tiejun Zhao; Dequan Zheng },
publisher = {IEEE SigPort},
title = {History Question Classification and Representation for Chinese Gaokao},
year = {2016} }
TY - EJOUR
T1 - History Question Classification and Representation for Chinese Gaokao
AU - Ke Yu; Qiuzhi Liu; Yuqing Zheng; Tiejun Zhao; Dequan Zheng
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1283
ER -
Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng. (2016). History Question Classification and Representation for Chinese Gaokao. IEEE SigPort. http://sigport.org/1283
Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng, 2016. History Question Classification and Representation for Chinese Gaokao. Available at: http://sigport.org/1283.
Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng. (2016). "History Question Classification and Representation for Chinese Gaokao." Web.
1. Ke Yu, Qiuzhi Liu, Yuqing Zheng, Tiejun Zhao, Dequan Zheng. History Question Classification and Representation for Chinese Gaokao [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1283

The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation


This paper presents initial English-Tigrinya statistical machine translation (SMT) research. Tigrinya is a highly inflected Semitic language spoken in Eritrea and Ethiopia. Translation involving morphologically complex languages is challenged by factors including data sparseness and source-target word alignment. We try to address these problems through morphological segmentation of Tigrinya words. After segmentation the difference in token count dropped significantly from 37.7% to 0.1%. The out-of-vocabulary rate was reduced by 46%.

IALP-tig.pdf

PDF icon IALP-tig.pdf (68 downloads)

Paper Details

Authors:
Yemane Tedla and Kazuhide Yamamoto
Submitted On:
21 November 2016 - 8:31am
Short Link:
Type:
Event:
Presenter's Name:
Document Year:
Cite

Document Files

IALP-tig.pdf

(68 downloads)

Keywords

Additional Categories

Subscribe

[1] Yemane Tedla and Kazuhide Yamamoto, "The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1281. Accessed: Jun. 24, 2017.
@article{1281-16,
url = {http://sigport.org/1281},
author = {Yemane Tedla and Kazuhide Yamamoto },
publisher = {IEEE SigPort},
title = {The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation},
year = {2016} }
TY - EJOUR
T1 - The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation
AU - Yemane Tedla and Kazuhide Yamamoto
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1281
ER -
Yemane Tedla and Kazuhide Yamamoto. (2016). The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation. IEEE SigPort. http://sigport.org/1281
Yemane Tedla and Kazuhide Yamamoto, 2016. The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation. Available at: http://sigport.org/1281.
Yemane Tedla and Kazuhide Yamamoto. (2016). "The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation." Web.
1. Yemane Tedla and Kazuhide Yamamoto. The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1281

Word Sense Implantation as Orthographical Conversion


We present a word sense disambiguation (WSD) tool of Japanese Hiragana words. Unlike other WSD tasks which output something like “sense #3” as result, our WSD task rewrites the target word into a Kanji word, which is a different orthography. This means that the task is also a kind of orthographical normalization as well as WSD. In this paper we present the task, our method, and the performance.

IALP-wsd.pdf

PDF icon IALP-wsd.pdf (73 downloads)

Paper Details

Authors:
Kazuhide Yamamoto and Yuki Mikami
Submitted On:
21 November 2016 - 8:27am
Short Link:
Type:
Event:
Presenter's Name:
Document Year:
Cite

Document Files

IALP-wsd.pdf

(73 downloads)

Keywords

Additional Categories

Subscribe

[1] Kazuhide Yamamoto and Yuki Mikami, "Word Sense Implantation as Orthographical Conversion", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1280. Accessed: Jun. 24, 2017.
@article{1280-16,
url = {http://sigport.org/1280},
author = {Kazuhide Yamamoto and Yuki Mikami },
publisher = {IEEE SigPort},
title = {Word Sense Implantation as Orthographical Conversion},
year = {2016} }
TY - EJOUR
T1 - Word Sense Implantation as Orthographical Conversion
AU - Kazuhide Yamamoto and Yuki Mikami
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1280
ER -
Kazuhide Yamamoto and Yuki Mikami. (2016). Word Sense Implantation as Orthographical Conversion. IEEE SigPort. http://sigport.org/1280
Kazuhide Yamamoto and Yuki Mikami, 2016. Word Sense Implantation as Orthographical Conversion. Available at: http://sigport.org/1280.
Kazuhide Yamamoto and Yuki Mikami. (2016). "Word Sense Implantation as Orthographical Conversion." Web.
1. Kazuhide Yamamoto and Yuki Mikami. Word Sense Implantation as Orthographical Conversion [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1280

Detecting Representative Web Articles Using Heterogeneous Graphs

Paper Details

Authors:
Submitted On:
21 November 2016 - 3:13am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

slide.11.21_upload.pptx

(62 downloads)

Keywords

Additional Categories

Subscribe

[1] , "Detecting Representative Web Articles Using Heterogeneous Graphs", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1279. Accessed: Jun. 24, 2017.
@article{1279-16,
url = {http://sigport.org/1279},
author = { },
publisher = {IEEE SigPort},
title = {Detecting Representative Web Articles Using Heterogeneous Graphs},
year = {2016} }
TY - EJOUR
T1 - Detecting Representative Web Articles Using Heterogeneous Graphs
AU -
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1279
ER -
. (2016). Detecting Representative Web Articles Using Heterogeneous Graphs. IEEE SigPort. http://sigport.org/1279
, 2016. Detecting Representative Web Articles Using Heterogeneous Graphs. Available at: http://sigport.org/1279.
. (2016). "Detecting Representative Web Articles Using Heterogeneous Graphs." Web.
1. . Detecting Representative Web Articles Using Heterogeneous Graphs [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1279

Pages