Web Content Extraction Based on Maximum Continuous Sum of Text Density

Abstract: 

Different websites generally use different page structures, which can heavily affect extraction quality when web content is collected automatically. The maximum continuous sum of text density (MCSTD) method can extract web content from differently structured pages efficiently and effectively.
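
The abstract names the technique but does not spell out its steps, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that text density is scored per line of HTML as the share of characters lying outside tags, that a fixed threshold (0.5 here, chosen arbitrarily) is subtracted so markup-heavy lines score negative, and that the content block is the contiguous run of lines with the largest score sum, found with Kadane's maximum-subarray algorithm. The names `text_density`, `max_continuous_sum`, `extract_content`, and `threshold` are hypothetical.

```python
import re

# Crude tag matcher; an assumption for illustration, not a full HTML parser.
TAG_RE = re.compile(r"<[^>]*>")


def text_density(line: str) -> float:
    """Share of a line's characters that remain after removing tags."""
    stripped = line.strip()
    if not stripped:
        return 0.0
    text = TAG_RE.sub("", stripped).strip()
    return len(text) / len(stripped)


def max_continuous_sum(scores):
    """Kadane's algorithm: return (best_sum, start, end), end exclusive."""
    best_sum, best_start, best_end = float("-inf"), 0, 0
    current_sum, current_start = 0.0, 0
    for i, score in enumerate(scores):
        if current_sum <= 0:
            # A non-positive running sum cannot help; restart the run here.
            current_sum, current_start = score, i
        else:
            current_sum += score
        if current_sum > best_sum:
            best_sum, best_start, best_end = current_sum, current_start, i + 1
    return best_sum, best_start, best_end


def extract_content(html: str, threshold: float = 0.5) -> str:
    """Return the text of the contiguous line run with the largest score sum."""
    lines = html.splitlines()
    # Subtracting the (assumed) threshold makes low-density lines negative.
    scores = [text_density(line) - threshold for line in lines]
    _, start, end = max_continuous_sum(scores)
    kept = (TAG_RE.sub("", line).strip() for line in lines[start:end])
    return "\n".join(text for text in kept if text)


if __name__ == "__main__":
    sample = "\n".join([
        '<div class="nav"><a href="/">Home</a></div>',
        "<p>First paragraph of the article body with plenty of words.</p>",
        "<p>Second paragraph, also mostly text rather than markup.</p>",
        '<div class="footer"><a href="/about">About</a></div>',
    ])
    print(extract_content(sample))
```

On the sample page, the navigation and footer lines score below the threshold and fall outside the selected run, so only the two paragraph lines are kept. A line-based score is just one choice; a density computed per DOM node would fit the same maximum-sum step.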

Paper Details

Authors: Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu
Submitted On: 21 November 2016 - 9:34pm
Short Link:
Type: Poster
Event:
Presenter's Name: Kai Sun
Paper Code: IALP105
Document Year: 2016

Document Files

Kai Sun – IALP 2016.pptx

[1] Kai Sun, Miao Li, Jinhua Du, Lei Chen, Zhengxin Yang, Yi Gao, Sha Fu, "Web Content Extraction Based on Maximum Continuous Sum of Text Density", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1288. Accessed: Aug. 19, 2017.