Priya, R. Vishnu, Vijayakumar, V. and Yang, Longzhi (2019) Semantics Based Web Ranking Using a Robust Weight Scheme. International Journal of Web Portals, 11 (1). pp. 47-63. ISSN 1938-0194
|
Text (Full text)
Priya et al - Semantics Based Web Ranking Using a Robust Weight Scheme.pdf - Published Version Download (485kB) | Preview |
Abstract
In this paper, HTML tags and attributes are used to determine different structural position of text in a web page. Tags- attributes based models are used to assign a weight to a text that exist in different structural position of web page. Genetic algorithms (GAs), harmony search (HS), and particle swarm optimization (PSO) algorithms are used to select the informative terms using a novel tags-attributes and term frequency weighting scheme. These informative terms with heuristic weight give emphasis to important terms, qualifying how well they semantically explain a webpage and distinguish them from each other. The proposed approach is developed by customizing Terrier and tested over the Clueweb09B, WT10g, .GOV2 and uncontrolled data collections. The performance of the proposed approach is found to be encouraging against five baseline ranking models. The percentage gain of approach achieved is 75-90%, 70-83% and 43-60% in P@5, P@10 and MAP, respectively.
Item Type: | Article |
---|---|
Subjects: | G500 Information Systems |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Depositing User: | Paul Burns |
Date Deposited: | 14 Jan 2019 11:36 |
Last Modified: | 01 Aug 2021 07:46 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/37564 |
Downloads
Downloads per month over past year