Semantics Based Web Ranking Using a Robust Weight Scheme

Priya, R. Vishnu, Vijayakumar, V. and Yang, Longzhi (2019) Semantics Based Web Ranking Using a Robust Weight Scheme. International Journal of Web Portals, 11 (1). pp. 47-63. ISSN 1938-0194

Text (Full text)
Priya et al - Semantics Based Web Ranking Using a Robust Weight Scheme.pdf - Published Version

Official URL: http://dx.doi.org/10.4018/IJWP.2019010104

Abstract

In this paper, HTML tags and attributes are used to determine the different structural positions of text in a web page. Tag-attribute based models are used to assign a weight to text that exists in different structural positions of a web page. Genetic algorithms (GAs), harmony search (HS), and particle swarm optimization (PSO) algorithms are used to select the informative terms using a novel tags-attributes and term frequency weighting scheme. These informative terms, with their heuristic weights, give emphasis to important terms, qualifying how well they semantically explain a web page and distinguish web pages from each other. The proposed approach is developed by customizing Terrier and tested over the Clueweb09B, WT10g, .GOV2 and uncontrolled data collections. The performance of the proposed approach is found to be encouraging against five baseline ranking models. The percentage gains achieved by the approach are 75-90%, 70-83% and 43-60% in P@5, P@10 and MAP, respectively.
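
As a rough illustration of the idea of weighting terms by their structural position, the following minimal Python sketch accumulates a tag-dependent weight for each term occurrence. It is not the authors' scheme: the weight table `TAG_WEIGHTS`, the class `TagWeightedTF`, and the function `weighted_term_scores` are hypothetical names and values chosen for illustration, attributes are ignored, and the GA, HS, and PSO term selection step is not shown.

```python
from collections import Counter
from html.parser import HTMLParser
import re

# Hypothetical weights per structural position (assumed values, not from the paper).
TAG_WEIGHTS = {"title": 5.0, "h1": 4.0, "h2": 3.0, "b": 2.0, "strong": 2.0, "a": 1.5}
DEFAULT_WEIGHT = 1.0
IGNORED_TAGS = {"script", "style"}  # text inside these is not page content


class TagWeightedTF(HTMLParser):
    """Accumulates, per term, the weight of the heaviest enclosing tag."""

    def __init__(self):
        super().__init__()
        self.stack = []          # currently open tags, outermost first
        self.scores = Counter()  # term -> accumulated weight

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy HTML.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        if any(t in IGNORED_TAGS for t in self.stack):
            return
        # Each occurrence counts once, scaled by the heaviest enclosing tag.
        weight = max(
            (TAG_WEIGHTS.get(t, DEFAULT_WEIGHT) for t in self.stack),
            default=DEFAULT_WEIGHT,
        )
        for term in re.findall(r"[a-z0-9]+", data.lower()):
            self.scores[term] += weight


def weighted_term_scores(html):
    """Return a dict mapping each term to its tag-weighted frequency."""
    parser = TagWeightedTF()
    parser.feed(html)
    parser.close()
    return dict(parser.scores)
```

With these assumed weights, a term that appears in the title and a heading ends up scoring well above the same term appearing only in body text, which is the kind of emphasis the abstract describes.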

Item Type: Article
Subjects: G500 Information Systems
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Paul Burns
Date Deposited: 14 Jan 2019 11:36
Last Modified: 15 Jan 2019 09:45
URI: http://nrl.northumbria.ac.uk/id/eprint/37564
