The paper presents 32 low-cost quality factors for detecting web spam, categorized into URL, content, and link features, aimed at improving search engine accuracy. Utilizing a resilient back-propagation learning algorithm in a neural network, the authors achieved a high classification accuracy of 92% during testing with a dataset of 370 pages. Overall, the study emphasizes the economical and real-time applicability of these factors in enhancing spam detection in web pages.
Related topics: