This document discusses a fuzzy clustering algorithm designed to extract and discover contextual meanings in web documents by utilizing a fuzzy linguistic topological space. It identifies issues with existing clustering methods that require prior knowledge of cluster counts and proposes a system that organizes semantic complexes for better topic relevance. The proposed system consists of three modules: a word search engine, anchor disambiguation, and anchor parsing, all aimed at improving the clustering and semantic understanding of web content.
Related topics: