Results :: Search Electronic Theses and Dissertations (ETDs)
Your search for ETD Subject "Algorithms--Computer programs" resulted in 1 match(es).
Back to search page
- Apriori approach to graph-based clustering of text documents
- Author: Hossain, Mahmud Shahriar
- Date: 2008-05-15
- Program: Computer Science
- Abstract: This thesis report introduces a new technique of document clustering based on frequent senses. The developed system, named GDClust (Graph-Based Document Clustering) [1], works with frequent senses rather than dealing with frequent keywords used in traditional text mining techniques. GDClust presents text documents as hierarchical document-graphs and uses an Apriori paradigm to find the frequent subgraphs, which reflect frequent senses. Discovered frequent subgraphs are then utilized to generate accurate sense-based document clusters. We propose a novel multilevel Gaussian minimum support strat...
- Download File | View full details
Back to search page
print-friendly page | mobile-friendly page