This document discusses document classification using graphs and Neo4j. It introduces hierarchical pattern recognition (HPR) for graph-based document classification. HPR learns deep feature representations in a hierarchy using finite state machines. The features are mapped to a vector space model for classification. The document demonstrates HPR by classifying US presidential speeches by political affiliation, achieving over 70% similarity for predicted vs actual labels. It encourages attendees to get involved in the Neo4j community.
Related topics: