Graph Mining under Linguistic Constraints for Exploring Large Texts

Solen Quiniou, Peggy Cellier, Thierry Charnois, Dominique Legallois


In this paper, we propose an approach to
explore large texts by highlighting coherent sub-parts.
The exploration method relies on a graph representation
of the text built according to Hoey’s linguistic model,
which allows adjacent and non-adjacent sentences to be
selected and bound together. Our main contribution is
a method that combines Hoey’s linguistic model with a
dedicated graph mining technique, called CoHoP mining,
to extract coherent
sub-parts of the graph representation of the text.
Experiments on several English texts illustrate the
value of the proposed approach.



Text coherence, graph representation, graph mining, Hoey’s linguistic model.

