UTOPIAN: User-driven topic modeling based on interactive nonnegative matrix factorization

Jaegul Choo, Changhyun Lee, Chandan K. Reddy, Haesun Park

    Research output: Contribution to journalArticlepeer-review

    243 Citations (Scopus)

    Abstract

    Topic modeling has been widely used for analyzing text document collections. Recently, there have been significant advancements in various topic modeling techniques, particularly in the form of probabilistic graphical modeling. State-of-the-art techniques such as Latent Dirichlet Allocation (LDA) have been successfully applied in visual text analytics. However, most of the widely-used methods based on probabilistic modeling have drawbacks in terms of consistency from multiple runs and empirical convergence. Furthermore, due to the complicatedness in the formulation and the algorithm, LDA cannot easily incorporate various types of user feedback. To tackle this problem, we propose a reliable and flexible visual analytics system for topic modeling called UTOPIAN (User-driven Topic modeling based on Interactive Nonnegative Matrix Factorization). Centered around its semi-supervised formulation, UTOPIAN enables users to interact with the topic modeling method and steer the result in a user-driven manner. We demonstrate the capability of UTOPIAN via several usage scenarios with real-world document corpuses such as InfoVis/VAST paper data set and product review data sets.

    Original languageEnglish
    Article number6634167
    Pages (from-to)1992-2001
    Number of pages10
    JournalIEEE Transactions on Visualization and Computer Graphics
    Volume19
    Issue number12
    DOIs
    Publication statusPublished - 2013

    Keywords

    • Latent dirichlet allocation
    • interactive clustering
    • nonnegative matrix factorization
    • text analytics
    • topic modeling
    • visual analytics

    ASJC Scopus subject areas

    • Software
    • Signal Processing
    • Computer Vision and Pattern Recognition
    • Computer Graphics and Computer-Aided Design

    Fingerprint

    Dive into the research topics of 'UTOPIAN: User-driven topic modeling based on interactive nonnegative matrix factorization'. Together they form a unique fingerprint.

    Cite this