Discovering high-quality threaded discussions in online forums

Jung Tae Lee, Min Chul Yang, Hae Chang Rim

    Research output: Contribution to journalArticlepeer-review

    18 Citations (Scopus)

    Abstract

    Archives of threaded discussions generated by users in online forums and discussion boards contain valuable knowledge on various topics. However, not all threads are useful because of deliberate abuses, such as trolling and flaming, that are commonly observed in online conversations. The existence of various users with different levels of expertise also makes it difficult to assume that every discussion thread stored online contains high-quality contents. Although finding high-quality threads automatically can help both users and search engines sift through a huge amount of thread archives and make use of these potentially useful resources effectively, no previous work to our knowledge has performed a study on such task. In this paper, we propose an automatic method for distinguishing high-quality threads from low-quality ones in online discussion sites. We first suggest four different artificial measures for inducing overall quality of a thread based on ratings of its posts. We then propose two tasks involving prediction of thread quality without using post rating information. We adopt a popular machine learning framework to solve the two prediction tasks. Experimental results on a real world forum archive demonstrate that our method can significantly improve the prediction performance across all four measures of thread quality on both tasks. We also compare how different types of features derived from various aspects of threads contribute to the overall performance and investigate key features that play a crucial role in discovering high-quality threads in online discussion sites.

    Original languageEnglish
    Pages (from-to)519-531
    Number of pages13
    JournalJournal of Computer Science and Technology
    Volume29
    Issue number3
    DOIs
    Publication statusPublished - 2014 May

    Bibliographical note

    Funding Information:
    Regular Paper This research was partially supported by the Ministry of Knowledge Economy (MKE), Korea, and Microsoft Research through the IT/SW Creative Research Program supervised by the National IT Industry Promotion Agency (NIPA) of Korea under Grant No. NIPA-2012-H0503-12-1012, and the Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning of Korea under Grant No. NRF-2012M3C4A7033344. Part of this work was done while the first author was an intern at Microsoft Research Asia, Beijing. ∗Corresponding Author ©2014 Springer Science + Business Media, LLC & Science Press, China

    Keywords

    • discussion board
    • online forum
    • thread quality

    ASJC Scopus subject areas

    • Software
    • Theoretical Computer Science
    • Hardware and Architecture
    • Computer Science Applications
    • Computational Theory and Mathematics

    Fingerprint

    Dive into the research topics of 'Discovering high-quality threaded discussions in online forums'. Together they form a unique fingerprint.

    Cite this