From small-scale to large-scale text classification

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    17 Citations (Scopus)

    Abstract

    Neural network models have achieved impressive results in the field of text classification. However, existing approaches often suffer from insufficient training data in a large-scale text classification involving a large number of categories (e.g., several thousands of categories). Several neural network models have utilized multi-task learning to overcome the limited amount of training data. However, these approaches are also limited to small-scale text classification. In this paper, we propose a novel neural network-based multi-task learning framework for large-scale text classification. To this end, we first treat the different scales of text classification (i.e., large and small numbers of categories) as multiple, related tasks. Then, we train the proposed neural network, which learns small- and large-scale text classification tasks simultaneously. In particular, we further enhance this multi-task learning architecture by using a gate mechanism, which controls the flow of features between the small- and large-scale text classification tasks. Experimental results clearly show that our proposed model improves the performance of the large-scale text classification task with the help of the small-scale text classification task. The proposed scheme exhibits significant improvements of as much as 14% and 5% in terms of micro-averaging and macro-averaging F1-score, respectively, over state-of-the-art techniques.

    Original languageEnglish
    Title of host publicationThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019
    PublisherAssociation for Computing Machinery, Inc
    Pages853-862
    Number of pages10
    ISBN (Electronic)9781450366748
    DOIs
    Publication statusPublished - 2019 May 13
    Event2019 World Wide Web Conference, WWW 2019 - San Francisco, United States
    Duration: 2019 May 132019 May 17

    Publication series

    NameThe Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019

    Conference

    Conference2019 World Wide Web Conference, WWW 2019
    Country/TerritoryUnited States
    CitySan Francisco
    Period19/5/1319/5/17

    Bibliographical note

    Publisher Copyright:
    © 2019 IW3C2 (International World Wide Web Conference Committee), published under Creative Commons CC-BY 4.0 License.

    Keywords

    • Deep Neural Networks
    • Large-scale Text Classification
    • Multi-task Learning

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Software

    Fingerprint

    Dive into the research topics of 'From small-scale to large-scale text classification'. Together they form a unique fingerprint.

    Cite this