Utilizing probase in open directory project-based text classification

So Young Jun, Dinara Aliyeva, Ji Min Lee, Sang-Geun Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)


Open Directory Project (ODP) has been successfully utilized in text classification due to its representation ability of various categories. However, ODP includes a limited number of entities, which play an important role in classification tasks. In this paper, we enrich the semantics of ODP categories with Probase entities. To effectively incorporate Probase entities in ODP categories, we first represent each ODP category and Probase entity in terms of concepts. Next, we measure the semantic relevance between an ODP category and a Probase entity based on the concept vector. Finally, we use Probase entity to enrich the semantics of the ODP categories. Our experimental results show that the proposed methodology exhibits a significant improvement over state-of-the-art techniques in the ODP-based text classification.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Fuzzy Systems, FUZZ 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509060207
Publication statusPublished - 2018 Oct 12
Event2018 IEEE International Conference on Fuzzy Systems, FUZZ 2018 - Rio de Janeiro, Brazil
Duration: 2018 Jul 82018 Jul 13

Publication series

NameIEEE International Conference on Fuzzy Systems
ISSN (Print)1098-7584


Other2018 IEEE International Conference on Fuzzy Systems, FUZZ 2018
CityRio de Janeiro

Bibliographical note

Funding Information:
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (number 2015R1A2A1A10052665).

Publisher Copyright:
© 2018 IEEE.

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Artificial Intelligence
  • Applied Mathematics


Dive into the research topics of 'Utilizing probase in open directory project-based text classification'. Together they form a unique fingerprint.

Cite this