Association analysis in item response datasets

Eun Young Kwak, Hyeoncheol Kim

    Research output: Contribution to journal › Article › peer-review

    Abstract

    Association rule mining is a data mining technique used to find frequent patterns in large datasets. In this paper, we address the issues that arise when applying it to item response datasets, which are generally highly multidimensional. The primary disadvantage of mining association rules in a highly multidimensional dataset is the huge number of patterns that are discovered, most of which are trivial or uninteresting. We introduce a new measure called surprisal that estimates the informativeness of transactional instances and attributes. Our approach to item association analysis first eliminates noisy and uninformative data using surprisal, and then generates association rules of good quality. Experimental results on real datasets from national-level tests for Korean high school students show that surprisal-based pruning significantly improves the quality of association rules in item response datasets.
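
    The two-stage approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not define the measure, so the sketch assumes the standard information-theoretic surprisal, -log2 p(value), averaged per response vector, and uses its expected value (entropy) per item. The function names, the thresholds, the choice to treat high-surprisal instances as noisy and low-entropy items as uninformative, and the restriction to pairwise "correct implies correct" rules are all assumptions made for illustration.

```python
import math
from collections import Counter
from itertools import permutations


def value_probs(rows):
    """Empirical probability of each observed value, per item column."""
    n = len(rows)
    return [{v: c / n for v, c in Counter(col).items()} for col in zip(*rows)]


def instance_surprisal(row, probs):
    """Mean surprisal in bits of one response vector (assumed definition)."""
    return sum(-math.log2(probs[j][v]) for j, v in enumerate(row)) / len(row)


def attribute_surprisal(col_probs):
    """Expected surprisal (entropy) in bits of one item column."""
    return sum(-p * math.log2(p) for p in col_probs.values())


def prune(rows, max_instance_bits, min_attribute_bits):
    """Drop high-surprisal rows and low-entropy columns (assumed pruning rule).

    Returns the pruned rows and the original indices of the kept columns.
    """
    probs = value_probs(rows)
    keep_cols = [j for j, p in enumerate(probs)
                 if attribute_surprisal(p) >= min_attribute_bits]
    keep_rows = [r for r in rows
                 if instance_surprisal(r, probs) <= max_instance_bits]
    return [[r[j] for j in keep_cols] for r in keep_rows], keep_cols


def pair_rules(rows, cols, min_support, min_confidence):
    """Rules 'item a correct -> item b correct' as (a, b, support, confidence)."""
    n = len(rows)
    rules = []
    if n == 0:
        return rules
    for a, b in permutations(range(len(cols)), 2):
        n_a = sum(1 for r in rows if r[a] == 1)
        n_ab = sum(1 for r in rows if r[a] == 1 and r[b] == 1)
        if n_a and n_ab / n >= min_support and n_ab / n_a >= min_confidence:
            rules.append((cols[a], cols[b], n_ab / n, n_ab / n_a))
    return rules


# Hypothetical toy data: one row per student, 1 = correct, 0 = incorrect.
responses = [
    [1, 1, 1, 0],
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [1, 0, 1, 0],
]
pruned, kept = prune(responses, max_instance_bits=10.0, min_attribute_bits=0.1)
rules = pair_rules(pruned, kept, min_support=0.5, min_confidence=0.7)
```

    In this toy data the first item is answered correctly by every student, so its entropy is zero and it is pruned before rule generation; this is the kind of trivial pattern source that pruning is meant to remove.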

    Original language: English
    Pages (from-to): 913-920
    Number of pages: 8
    Journal: WSEAS Transactions on Computers
    Volume: 6
    Issue number: 6
    Publication status: Published - 2007 Jun

    Keywords

    • Association rule
    • Data mining
    • Interestingness measure
    • Item response analysis

    ASJC Scopus subject areas

    • General Computer Science
