Abstract
Contextual advertising is an important revenue source for major service providers on the Web. Ads classification is one of main tasks in contextual advertising, and it is used to retrieve semantically relevant ads with respect to the content of web pages. However, it is difficult for traditional text classification methods to achieve satisfactory performance in ads classification due to scarce term features in ads. In this paper, we propose a novel ads classification method that handles the lack of term features for classifying ads with short text. The proposed method utilizes a vocabulary expansion technique using semantic associations among terms learned from large-scale search query logs. The evaluation results show that our methodology achieves 4.0% ~ 9.7% improvements in terms of the hierarchical f-measure over the baseline classifiers without vocabulary expansion.
Original language | English |
---|---|
Pages (from-to) | 1373-1387 |
Number of pages | 15 |
Journal | KSII Transactions on Internet and Information Systems |
Volume | 6 |
Issue number | 5 |
DOIs | |
Publication status | Published - 2012 May 25 |
Keywords
- Advertisement classification
- Centroid classifier
- Query log
- Semantic association
- Vocabulary expansion
ASJC Scopus subject areas
- Information Systems
- Computer Networks and Communications