Abstract
The feature of brevity in mobile phone messages makes it difficult to distinguish lexical patterns to identify spam. This paper proposes a novel approach to spam classification of extremely short messages using not only lexical features that reflect the content of a message but new stylistic features that indicate the manner in which the message is written. Experiments on two mobile phone message collections in two different languages show that the approach outperforms previous content-based approaches significantly, regardless of language.
Original language | English |
---|---|
Pages (from-to) | 364-369 |
Number of pages | 6 |
Journal | Pattern Recognition Letters |
Volume | 33 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2012 Feb 1 |
Keywords
- Mobile spam classification
- Stylistic features
- Text messaging service
ASJC Scopus subject areas
- Software
- Signal Processing
- Computer Vision and Pattern Recognition
- Artificial Intelligence