TY - GEN
T1 - Page-based anomaly detection in large scale web clusters using adaptive MapReduce
AU - Lee, Junsup
AU - Cha, Sungdeok
PY - 2008
Y1 - 2008
N2 - While anomaly detection systems typically work on single server, most commercial web sites operate cluster environments, and user queries trigger transactions scattered through multiple servers. For this reason, anomaly detectors in a same server farm should communicate with each other to integrate their partial profile. In this paper, we describe a real-time distributed anomaly detection system that can deal with over one billion transactions per day. In our system, base on Google MapReduce algorithm, an anomaly detector in each node shares profiles of user behaviors and propagates intruder information to reduce false alarms. We evaluated our system using web log data from www.microsoft.com. The web log data, about 250GB in size, contains over one billion transactions recorded in a day.
AB - While anomaly detection systems typically work on single server, most commercial web sites operate cluster environments, and user queries trigger transactions scattered through multiple servers. For this reason, anomaly detectors in a same server farm should communicate with each other to integrate their partial profile. In this paper, we describe a real-time distributed anomaly detection system that can deal with over one billion transactions per day. In our system, base on Google MapReduce algorithm, an anomaly detector in each node shares profiles of user behaviors and propagates intruder information to reduce false alarms. We evaluated our system using web log data from www.microsoft.com. The web log data, about 250GB in size, contains over one billion transactions recorded in a day.
UR - http://www.scopus.com/inward/record.url?scp=56749160523&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56749160523&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-87403-4_28
DO - 10.1007/978-3-540-87403-4_28
M3 - Conference contribution
AN - SCOPUS:56749160523
SN - 354087402X
SN - 9783540874027
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 404
EP - 405
BT - Recent Advances in Intrusion Detection - 11th International Symposium, RAID 2008, Proceedings
T2 - Recent Advances in Intrusion Detection - 11th International Symposium, RAID 2008, Proceedings
Y2 - 15 September 2008 through 17 September 2008
ER -