TY - GEN
T1 - BOSS
T2 - ACM 5th International Workshop on Data and Text Mining in Biomedical Informatics, DTMBIO'11, in Conjunction with the 20th ACM International Conference on Information and Knowledge Management, CIKM'11
AU - Choi, Jaehoon
AU - Kim, Donghyeon
AU - Kim, Seongsoon
AU - Lee, Sunwon
AU - Lee, Kyubum
AU - Kang, Jaewoo
PY - 2011
Y1 - 2011
N2 - It has never been easy to search relevant information from ever increasing corpus of academic literatures. Large volume of research exists concerning this problem. The previous solutions are put on either ends of spectrum: general-purpose search and domain-specific "deep" search systems. The general-purpose search systems such as PubMed offer flexible query interface, but churn out a list of matching documents that users have to digest in order to find the answers to their queries. On the other hand, the "deep" search systems such as PPI Finder and iHOP return the precompiled results in a structured way. Their results, however, are often found only within some predefined contexts. In order to address this problem, we introduce a new search engine, BOSS, for search on biomedical objects. Unlike the conventional search systems, BOSS indexes segments, rather than documents. A segment refers to a minimal semantic unit such as phrase, clause or sentence that is semantically coherent in the given context (e.g., biomedical objects or their relations). For a user query, BOSS finds all matching segments, identifies the objects appearing in the segments, and aggregates the segments for each object. Finally, it turns up for the user the ranked list of the objects along with their matching segments. BOSS fills the gap between either ends of the spectrum by allowing users to pose context-free queries and by returning a structured set of results. Furthermore, BOSS exhibits the characteristic of good scalability, just as with conventional document search engines, because as it is designed to use a standard document-indexing model with minimal modifications. Considering the features, BOSS is believed to notch up the technological level of traditional solutions for search on biomedical information. BOSS is accessible at http://boss.korea.ac.kr.
AB - It has never been easy to search relevant information from ever increasing corpus of academic literatures. Large volume of research exists concerning this problem. The previous solutions are put on either ends of spectrum: general-purpose search and domain-specific "deep" search systems. The general-purpose search systems such as PubMed offer flexible query interface, but churn out a list of matching documents that users have to digest in order to find the answers to their queries. On the other hand, the "deep" search systems such as PPI Finder and iHOP return the precompiled results in a structured way. Their results, however, are often found only within some predefined contexts. In order to address this problem, we introduce a new search engine, BOSS, for search on biomedical objects. Unlike the conventional search systems, BOSS indexes segments, rather than documents. A segment refers to a minimal semantic unit such as phrase, clause or sentence that is semantically coherent in the given context (e.g., biomedical objects or their relations). For a user query, BOSS finds all matching segments, identifies the objects appearing in the segments, and aggregates the segments for each object. Finally, it turns up for the user the ranked list of the objects along with their matching segments. BOSS fills the gap between either ends of the spectrum by allowing users to pose context-free queries and by returning a structured set of results. Furthermore, BOSS exhibits the characteristic of good scalability, just as with conventional document search engines, because as it is designed to use a standard document-indexing model with minimal modifications. Considering the features, BOSS is believed to notch up the technological level of traditional solutions for search on biomedical information. BOSS is accessible at http://boss.korea.ac.kr.
KW - biomedical object search engine
KW - biomedical text mining
KW - information retrieval
UR - http://www.scopus.com/inward/record.url?scp=83255166529&partnerID=8YFLogxK
U2 - 10.1145/2064696.2064702
DO - 10.1145/2064696.2064702
M3 - Conference contribution
AN - SCOPUS:83255166529
SN - 9781450309608
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 19
EP - 26
BT - CIKM 2011 Glasgow
Y2 - 24 October 2011 through 24 October 2011
ER -