An efficient string searching algorithm based on occurrence frequency and pattern of vowels and consonants in a pattern

Kwang Sik Chung, Soo Young Kim, Heon Chang Yu

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Citation (Scopus)

    Abstract

    Information and communication technologies enable people to access to various documentations and information. Huge documents and information in the Internet or storage disks have made search time more important. Especially as the volume size and the number of documents on the Internet increase, string search times and costs increase have become big burden to search service. But, most of string searching algorithms have not consider lexical structures nor vowels’ occurrence frequency. Formal documents (articles, news, novels, etc.) have important characteristic that is ‘well-formed written’ English. And words of formal documents have ‘limit number of words and alphabets’ that are listed in a dictionary. The ‘limit number of words and alphabets’ has predictable occurrence probability in real world’s documentations. We try to use the alphabet occurrence probability as first search condition. We analyze all the words in the dictionaries (dictionary of free dictionary project, scrabblehelper–Revision 20, Winedit dictionary) and calculate each alphabet occurrence probability of repeated vowels, repeated consonants, not-repeated vowels and not-repeated consonants. In this paper, we define and propose the search rules and string searching algorithm, based on occurrence frequency and patterns of vowels and consonants. We use only the occurrence patterns and repeated positions of vowel and consonant in a text. Therefore, in the real world, proposed string searching algorithm (OFRP algorithm) is based on occurrence frequency and repetition pattern of vowels and consonants and is usefully and effectively applied to string search service and web search engine.

    Original languageEnglish
    Title of host publicationIntelligence Science and Big Data Engineering
    Subtitle of host publicationBig Data and Machine Learning Techniques - 5th International Conference, IScIDE 2015, Revised Selected Papers
    EditorsZhi-Hua Zhou, Baochuan Fu, Fuyuan Hu, Zhancheng Zhang, Zhi-Yong Liu, Yanning Zhang, Xiaofei He, Xinbo Gao
    PublisherSpringer Verlag
    Pages277-286
    Number of pages10
    ISBN (Print)9783319238616
    DOIs
    Publication statusPublished - 2015
    Event5th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2015 - Suzhou, China
    Duration: 2015 Jun 142015 Jun 16

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume9243
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Other

    Other5th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2015
    Country/TerritoryChina
    CitySuzhou
    Period15/6/1415/6/16

    Bibliographical note

    Funding Information:
    This work was supported by 2014 Korea National Open University Research Fund.

    Publisher Copyright:
    © Springer International Publishing Switzerland 2015.

    Keywords

    • Occurrence frequency of consonants
    • Occurrence frequency of vowels
    • Repetition pattern of consonants
    • Repetition pattern of vowels
    • String search
    • Vowel and consonant-based string search

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'An efficient string searching algorithm based on occurrence frequency and pattern of vowels and consonants in a pattern'. Together they form a unique fingerprint.

    Cite this