An efficient method of extracting strings from unfixed-form data

Sangjun Jeon, Jungheum Park, Keun Gi Lee, Sangjin Lee

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


As society becomes increasingly computerized, the volume of digital data grows, and digital forensic investigations collect a great deal of unfixed-form data, such as physical memory or page files, whose exact structure is difficult to determine. The most efficient way to investigate unfixed-form data is to extract strings. For document files, the extracted strings include the contents of the documents; for physical memory or page files, they can include passwords that users have entered as well as traces of messenger or web page use. Although string extraction plays an important role in investigating unfixed-form data, current extraction methods return many meaningless strings and do not properly account for the Unicode environment. Accordingly, this paper proposes a way to exclude meaningless strings effectively while taking the Unicode environment into account during string extraction.
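To make the idea concrete, the following is a minimal Python sketch of Unicode-aware string extraction with a simple filter for meaningless strings. It is not the method proposed in the paper, which the abstract does not detail. The function names `extract_strings` and `looks_meaningful`, the minimum length of 4, the restriction of UTF-16LE matches to printable ASCII and Hangul syllables (U+AC00 to U+D7A3), and the filtering thresholds are all assumptions chosen for illustration.

```python
import re
from typing import List


def looks_meaningful(s: str) -> bool:
    """Hypothetical filter: reject low-variety or mostly-symbol strings."""
    if len(set(s)) < 3:  # e.g. "AAAAAAAA" or "=-=-=-=-"
        return False
    alnum = sum(ch.isalnum() for ch in s)
    return alnum / len(s) >= 0.5  # assumed threshold, not from the paper


def extract_strings(data: bytes, min_len: int = 4) -> List[str]:
    """Extract ASCII and UTF-16LE strings of at least min_len characters."""
    n = str(min_len).encode()
    # Runs of printable ASCII bytes.
    ascii_re = re.compile(rb"[\x20-\x7e]{" + n + rb",}")
    # Runs of UTF-16LE code units that are either printable ASCII
    # (low byte, then 0x00) or Hangul syllables U+AC00..U+D7A3
    # (any low byte with high byte 0xAC..0xD6, or low byte <= 0xA3
    # with high byte 0xD7).
    utf16_re = re.compile(
        rb"(?:[\x20-\x7e]\x00|[\x00-\xff][\xac-\xd6]|[\x00-\xa3]\xd7){"
        + n + rb",}"
    )
    found = [m.group().decode("ascii") for m in ascii_re.finditer(data)]
    found += [m.group().decode("utf-16-le") for m in utf16_re.finditer(data)]
    # Drop candidates that the heuristic considers meaningless.
    return [s for s in found if looks_meaningful(s)]
```

Because the regular expressions are matched at every byte offset, the sketch does not assume that UTF-16 text is aligned to an even address, which matters for memory dumps and page files. A real tool would need a broader set of scripts and a more careful notion of "meaningless" than these two heuristics.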

Original language: English
Title of host publication: Future Information Technology, Application, and Service, FutureTech 2012
Number of pages: 10
Edition: VOL. 1
Publication status: Published - 2012
Event: 7th FTRA International Conference on Future Information Technology, FutureTech 2012 - Vancouver, BC, Canada
Duration: 2012 Jun 26 → 2012 Jun 28

Publication series

Name: Lecture Notes in Electrical Engineering
Number: VOL. 1
Volume: 164 LNEE
ISSN (Print): 1876-1100
ISSN (Electronic): 1876-1119


Other: 7th FTRA International Conference on Future Information Technology, FutureTech 2012
City: Vancouver, BC


Keywords

  • Pagefile
  • String
  • Unallocated space

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering

