A SUBSTRING SEARCH ALGORITHM IN EXTENDIBLE HASHING

Citation
H. Mochizuki et al., A SUBSTRING SEARCH ALGORITHM IN EXTENDIBLE HASHING, Information sciences, 108(1-4), 1998, pp. 13-30
Citations number
12
Categorie Soggetti
Computer Science Information Systems","Computer Science Information Systems
Journal title
ISSN journal
00200255
Volume
108
Issue
1-4
Year of publication
1998
Pages
13 - 30
Database
ISI
SICI code
0020-0255(1998)108:1-4<13:ASSAIE>2.0.ZU;2-D
Abstract
An extendible hashing scheme resolves bucket overflows by reorganizing the hash function and file structure locally, so it is very suitable for fast key retrievals of dynamic key sets. However, it cannot search keys that contain a given string as substrings efficiently. In this p aper, in order to design this substring search in extendible hashing, sig nature vectors are introduced as hash values, and a trie structure as an extendible hash table, where each vector is composed by a bit s tream. Pseudo signature vectors are defined to identify the buckets, a nd a constrained depth-fir st search is presented to traverse the arcs of the trie structure. To construct a compact trie despite an increas e in the number of keys, uniform signature treetops are introduced, an d the method for an incremental expansion of the hash table is propose d. This approach can restrict the size of the bit stream for each key, making constrained depth-first search efficient. From simulation resu lts, by applying the presented schemes to Japanese and English key set s, it was shown that the number of accessed buckets decreased from 40% to 10% in comparison with traditional extendible hashing for which on ly descriptors were used. In addition, the search time cost of the pre sented approach is 2-10 times faster. (C) 1998 Elsevier Science Inc. A ll rights reserved.