Low discrepancy sets yield approximate min-wise independent permutation families

Citation
M. Saks et al., Low discrepancy sets yield approximate min-wise independent permutation families, INF PROCESS, 73(1-2), 2000, pp. 29-32
Citations number
15
Categorie Soggetti
Information Tecnology & Communication Systems
Journal title
INFORMATION PROCESSING LETTERS
ISSN journal
00200190 → ACNP
Volume
73
Issue
1-2
Year of publication
2000
Pages
29 - 32
Database
ISI
SICI code
0020-0190(20000131)73:1-2<29:LDSYAM>2.0.ZU;2-J
Abstract
Motivated by a problem of filtering near-duplicate Web documents, Broder, C harikar, Frieze and Mitzenmacher defined the following notion of epsilon-ap proximate min-wise independent permutation families. A multiset F of permut ations of {0, 1,..., n - 1} is such a family if for all K subset of or equa l to {0, 1,..., n - 1} and any x is an element of K, a permutation n chosen uniformly at random from F satisfies [GRAPHICS] We show connections of such families with low discrepancy sets for geometri c rectangles, and give explicit constructions of such families F of size n( O(root log n)) for epsilon = 1/n(Theta(1)), improving upon the previously b est-known bound of Indyk. We also present polynomial-size constructions whe n the min-wise condition is required only for \K\ less than or equal to 2(O (log2/3n)), with epsilon greater than or equal to 2(-O(log2/3 n)). (C) 2000 Published by Elsevier Science B.V. All rights reserved.