Computer Science Department
School of Computer Science, Carnegie Mellon University
MindReader: Querying Databases through Multiple Examples
Yoshiharu Ishikawa*, Ravishankar Subramanya**, Christos Faloutos
Our goal is to provide a user-friendly, but theoretically solid method, to handle such queries. We allow the user to give several examples, and optionally, their "goodness" scores, and we propose a novel method to "guess" which attributes are important, which correlations are important, and with what weight.
Our contributions are twofold: (a) we formalize the problem as a minimization problem and show how to solve for the optimal solution, completely avoiding the ad-hoc heuristics of the past. (b) Moreover, we are the first that can handle "diagonal" queries (like the "overweight" query above). Experiments on synthetic and real datasets show that our method estimtes quickly and accurately the "hidden" distance function in the user's mind.
Keywords: Databases, information retrieval, access methods, multimedia