Computer Science Department
School of Computer Science, Carnegie Mellon University
The OpenDiamond® Platform for
M. Satyanarayanan, Rahul Sukthankar*, Adam Goode,
Interactive exploration of large distributed collections of complex, non-text data such as medical images is a challenging task because of the difficulty of creating useful indexes. To handle such tasks, we introduce a new approach to search called discard-based search. In contrast to classic search strategies that precompute indexes for all anticipated queries, discard-based search is an on-demand strategy that performs content-based computation in response to a specific query. This simple change in strategy turns out to have deep consequences for flexibility and user control, while also enabling easy exploitation of CPU and storage parallelism on servers. This paper presents the design and implementation of the OpenDiamond platform for discard-based search, describes some of the applications that have been built with it, and offers experimental evidence that its workloads exhibit easily exploitable storage parallelism.
*Intel Research Pittsburgh