CMU-CS-10-120
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-10-120

Interactive Machine Learning in Diamond

Shiva Kaul

May 2010

Masters Thesis

CMU-CS-10-120.pdf


Keywords: Information retrieval, active learning, human-computer interaction

Unindexed search systems, such as Diamond, are more useful than indexed search systems precisely when the cost of indexing cannot be amortized and classifiers are inexpensive to create. This thesis establishes the latter condition for many image classification tasks. To accommodate a wide variety of visual phenomena, a flexible, learned image representation, Semantic Texton Forests, is adapted for use in Diamond. To reduce the amount of interaction required to produce a high-quality classifier, a novel active learning algorithm, Active Learning by Measure Approximation, is theoretically developed. To consolidate all components of the system, a usable interface, Algum, is implemented. The result is an effective workflow realizing the Diamond vision of iterated, interactive hypothesis exploration.

69 pages


Return to: SCS Technical Report Collection
School of Computer Science

This page maintained by reports@cs.cmu.edu