CMU-CS-04-115
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-04-115

Kernel Conditional Random Fields:
Representation, Clique Selection, and Semi-Supervised Learning

John Lafferty, Yan Liu, Xiaojin Zhu

February 2004

CMU-CS-04-115.ps
CMU-CS-04-115.pdf


Keywords: Sequence analysis, graphical models, conditional models, kernels, semi-supervised learning, protein secondary structure


Kernel conditional random fields are introduced as a framework for discriminative modeling of graph-structured data. A representer theorem for conditional graphical models is given which shows how kernel conditional random fields arise from risk minimization procedures defined using Mercer kernels on labeled graphs. A procedure for greedily selecting cliques in the dual representation is then proposed, which allows sparse representations. By incorporating kernels and implicit feature spaces into conditional graphical models, the framework enables semi-supervised learning algorithms for structured data through the use of graph kernels. The clique selection and semi-supervised methods are demonstrated in synthetic data experiments, and are also applied to the problem of protein secondary structure prediction.

15 pages


Return to: SCS Technical Report Collection
School of Computer Science homepage

This page maintained by reports@cs.cmu.edu