CMU-CS-98-166
Computer Science Department
School of Computer Science, Carnegie Mellon University



CMU-CS-98-166

A Probabilistic Method for Tracking a Vocalist

Lorin V. Grubb

September 1998

Ph.D. Thesis

CMU-CS-98-166.ps
CMU-CS-98-166.pdf


Keywords: Computer music, stochastic processes, signal processing, singing, musical accompaniment, statistical modeling, automated accompaniment


When a musician gives a recital or concert, the music performed generally includes accompaniment. To render a good performance, the soloist and the accompaniest must know the musical score and must follow the other musician's performance. Both performing and reheasing are limited by constraints on the time and money available for bringing musicians together. Computer systems that automatically provide musical accompaniment offer an inexpensive, readily available alternative. Effective computer accompaniment requires software that can listen to live performers and follow along in a musical score.

This work presents an implemented system and method for automatically accompanying a singer given a musical score. Specifically, I offer a method for robust, real-time detection of a singer's score position and tempo. Robust score following requires combining information obtained both from analyzing a complex signal (the singer's performance) and from processing symbolic notation (the score). Unfortunately, the mapping from the available information to score position does not define a function. Consequently, this work investigated a statistical characterization of a singer's score position and a model that combines the available musical information to produce a probabilistic position estimate. By making careful assumptions and estimating statistics from a set of actual vocal performances, a useful approximation of this model can be implemented in software and executed in real time during a musical performance.

As part of this project, a metric was defined for evaluating the system's ability to follow a singer. This metric was used to access the system's ability to track vocal performances. The presented evaluation includes a characterization of how tracking ability can be improved by using several different measurements from the sound signal rather than only one type of measurement. Examined measurements of the sound signal include fundamental pitch, spectral features dependent upon the score's phonetic content, and amplitude changes correleted with the start of a musical note. The evaluation results demonstrate how incorporating multiple measurements of the same signal can improve the accuracy of performance tracking, for individual performances as well as on average. Overall improvement of the performance tracking system through incremental specification, development, and evaluation is facilitated by the formal statistical approach to the problem.

259 pages


Return to: SCS Technical Report Collection
School of Computer Science homepage

This page maintained by reports@cs.cmu.edu