Recent Articles
Turning Transportation Data Into Effective Web Sites
An overview of web mapping software used for transportation planning applications (PDF, 270 kb)

The Web: A Communication Medium for Health Care
Using the Internet and handheld devices to improve health care (PDF, 235 kb)

The Wireless Internet
Today and Tomorrow: From simple text to spoken and graphical interfaces (PDF, 59 kb)

Privacy vs. Convenience
Protocols for protecting your privacy when browsing (PDF, 159 kb)

Introduction to Metadata
How XML and RDF are used to describe information about documents (PDF, 1.9 Mb)

About the author
Email Contact
gflammia AT alum.mit.edu

Discourse Segmentation of Spoken Dialogue: An Empirical Approach
The Structure of Information-Seeking Telephone Conversations

PhD Thesis by Giovanni Flammia
MIT Laboratory for Computer Science
Spoken Language Systems Group
May 1998

Download:

Thesis (PDF, 152 pages, 46 figures, 12 tables, 1.8 Mb)

Nb discourse annotation tool (Zip, Tcl/Tk application with annotated examples, 221 kb)

Kappa coefficient program (plain text source in C).

In this thesis, the analysis of a corpus of information-seeking dialogues provides evidence about how human-to-human telephone conversations differ from both interactive voice response systems (IVRs) and question-answer systems (QAs).

In IVRs and QAs, interaction is necessarily limited a priori. In contrast, in natural conversation either speaker can take the initiative at any time. Despite this lack of constraints, information-seeking dialogues such as getting theater showtimes and giving directions are highly structured.

The goal of this thesis has been to determine empirically the extent to which structured discourse segment boundaries can be extracted from annotated transcriptions of spontaneous, natural dialogues.

The contributions of this thesis are twofold.

Firstly, we developed a novel annotation tool called Nb, together with associated discourse segmentation instructions, and evaluated their performance. Our findings indicate that it is possible to obtain reliable discourse segmentation when the annotation task is limited to choosing among a few independent alternatives. The scores for the most reliable experiments are 83.9% recall, 85% precision, and a kappa coefficient of 0.82 (22 dialogues, with between 7 and 9 coders per dialogue).
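To make the three reliability figures concrete, here is a minimal sketch of how recall, precision, and a kappa coefficient can be computed for two coders who each mark candidate positions in a transcript as a segment boundary (1) or not (0). It uses Cohen's two-coder kappa as an assumption; the thesis reports results over 7 to 9 coders per dialogue and may aggregate differently, and the function name `boundary_agreement` and the sample labels are hypothetical, not taken from the thesis or its C program.

```python
from typing import Dict, Sequence


def boundary_agreement(reference: Sequence[int], hypothesis: Sequence[int]) -> Dict[str, float]:
    """Compare two coders' binary boundary decisions (1 = segment boundary)."""
    if len(reference) != len(hypothesis):
        raise ValueError("both coders must label the same candidate positions")
    n = len(reference)
    if n == 0:
        raise ValueError("at least one candidate position is required")

    # Boundaries marked by both coders, and by each coder separately.
    both = sum(1 for r, h in zip(reference, hypothesis) if r == 1 and h == 1)
    ref_total = sum(reference)
    hyp_total = sum(hypothesis)

    # Recall and precision of the hypothesis coder against the reference coder.
    recall = both / ref_total if ref_total else 0.0
    precision = both / hyp_total if hyp_total else 0.0

    # Observed agreement: fraction of positions where the two labels match.
    observed = sum(1 for r, h in zip(reference, hypothesis) if r == h) / n

    # Chance agreement estimated from each coder's own boundary rate.
    p_ref, p_hyp = ref_total / n, hyp_total / n
    expected = p_ref * p_hyp + (1 - p_ref) * (1 - p_hyp)

    # Kappa rescales observed agreement so that chance level maps to 0.
    kappa = (observed - expected) / (1 - expected) if expected < 1 else 1.0
    return {"recall": recall, "precision": precision, "kappa": kappa}
```

Kappa is reported alongside recall and precision because most candidate positions are not boundaries, so raw agreement is high even when coders place boundaries almost at random; kappa discounts that chance agreement.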

Secondly, the annotated data support cognitive theories of dialogue as a joint activity (Clark and Schaefer 1989, Grosz and Sidner 1990, among others) in which discourse segments are initiated by either speaker with the purpose of either repairing or preventing misunderstanding, or co-operatively finding a mutually agreed-upon solution to the task at hand. The data also support the hypothesis that a stack data structure can model spontaneous phenomena such as repairs, fresh starts, and switches between multiple active purposes.
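The stack hypothesis can be illustrated with a small sketch. This is an assumption-laden illustration, not the model from the thesis: the names `Segment`, `DiscourseStack`, and `switch_to`, and the choice to model a switch by moving an already open purpose back to the top, are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    purpose: str    # what the segment is trying to achieve
    initiator: str  # which speaker opened it


class DiscourseStack:
    """Open discourse segments, with the most recent purpose on top."""

    def __init__(self) -> None:
        self._open: List[Segment] = []

    @property
    def current(self) -> Optional[Segment]:
        # The segment currently being pursued, or None if nothing is open.
        return self._open[-1] if self._open else None

    @property
    def depth(self) -> int:
        return len(self._open)

    def open(self, purpose: str, initiator: str) -> Segment:
        # Either speaker may push a new purpose, e.g. a repair or sub-task.
        segment = Segment(purpose, initiator)
        self._open.append(segment)
        return segment

    def close(self) -> Segment:
        # Finishing a segment pops it and resumes the enclosing purpose.
        if not self._open:
            raise IndexError("no open segment to close")
        return self._open.pop()

    def switch_to(self, purpose: str) -> Segment:
        # Assumed behaviour: bring an open purpose back to the top
        # without closing the segments that were opened after it.
        for i in range(len(self._open) - 1, -1, -1):
            if self._open[i].purpose == purpose:
                segment = self._open.pop(i)
                self._open.append(segment)
                return segment
        raise KeyError(purpose)
```

For example, a caller opens "get showtimes", the agent opens a clarification such as "clarify theater", and closing the clarification makes "get showtimes" the current purpose again, which is the pattern a repair follows in this sketch.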

Screenshot of the Nb discourse annotation tool

Thesis Topic Keywords: Discourse Analysis, Dialogue, Telephone Conversations, Natural Language Processing, Discourse Segments, Discourse Segmentation, Corpus Analysis, Discourse Annotation, Content Analysis, Kappa Coefficient.