Overview
This paper addresses the problem of extracting key pieces of information from voicemail messages, such as the identity and phone number of the caller. This task differs from the named entity task in that the information of interest is a subset of the named entities in the message, and the need to pick the correct subset makes the problem more difficult. Also, the caller's identity may include information that is not typically associated with a named entity. This paper presents three information extraction methods: one based on hand-crafted rules, one based on maximum entropy tagging, and one based on probabilistic transducer induction. It evaluates their performance on both manually transcribed messages and the output of a speech recognition system.