Abstract
Automatic Speech Recognition (ASR) research has been primarily focused towards
large-scale systems and industry, while other areas that require attention are
often over-looked by researchers. For this reason, this research looked at
automatic speech recognition at the consumer level. Many individual consumers
will purchase and use automatic software recognition for a different purpose than
that of the military or commercial industries, such as telecommunications.
Consumers who purchase the software for personal use will mainly use ASR for
dictation of correspondences and documents. Two ASR dictation software packages
were used to conduct the study. The research examined the relationships between
(1) speech recognition software training and word accuracy, (2) error-correction
time by the user and word accuracy, and (3) correspondence type and word accuracy.
The correspondences evaluated were those that resemble Personal, Business, and
Technical Correspondences. Word accuracy was assessed after initial system
training, five minutes of error-correction time, and ten minutes of
error-correction time.
Results indicated that word recognition accuracy
achieved does affect user satisfaction. It was also found that with
increased error-correction time, word accuracy results improved. Additionally,
the results found that Personal Correspondence achieved the highest mean word
accuracy rate for both systems and that Dragon Systems achieved the highest mean
word accuracy recognition for the Correspondences explored in this research.
Results were discussed in terms of subjective and objective measures,
advantages and disadvantages of speech input, and design recommendations were
provided.
|