To solve problems that manual provision of real-time captions to a presentation or the like has low popularization in costs, a high recognition rate can not be expected only by an automatic voice recognition apparatus and there is a problem of incorrect translation and to provide an inexpensive apparatus or the like.
The caption correction apparatus obtains character strings and a degree of confidence of a voice recognition result. A time monitoring monitor monitors time and judges whether processing is delayed or not on the basis of the degree of confidence and time status. When the processing is not delayed, manual judgment is requested to a checker. In this case, voice is processed and the manual judgment of the voice recognition result is performed on the basis of the processed voice. When the processing is delayed, automatic judgment is performed on the basis of the degree of confidence. When the validity of the voice recognition result is judged as the result of manual judgment or automatic judgment, the character strings are displayed as determined character strings. When the invalidity of the voice recognition result is judged, the voice recognition result is automatically corrected by matching on the basis of a succeeding candidate based on voice recognition, the text/attributes of the presentation, the text of a script, and so on. Automatically corrected character strings are displayed as indefinite character strings.
COPYRIGHT: (C)2008,JPO&INPIT
ARAKAWA KENICHI
OKANE TOSHIYA
Yoshihiro City
Takeshi Ueno
Tasaichi Tanae
Masayuki Masabayashi