 |
|
 |


| Voice Recognition in VOICE CAIWA being developed by PtoPA consists of two main features: ①¡¡Auto formation of Language Model and ②¡¡Prosody Analyzer. They will be discussed below. |

 |
2)Prosody Analyzer
|
 |
 |
The necessary cognition rate of voice recognition in an ideal sound environment can normally be achieved by adjusting language models and sound models. However the cognition rate is frequently degraded by various elements in normal world environments. For example, unintended recognition results are formed caused by casual noises, faltering tonal voices, or discontinued utterances, or the cognition rate can be greatly lowered on account of extreme volumes (loud/small) of speakers voices. In addition, if the volume control is not correctly adjusted, unnecessary noises can be picked up, resulting in low cognition rate. This casts a serious burden on Voice Recognition, namely to realize robust voice recognition by avoiding undesirable influences to sound environments as much as possible. This is true of voice recognition in our VOICE CAIWA. It is necessary to provide the most robust voice recognition and input the recognition results to CAIWA. The filter of the Prosody Analyzer discerns voice data for voice recognition from other sound date (such as sudden noises or fillers). The Voice Control focuses on the sound volume of voice data, and provides such processes as giving off ¡Æloud voice warning¡Ç when loud voice is sensed, and ¡Ælow voice warning¡Ç for small voice utterances. In addition, a Voice Volume Controlling feature has been developed to respond to environmental noises more flexibly, and volume control can be performed according to the sound level of normal noises.
Prosody Analyzer is being developed in order for our VOICE CAIWA to realize such ¡Èvoice recognition with high robust ability¡É Wave processing technology is introduced into prosody analyzer to realize ¡Æfilter processing¡Ç, ¡Ævoice control¡Ç and ¡Ævolume control¡Ç.
These features mounted in the Prosody Analyzer allow target voice recognition of proper voice data alone, which will help realize robust voice recognition. This realization of robust voice recognition technology can lead to a ¡Ælanguage model auto formation feature¡Ç as well as a ¡Æcharacteristic of voice recognition technology¡Ç of PtoPA.
|
|
 |
 |
|
 |