NTT Advanced Technology Corporation(NTT-AT) had supplied the multi-lingual speech database for telephonometry 1988 for three years from 1989 to 1991.
Benefits / Features
NTT Advanced Technology Corporation(NTT-AT) had supplied the multi-lingual speech database for telephonometry 1988 for three years from 1989 to 1991. However, its service terminated two years ago, since the number available for distribution had declined to only in-stock remaining. Now, NTT-AT is restarting the distribution service using the fully revised version of the database called "Multi-Lingual Speech Database FOR TELEPHONOMETRY 1994" in response to persistent request from researchers and engineers.
Specifications / Details
All speech samples are new recordings using digital equipment.
The recording was conducted in compliance with the ITU-T Rec. P.800 recommendation.
As the frequency components of speech samples are preserved up to 8kHz (The recording system ensures the flatness up to 12kHz including microphone.), the database can be apply to the evaluation not only to a conventional 3.4kHz telephone system but to a 7kHz wide band telephone system using ISDN (ITU-T Rec. G.722).
Covering 21 languages from all over the world. Deficiencies in the previous version are reduced by the addition of Asian languages.
American English, Arabic, Chinese (Mandarin), Dutch, English (British), Finnish, French, German, Greek, Hungarian, Hindi, Indonesian, Italian, Japanese, Korean, Polish, Portuguese (Brazilian), Russian, Spanish (Castilian), Swedish, Thai.
Four male and four female native speakers are assigned to each language.
Twenty four different short sentences (pairing two sentences) are spoken by each speaker. Speech samples used in the subjective testing for the recent.
8kbit/s speech coding standardization are collected in the data area.
All speech samples are recorded in four CD-ROM disks. Each disk is divided into two areas, audio and data.
Speech samples in the audio area can be played back by a commercial CD player. Since the digitized speech data are recorded by standardized format, they can be retrieved by an ordinary PC-DOS system and CD-ROM reader.
Speech data are sampled at 16-bit and 16kHz rates and formed into raw (binary) data files. * Speech samples are digitized at 16-bit and 16kHz rates. All files are stored in ISO9660-formatted CD-ROMs.
The active power level of each sample is normalized to-26 dBov according to the ITU-T Rec. P.56 algorithm.
American English, German, Italian only. For these speech samples, NTT-AT is licenced from AT&T Bell Telephone Labs./U.S.A., Deutsche Bundespost Telekom/Germany, CSELT/Italy.
- Notes :
- The delivery charge is included.
- Clients are requested to pay their domestic tax or customs duty by themselves.
|Multi-Lingual Speech Database for Telephonometry 1994||Please contact us.|