    Welcome to 吉林时时彩开奖规则      Phone: 010-62660053  Email: [email protected]

    Main Business

    For more information, please call

    +86-10-62660928

    Engineering Data  Research Data  Latest Releases

    King-ASR-143

    This is a 3-channel Mexican Spanish mobile speech database, recorded simultaneously on three mobile phones (Android, iPhone and Windows Phone) in Mexico in a quiet environment. The prompts are phonetically rich sentences. To limit the cognitive load on speakers, we chose natural sentences between 5 and 25 words long. The raw sentences were selected from different domains, such as conversations and news, and sentences containing offensive or negative words or phrases were removed. To achieve a good phone balance, we selected sentences from this list until the target count was reached. The final list contains about 29,995 unique sentences, each repeated fewer than 4 times across speakers. The corpus contains 270,711 utterances from 303 speakers. The recording time is about 477 hours (3-channel), including the leading silence (about 500 ms) and the trailing silence (about 500 ms). The total size of the database is about 51 GB.
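    Because each utterance carries roughly 500 ms of silence at both ends, a user may want to strip it before training. The following is a minimal sketch, not part of the database tooling: the function name `trim_fixed_silence` is hypothetical, and the 16 kHz sample rate in the usage line is an assumption, since the description does not state one. The silence lengths are only approximate, so a fixed cut like this may clip speech; an energy-based detector would be safer in practice.

```python
def trim_fixed_silence(samples, sample_rate_hz, lead_ms=500, trail_ms=500):
    """Drop a fixed amount of audio from both ends of one utterance.

    samples        -- sequence of PCM sample values for a single channel
    sample_rate_hz -- samples per second (assumed; not given in the catalog)
    lead_ms        -- leading silence to remove, in milliseconds
    trail_ms       -- trailing silence to remove, in milliseconds
    """
    lead = sample_rate_hz * lead_ms // 1000
    trail = sample_rate_hz * trail_ms // 1000
    # If the utterance is shorter than the two cuts combined, nothing is left.
    if lead + trail >= len(samples):
        return samples[:0]
    return samples[lead:len(samples) - trail]


# Usage: a 2-second dummy signal at an assumed 16 kHz sample rate.
two_seconds = list(range(32000))
speech_only = trim_fixed_silence(two_seconds, 16000)
```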

    King-ASR-044

    This Taiwanese speech recognition database was collected in Taiwan and contains the voices of 2945 native speakers, demographically balanced by age group (16-30, 31-45, 46-60), gender and dialect region. Each speaker recorded 300 simple sentences in a quiet environment, giving 921,905 audio files saved as uncompressed PCM. All of the speech data was manually transcribed and annotated. A pronunciation lexicon with phonemic transcriptions is also included.

    King-ASR-281

    This is a 4-channel Spanish desktop speech database, recorded simultaneously over 4 different microphones. The project was carried out in Argentina and covers cities across the country, for example Buenos Aires, Cordoba and Lanus. Each speaker read around 300 sentences, selected from a pool of phonetically rich sentences, as naturally as possible in approximately 80 minutes. Recording took place in a quiet office environment. The corpus contains 236,232 utterances of Spanish speech from 200 speakers. The pure recording time is about 358 hours (4-channel), including the leading silence (about 500 ms) and the trailing silence (about 500 ms). The total size of the database is 141 GB.

    King-ASR-290

    This is a 3-channel Chilean Spanish speech database, recorded on 3 different mobile operating systems: iOS, Android and Windows Phone. The project was carried out in Chile and covers all the main cities, for example Santiago, Rancagua, Antofagasta and Viña. In total 300 speakers were recorded, each in a quiet environment. The prompts are phonetically rich sentences. The raw sentences were all selected from the News, Twitter/Forum and SMS domains, and sentences containing offensive or negative words or phrases were removed. The final list contains 108,055 unique sentences, from which we generated the prompt sheets, using each sentence no more than 3 times. After some unqualified utterances were discarded, the corpus contains 268,704 utterances; the pure recording time is about 519 hours (including leading and trailing silence). The total size of the database is about 55.8 GB.
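    As a rough sanity check on the catalog figures, the stated recording time can be divided by the stated utterance count. This is only an illustrative sketch: the helper `mean_utterance_seconds` is hypothetical, and the descriptions do not say whether the utterance counts are per channel or summed over channels, so the result is simply the ratio of the two published numbers.

```python
def mean_utterance_seconds(total_hours, utterances):
    """Return stated recording time divided by stated utterance count, in seconds."""
    return total_hours * 3600 / utterances


# Figures copied from the descriptions above: (hours, utterances).
catalog = {
    "King-ASR-143": (477, 270_711),
    "King-ASR-281": (358, 236_232),
    "King-ASR-290": (519, 268_704),
}

for name, (hours, count) in catalog.items():
    # Each value includes the leading and trailing silence.
    print(f"{name}: {mean_utterance_seconds(hours, count):.2f} s per utterance")
```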