Nexdata-AI/393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone

393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone

Description

Korean (Korea) children's scripted monologue smartphone speech dataset, collected as monologues read from given scripts, covering essays, stories, and numbers. Transcribed with text content and other attributes. The dataset was collected from a large and geographically diverse set of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1112?source=Github

Specifications

Format

16kHz, 16bit, uncompressed wav, mono channel
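As a quick sanity check after download, the format stated above can be verified from each file's WAV header. The following is a minimal sketch using only Python's standard `wave` module; the function names `check_wav_format` and `duration_seconds` are illustrative and not part of any Nexdata tooling, and the file layout of the delivered dataset is not assumed.

```python
import wave

# Expected values taken from the Format specification above.
EXPECTED_SAMPLE_RATE = 16_000  # Hz
EXPECTED_SAMPLE_WIDTH = 2      # bytes per sample, i.e. 16 bit
EXPECTED_CHANNELS = 1          # mono


def check_wav_format(path):
    """Return a list of mismatches between the WAV header and the spec.

    An empty list means the header matches 16 kHz, 16 bit, mono,
    uncompressed PCM. (Hypothetical helper, for illustration only.)
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems


def duration_seconds(path):
    """Return the audio duration in seconds, computed from the header."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

Summing `duration_seconds` over all files and dividing by 3600 gives the total number of hours, which can be compared against the figure in the dataset title.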

Recording environment

quiet indoor environment, without echo

Recording content (read speech)

children's books; human-machine interaction category; smart home command and control category; numbers; general category

Speaker

1,085 Korean children, all aged 6-15

Recording device

Android Smartphone, iPhone

Country

Korea

Language

Korean

Accuracy rate

Sentence Accuracy Rate (SAR) 95%
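The source does not define how SAR is computed. One common reading, assumed here, is the share of sentences whose transcript exactly matches an independently verified reference. The sketch below illustrates that assumed definition only; the function name and the whitespace normalization are hypothetical choices, not Nexdata's documented method.

```python
def sentence_accuracy_rate(references, transcripts):
    """Fraction of sentences whose transcript matches its reference exactly.

    Assumed definition: a sentence counts as correct only if the whole
    transcript equals the reference after trimming surrounding whitespace.
    """
    if len(references) != len(transcripts):
        raise ValueError("references and transcripts must have the same length")
    if not references:
        raise ValueError("at least one sentence is required")
    correct = sum(
        1
        for ref, hyp in zip(references, transcripts)
        if ref.strip() == hyp.strip()
    )
    return correct / len(references)
```

Under this reading, a SAR of 95% means that in a checked sample of 100 sentences, at least 95 transcripts contain no error at all.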

Licensing Information

Commercial License