Construction of Large Scale Isolated Word Speech Corpus in Bangla
Keywords:
Bangla, speech corpora, BDNC01, vocabulary, isolated word, speech recognition
Abstract
A new speech corpus of isolated words in Bangla language has been recorded including high frequent words from a text corpus BdNC01 It has been specifically designed for various research activities related to speaker-independent Bangla speech recognition The database consists of speech of 100 speakers each of them speaking 1081 words Another 50 new speakers were employed to speak all the list of speech to construct a test database Every utterance was repeated 5 times in different days to avoid time variation of speaker property The total 400 hours of recording makes the corpora largest in its type size and language domain This paper describes the motivation for the corpora and the processes undertaken in its construction The paper concludes with the usability of the corpus
Downloads
- Article PDF
- TEI XML Kaleidoscope (download in zip)* (Beta by AI)
- Lens* NISO JATS XML (Beta by AI)
- HTML Kaleidoscope* (Beta by AI)
- DBK XML Kaleidoscope (download in zip)* (Beta by AI)
- LaTeX pdf Kaleidoscope* (Beta by AI)
- EPUB Kaleidoscope* (Beta by AI)
- MD Kaleidoscope* (Beta by AI)
- FO Kaleidoscope* (Beta by AI)
- BIB Kaleidoscope* (Beta by AI)
- LaTeX Kaleidoscope* (Beta by AI)
How to Cite
Published
2018-05-15
Issue
Section
License
Copyright (c) 2018 Authors and Global Journals Private Limited
This work is licensed under a Creative Commons Attribution 4.0 International License.