Skip to main content
Article
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
  • Ajinkya Kulkarni, Mohamed bin Zayed University of Artificial Intelligence
  • Atharva Kulkarni, Erisha Labs
  • Sara Abedalmon em Mohammad Shatnawi, Mohamed bin Zayed University of Artificial Intelligence
  • Hanan Aldarmaki, Mohamed bin Zayed University of Artificial Intelligence
Document Type
Conference Proceeding
Abstract

We present a Classical Arabic Text-to-Speech (ClArTTS) corpus to facilitate the development of end-to-end TTS systems for the Arabic language. The speech is extracted from a LibriVox audiobook, which is then processed, segmented, and manually transcribed and annotated. The ClArTTS corpus contains about 12 hours of speech from a single male speaker sampled at 40100 Hz. In this paper, we describe the process of corpus creation, details of corpus statistics, and a comparison with existing resources. Furthermore, we develop two TTS systems based on Grad-TTS and Glow-TTS and illustrate the performance of the resulting systems via subjective and objective evaluations. The ClArTTS corpus is publicly available at www.clartts.com for research purposes, along with the baseline TTS systems and an interactive demo. © 2023 International Speech Communication Association. All rights reserved.

DOI
10.21437/Interspeech.2023-2224
Publication Date
8-1-2022
Keywords
  • Arabic languages,
  • Arabic speech,
  • Arabic speech corpus,
  • Arabic texts,
  • Corpus creation,
  • End to end,
  • Open-source,
  • Speech corpora,
  • Text to speech,
  • TTS systems,
  • Speech communication
Comments

Green Open Access

Available at ISCA Archive

Citation Information
A. Kulkarni, A. Kulkarni, S. A. Shatnawi, and H. Aldarmaki, “Clartts: An open-source classical arabic text-to-speech corpus,” INTERSPEECH 2023, Aug 2023. doi:10.21437/interspeech.2023-2224