Journal Basic Info

  • Impact Factor: 1.995**
  • H-Index: 8
  • ISSN: 2474-1647
  • DOI: 10.25107/2474-1647
**Impact Factor calculated based on Google Scholar Citations. Please contact us for any more details.

Major Scope

  •  Urology
  •  Cardiovascular Surgery
  •  Otolaryngology - Head and Neck Surgery
  •  Transplant Surgery
  •  Surgical Oncology
  •  Neurological Surgery
  •  Oral and Maxillofacial Surgery
  •  Robotic Surgery

Abstract

Citation: Clin Surg. 2022;7(1):3534.Research Article | Open Access

Esophageal to Laryngeal Voice Conversion Using a Sequence to Sequence Mapping Model Including an Attention Mechanism

Kadria Ezzine1*, Joseph Di Martino2 and Mondher Frikha3

1National Engineering School of Carthage, Carthage University, Tunisia
2Lorraine Laboratory of Research in Computer Science and its Applications, Lorraine University, France
3ATISP - National School of Electronics and Telecommunications of Sfax, Sfax University, Tunisia

*Correspondance to: Kadria Ezzine 

 PDF  Full Text DOI: 10.25107/2474-1647.3534

Abstract

Esophageal Speech (ES) can be used as an alternative speaking method for laryngectomes. Compared to laryngeal voice, ES is characterized by poor intelligibility and poor quality due to chaotic fundamental frequency, specific noises that resemble belching, and low intensity. These issues are alleviated by converting ES into more natural speech that is an effective way to improve speech quality and intelligibility. To accomplish this, we propose in this work a novel esophagealto- laryngeal Voice Conversion (VC) system based on a Sequence-to-Sequence (Seq2Seq) technique combined with an attention mechanism. The originality of the proposed method is that it does not require any dynamic time alignment during the training phase, which avoids erroneous mappings and significantly reduces the computing time. In addition, to preserve the identity of the target speaker, excitation and phase coefficients are estimated by querying a binary search tree in the target training space through the coefficients of the vocal tract previously predicted by the proposed Seq2Seq mapping model. In experiments, we compare our approach with baseline methods using numerous measures for objective and subjective evaluations. Perceptual tests confirmed that our proposed method behaves better and achieves better performance even in some difficult cases. In fact, it consistently exceeds conventional methods as acoustic models in terms of speech quality and intelligibility.

Keywords

Cite the article

Ezzine K, Di Martino J, Frikha M. Esophageal to Laryngeal Voice Conversion Using a Sequence to Sequence Mapping Model Including an Attention Mechanism. Clin Surg. 2022; 7: 3534..

Search Our Journal

Journal Indexed In

Articles in PubMed

Mesh Sprayer Device with Liquefied Mesh Delivery System: Proposed Alternative for Currently Available Meshes in Hernia Repair and Supplement to Abdominal Closure
 PubMed  PMC  PDF  Full Text
Sildenafil Transiently Delays Early Alveolar Bone Healing of Tooth Extraction Sockets
 PubMed  PMC  PDF  Full Text
View More...

Articles with Grants

Inflammatory Pseudotumor of the Liver Misdiagnosed Malignancy: A Case Report
 Abstract  PDF  Full Text
Radiotherapy Enhances Kupffer Cell Function and Modifies the Spleen Macrophage Clearance
 Abstract  PDF  Full Text
View More...