Academia

wav2gloss: Speech to Glossed Text of Low Resource Languages

Multilingual TTS Accent Impressions for Accented ASR

Speaker Diarization using Deep Learning Techniques

  • Guided by Dr. Rajeev Rajan
  • Implemented Bi-LSTM and Bi-GRU models with self-attention as baselines, obtaining an error rate of 25.7%.
  • Experimented with transformer-based deep learning techniques to diarize audio from the AMI Speech Corpus.
  • Paper accepted to ICNGIS 2022.

Octoechos classification in liturgical music using Deep Learning frameworks

  • Guided by Dr. Rajeev Rajan
  • The automatic classification task is addressed using MTF-GRU, MTF-LSTM, MTF-SBU-LSTM, MTF-SB-GRU and MTF-SBU-GRU models, whose performance is compared using precision, recall and F1-score.
  • The experiments demonstrate the potential of SBU-LSTM for learning from MTF.
  • Paper accepted to INTERSPEECH 2022.

Understanding the scope of quantum technology to gauge its social impact - Independent and Self-started