Ananya Ayasi

Academia

wav2gloss: Speech to Glossed Text of Low Resource Languages

Guided by Prof. Lori, Prof. Shinji Watanabe and Prof. David Mortensen
Automatically generate Interlinear Glossed Text of low-resource languages from audio files.
Focusing on speech enhancement of audio files using ESPnet2 to improve ASR performance.

Multilingual TTS Accent Impressions for Accented ASR

Speaker Diarization using Deep Learning Techniques

Guided by Dr. Rajeev Rajan
Implemented Bi-LSTM and Bi-GRU models with self-attention as baseline models, obtaining an error rate of 25.7%.
Experimented with various transformer-based deep learning techniques to diarize audio, using data from the AMI Speech Corpus.
Paper accepted to ICNGIS 2022.

Octoechos classification in liturgical music using Deep Learning frameworks

Guided by Dr. Rajeev Rajan
Automatic classification is addressed using MTF-GRU, MTF-LSTM, MTF-SBU-LSTM, MTF-SB-GRU and MTF-SBU-GRU. The methods are compared on precision, recall and F1-score.
The experiments demonstrate the potential of SBU-LSTM in learning information through MTF.
Paper accepted to INTERSPEECH 2022.

Understanding the scope of quantum technology to gauge its social impact - Independent and Self-started

Presented a research poster at “Careers in Quantum”, conducted by QET Labs and the University of Bristol.
Gave a talk on “Quantum Technology and its Impact on Society” at the WomenTech Global Conference 2021.
Gave a talk on “Basics of Quantum Computing” as the first speaker of “Vocalize”, conducted by IEEE Travancore Hub on 4th October, 2020.