This Repository contains the implementation of Automatic Assessment of Speaking Skills Using Aural and Textual Information. The details of the models implemented in this repo can be found in this paper: https://aclanthology.org/2021.icnlsp-1.19/
- Python3
- PyTorch
- scikit-learn
- numpy
- pyAudioAnalysis
- SpeechRecognition
- librosa
- fasttext
- tqdm
Run pip install -r requirements.txt to install all the required packages.
Automatic-Assessment-of-Speaking-Skills
├───AudioAnalysis
│ ├───arousal.py
│ ├───emotion.py
│ ├───valence.py
│ ├───TrainANN.py
│ ├───TrainSVM.py
│ └───Saved models
├───HighLevelAnalysis
│ ├───highLevel.py
│ ├───TrainANN.py
│ ├───TrainSVM.py
│ └───Saved models
├───Test
│ ├───Test.py
│ ├───sample.wav
│ └───Models saved
├───TextAnalysis
│ ├───arousal_BiLSTM.py
│ ├───arousal_FastText.py
│ ├───emotion_BiLSTM.py
│ ├───emotion_FastText.py
│ ├───valence_BiLSTM.py
│ ├───valence_FastText.py
│ ├───TrainANN.py
│ ├───TrainSVM.py
│ └───Saved models
├───README.md
├───requirements.txt
└───Presentation.pdf
The Datasets used in this paper are:
- IEMOCAP
- Savee
- Emovo
- Emo-db
- Ravdess
- PuSQ
To test the final model, run the following commands:
cd Test
python3 test.pyEnter the name of the audio file which needs to be tested
Enter Audio File Name: sample.txt
The output will two boolean values based on Enjoyment and Expressiveness.