AmirmohammadRostami/KeywordsSpotting-EfficientNet-A0

KeywordsSpotting-EfficientNet-A0

Keyword spotting in continuous speech using convolutional neural network

This is a PyTorch implementation of several popular CNN architectures for keyword spotting, including Deep Residual Models, Convolutional Neural Networks for Keyword Spotting, and our proposed architecture based on EfficientNet. All models are trained on our new Persian Keyword Spotting Dataset, which you can download from Football Keywords Dataset. For more details, please see our paper, Keyword spotting in continuous speech using convolutional neural network / DOI.

This repository is based on Honk-Repository. Honk models, which are trained on the Speech Commands Dataset, can identify simple commands (e.g., "stop" and "go"). Our work adds the following improvements:

  1. We use a modified version of EfficientNet, a state-of-the-art image classification architecture, as the base model.
  2. We improve performance in "continuous speech" mode with our proposed continuous speech synthesis method.
  3. We improve robustness to noise in real samples by training with various noise types, such as babble and stadium noise.
  4. We obtain better generalization by using SpecAugment.
  5. We use our new Persian Keywords Spotting Dataset, which allowed us to apply this project in real-world scenarios.
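Items 3 and 4 can be illustrated with a minimal NumPy sketch. This is not the code used in this repository: the function names mix_at_snr and spec_augment, their parameters, and the choice to fill masked regions with zeros are assumptions made for illustration, and the time-warping step of SpecAugment is omitted.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db, rng):
    """Add a random crop of `noise` to `speech` at the requested SNR in dB."""
    if len(noise) < len(speech):
        # Tile short noise clips so they cover the whole utterance.
        reps = int(np.ceil(len(speech) / len(noise)))
        noise = np.tile(noise, reps)
    start = rng.integers(0, len(noise) - len(speech) + 1)
    noise = noise[start:start + len(speech)]
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2) + 1e-12  # avoid division by zero
    # Scale the noise so that speech_power / scaled_noise_power == 10^(snr_db / 10).
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise


def spec_augment(spec, max_freq_mask, max_time_mask, rng):
    """Zero one random frequency band and one random time band.

    `spec` is a 2-D array shaped (frequency bins, time frames); the input
    is left unmodified and a masked copy is returned.
    """
    out = spec.copy()
    n_freq, n_time = out.shape
    # Frequency mask: width f in [0, max_freq_mask], starting at bin f0.
    f = rng.integers(0, max_freq_mask + 1)
    f0 = rng.integers(0, n_freq - f + 1)
    out[f0:f0 + f, :] = 0.0
    # Time mask: width t in [0, max_time_mask], starting at frame t0.
    t = rng.integers(0, max_time_mask + 1)
    t0 = rng.integers(0, n_time - t + 1)
    out[:, t0:t0 + t] = 0.0
    return out
```

Both functions take a NumPy random generator (for example np.random.default_rng(0)) so that augmentation is reproducible. In a training pipeline they would typically be applied on the fly to each sample, with the SNR and mask widths drawn from ranges chosen for the task.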

Demo Application

Use the instructions below to run the demo application (shown in the above video) yourself. Currently, PyTorch officially supports only Linux and OS X, so Windows users will not be able to run this demo easily.

To deploy the demo, run the following commands:

  • Change directory to the KSM repository.
  • If you do not have PyTorch installed, see the PyTorch website for installation instructions.
  • Install Python dependencies: pip install -r requirements.txt
  • Start the PyTorch server: python .
  • Run the demo: python -m utils.speech_demo_tk

If you need to adjust options, such as turning off CUDA or changing the trained model file, edit config.json.
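If you prefer to change options from a script rather than by hand, a small helper along these lines may be useful. It is only a sketch: update_config is a hypothetical function that is not part of this repository, and it assumes config.json is a flat JSON object. Because the actual option names are not listed here, the helper refuses to add keys that the file does not already contain.

```python
import json
from pathlib import Path


def update_config(path, **overrides):
    """Override existing top-level keys in a JSON config and write it back."""
    path = Path(path)
    config = json.loads(path.read_text())
    unknown = sorted(set(overrides) - set(config))
    if unknown:
        # Refuse to add keys the application does not already define.
        raise KeyError(f"keys not present in {path.name}: {unknown}")
    config.update(overrides)
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

Open config.json first to find the real option names, then pass them as keyword arguments to update_config.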

Pre-trained Models

We will release the trained KSM models in this repository as soon as possible. There are several pre-trained models for PyTorch.

Contact Us

Feel free to contact us for any further information via the channels below.

Amirmohammad Rostami

Ali Karimi

Mohammad Ali Akhaee
