Lip Reading with Hahn Convolutional Neural ...
Type de document :
Compte-rendu et recension critique d'ouvrage
Titre :
Lip Reading with Hahn Convolutional Neural Networks moments
Auteur(s) :
Mesbah, Abderrahim [Auteur]
Université Sidi Mohamed Ben Abdellah [USMBA]
Hammouchi, Hicham [Auteur]
Université Sidi Mohamed Ben Abdellah [USMBA]
Université Mohammed V de Rabat [Agdal] [UM5]
Berrahou, Aissam [Auteur]
Université Mohammed V de Rabat [Agdal] [UM5]
Berbia, Hassan [Auteur]
Université Mohammed V de Rabat [Agdal] [UM5]
Qjidaa, Hassan [Auteur]
Université Sidi Mohamed Ben Abdellah [USMBA]
Daoudi, Mohamed [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Ecole nationale supérieure Mines-Télécom Lille Douai [IMT Lille Douai]
Université Sidi Mohamed Ben Abdellah [USMBA]
Hammouchi, Hicham [Auteur]
Université Sidi Mohamed Ben Abdellah [USMBA]
Université Mohammed V de Rabat [Agdal] [UM5]
Berrahou, Aissam [Auteur]
Université Mohammed V de Rabat [Agdal] [UM5]
Berbia, Hassan [Auteur]
Université Mohammed V de Rabat [Agdal] [UM5]
Qjidaa, Hassan [Auteur]
Université Sidi Mohamed Ben Abdellah [USMBA]
Daoudi, Mohamed [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Ecole nationale supérieure Mines-Télécom Lille Douai [IMT Lille Douai]
Titre de la revue :
Image and Vision Computing
Éditeur :
Elsevier
Date de publication :
2019-04-22
ISSN :
0262-8856
Mot(s)-clé(s) en anglais :
Visual speech recognition
Lipreading
Laryngectomy
Deep learning
Lipreading
Laryngectomy
Deep learning
Discipline(s) HAL :
Informatique [cs]/Vision par ordinateur et reconnaissance de formes [cs.CV]
Résumé en anglais : [en]
Lipreading or Visual speech recognition is the process of decoding speech from speakers mouth movements. It is used for people with hearing impairment , to understand patients attained with laryngeal cancer, people with ...
Lire la suite >Lipreading or Visual speech recognition is the process of decoding speech from speakers mouth movements. It is used for people with hearing impairment , to understand patients attained with laryngeal cancer, people with vocal cord paralysis and in noisy environment. In this paper we aim to develop a visual-only speech recognition system based only on video. Our main targeted application is in the medical field for the assistance to la-ryngectomized persons. To that end, we propose Hahn Convolutional Neu-ral Network (HCNN), a novel architecture based on Hahn moments as first layer in the Convolutional neural network (CNN) architecture. We show that HCNN helps in reducing the dimensionality of video images, in gaining training time. HCNN model is trained to classify letters, digits or words given as video images. We evaluated the proposed method on three datasets, AVLetters, OuluVS2 and BBC LRW, and we show that it achieves significant results in comparison with other works in the literature.Lire moins >
Lire la suite >Lipreading or Visual speech recognition is the process of decoding speech from speakers mouth movements. It is used for people with hearing impairment , to understand patients attained with laryngeal cancer, people with vocal cord paralysis and in noisy environment. In this paper we aim to develop a visual-only speech recognition system based only on video. Our main targeted application is in the medical field for the assistance to la-ryngectomized persons. To that end, we propose Hahn Convolutional Neu-ral Network (HCNN), a novel architecture based on Hahn moments as first layer in the Convolutional neural network (CNN) architecture. We show that HCNN helps in reducing the dimensionality of video images, in gaining training time. HCNN model is trained to classify letters, digits or words given as video images. We evaluated the proposed method on three datasets, AVLetters, OuluVS2 and BBC LRW, and we show that it achieves significant results in comparison with other works in the literature.Lire moins >
Langue :
Anglais
Vulgarisation :
Non
Collections :
Source :
Fichiers
- https://hal.archives-ouvertes.fr/hal-02109397/document
- Accès libre
- Accéder au document
- https://hal.archives-ouvertes.fr/hal-02109397/document
- Accès libre
- Accéder au document
- https://hal.archives-ouvertes.fr/hal-02109397/document
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- Lip_Reading_with_Hahn_Convolutional_Neural_Networks.pdf
- Accès libre
- Accéder au document