xiaoyeye1117 / multimodalsr Goto Github PK
View Code? Open in Web Editor NEWThis project forked from matthijsvk/multimodalsr
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
License: MIT License