First, thank you for your work!
Following your description, I'm implementing each of the attention layers with TF 2.1.
I have a question: does line 221 need a `squeeze` applied to its input, like this?
```python
attention_score = RepeatVector(source_hidden_states.shape[1])(tf.squeeze(attention_score))
```
I ask because, if I understood the full code correctly, `h_t` has already been expanded with an extra dimension, so its attention score is `(B, 1, H)` before reaching `RepeatVector`. However, when I feed the `(B, 1, H)` tensor to `RepeatVector`, it raises an error: `repeat_vector is incompatible with the layer: expected ndim=2, found ndim=3.`
Thank you
For reference, the original line 221:

```python
attention_score = RepeatVector(source_hidden_states.shape[1])(attention_score)  # (B, S*, H)
```
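To illustrate what I mean, here is a minimal NumPy sketch of the shape issue (NumPy stands in for the Keras `RepeatVector` layer here, and `B`, `S`, `H` are made-up sizes, not values from your code):

```python
import numpy as np

B, S, H = 2, 5, 8  # hypothetical batch size, source length, hidden size

# The attention score comes out of the scoring step with shape (B, 1, H),
# because h_t was expanded with an extra time axis.
attention_score = np.zeros((B, 1, H))

# Keras RepeatVector expects a rank-2 input (B, H); squeezing the
# singleton axis first gives it the expected rank. (Passing axis=1
# explicitly is safer than squeezing all singleton dims, in case B == 1.)
squeezed = np.squeeze(attention_score, axis=1)          # (B, H)

# RepeatVector(S) would then tile it into (B, S, H); np.repeat on a
# new axis mimics that behaviour.
repeated = np.repeat(squeezed[:, None, :], S, axis=1)   # (B, S, H)

print(squeezed.shape, repeated.shape)
```

So feeding the `(B, 1, H)` tensor directly reproduces the `expected ndim=2, found ndim=3` error, while squeezing first makes the shapes line up.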