Comments (6)
-
Where is the pre-trained model for Word2Vec?
-
Error in Word2Vec.py
File "word2vec.py", line 178, in <module>
corpus = createCorpus(data)
NameError: name 'data' is not defined
from bidirectiona-lstm-for-text-summarization-.
Hi @sanjayb678, I wrote and ran the whole script in Spyder (Python 3.6). I would advise you to keep the same configuration at first; I have not tested whether the code works exactly the same in a notebook. Saving shouldn't be a problem as far as I know. However, you can skip over this line as long as the model is in memory.
Why have you done
label_encoder,onehot_encoded,onehot=summonehot(data["summaries"])
Shouldn't the function argument be corpus instead of data["summaries"]?
In cnn_daily_load.py you can also create a function like this (note that `data` must be initialised inside the function, which also fixes the NameError above):

def cnn_daily_load():
    """----------load the data, sentences and summaries-----------"""
    data = {"articles": [], "summaries": []}
    filenames = load_data(datasets["cnn"], data_categories[0])
    for k in range(len(filenames[:400])):
        if k % 2 == 0:
            try:
                data["articles"].append(cleantext(parsetext(datasets["cnn"], data_categories[0], "%s" % filenames[k])))
            except Exception as e:
                data["articles"].append("Could not read")
                print(e)
        else:
            try:
                data["summaries"].append(cleantext(parsetext(datasets["cnn"], data_categories[0], "%s" % filenames[k])))
            except Exception as e:
                data["summaries"].append("Could not read")
                print(e)
    return data
Then simply import it into word2vec.py:
from cnn_daily_load import cnn_daily_load, cleantext
data = cnn_daily_load()
Your first question was: where is the pre-trained model for Word2Vec?
I think we are simply using the skip-gram algorithm to generate our own word embeddings. That is why we have no need for a pre-trained Word2Vec model; training from scratch is just another way of generating word embeddings.
Not really, @DeepsMoseli. In this place you are using gensim's skip-gram algorithm (Word2Vec) to build a normal Word2Vec model and then generate embeddings for the words, training from scratch.
Great stuff. We never used a pre-trained Word2Vec model here.
@amanjaswani I did not fully understand your question, but to give you a hand:
label_encoder,onehot_encoded,onehot=summonehot(data["summaries"])
The label encoder is for the training labels; the Word2Vec embeddings are for the training data.
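The body of summonehot isn't shown in this thread, but the idea it implements can be sketched with plain NumPy; the `summaries` list below is a made-up stand-in for data["summaries"]:

```python
import numpy as np

# Hypothetical toy labels standing in for data["summaries"].
summaries = ["summary a", "summary b", "summary a"]

# Integer-encode each unique label (what a label encoder does)...
labels = sorted(set(summaries))
index = {lab: i for i, lab in enumerate(labels)}
integer_encoded = np.array([index[s] for s in summaries])  # [0, 1, 0]

# ...then turn each integer into a one-hot row vector.
onehot_encoded = np.eye(len(labels))[integer_encoded]
print(onehot_encoded)
# [[1. 0.]
#  [0. 1.]
#  [1. 0.]]
```

So the labels get one-hot vectors, while the article/summary text itself is represented by the Word2Vec embeddings.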