Git Product home page Git Product logo

xml-cnn's People

Contributors

yu54ku avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

xml-cnn's Issues

question about RCV1 tokenization

On the home page you stated that "Caution: This dataset is tokenized
differently than the one used by Liu et al."
May I ask more details about this?
Does this mean Liu et al. didn't use tokens provided at
the RCV1 page?
Thanks

Load large dataset.

Hello and thank you for this implementation.

I would like to ask if there is a way to load the data in batches since my dataset is quite large (50 GB) and cannot fit into the memory.

failed to run the code one machine

As the title. I can run it on one machine but on another machine
it failed with the following message. Do you see why this happened?
Many thanks

$ python3 train.py 

============================== Normal Train Mode ==============================

------------------------------------ Params -----------------------------------
[('batch_size', 64), ('cache_path', 'cache'), ('measure', 'p@1'), ('sequence_length', 500)]
-------------------------------------------------------------------------------

--------------------------------- Hyper Params --------------------------------
[('d_max_pool_p', [125, 128, 128]), ('filter_channels', 128), ('filter_sizes', [2, 4, 8]), ('hidden_dims', 512), ('learning_rate', 0.0005099137446356937), ('stride', [2, 1, 1])]
-------------------------------------------------------------------------------

Loading data...  /home/cjlin/.local/lib/python3.6/site-packages/torchtext/data/field.py:36: UserWarning: RawField class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.
  warnings.warn('{} class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.'.format(self.__class__.__name__), UserWarning)
/home/cjlin/.local/lib/python3.6/site-packages/torchtext/data/field.py:150: UserWarning: Field class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.
  warnings.warn('{} class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.'.format(self.__class__.__name__), UserWarning)
/home/cjlin/.local/lib/python3.6/site-packages/torchtext/data/example.py:68: UserWarning: Example class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.
  warnings.warn('Example class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.', UserWarning)
/home/cjlin/.local/lib/python3.6/site-packages/torchtext/data/example.py:78: UserWarning: Example class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.
  warnings.warn('Example class will be retired soon and moved to torchtext.legacy. Please see the most recent release notes for further information.', UserWarning)
Done.
Converting text to ID...  Traceback (most recent call last):
  File "train.py", line 129, in <module>
    main()
  File "train.py", line 106, in main
    trainer.preprocess()
  File "/home/cjlin/xml-cnn/build_problem.py", line 151, in preprocess
    self.TEXT.vocab.load_vectors("glove.6B.300d")
  File "/home/cjlin/.local/lib/python3.6/site-packages/torchtext/vocab.py", line 184, in load_vectors
    vectors[idx] = pretrained_aliases[vector](**kwargs)
  File "/home/cjlin/.local/lib/python3.6/site-packages/torchtext/vocab.py", line 487, in __init__
    super(GloVe, self).__init__(name, url=url, **kwargs)
  File "/home/cjlin/.local/lib/python3.6/site-packages/torchtext/vocab.py", line 326, in __init__
    self.cache(name, cache, url=url, max_vectors=max_vectors)
  File "/home/cjlin/.local/lib/python3.6/site-packages/torchtext/vocab.py", line 368, in cache
    with zipfile.ZipFile(dest, "r") as zf:
  File "/usr/lib/python3.6/zipfile.py", line 1131, in __init__
    self._RealGetContents()
  File "/usr/lib/python3.6/zipfile.py", line 1198, in _RealGetContents
    raise BadZipFile("File is not a zip file")
zipfile.BadZipFile: File is not a zip file

questions about the parameter settings

I noticed that you have
d_max_pool_p: [125, 128, 128]
filter_sizes: [2, 4, 8]
stride: [2, 1, 1]
May I ask why the first stride is 2?

Also how did you decide values shown in params.yml? The authors of
xml-CNN mentioned
filter_channels: 128
filter_sizes: [2, 4, 8]
hidden_dims: 512
but how about others?

Thanks

slightly worse p@1 than results in xml-cnn paper

My two runs of the code showed

-------------- Best Epoch: 16 (p@1: 0.95296472311019897461) -------------

-------------- Best Epoch: 14 (p@1: 0.95154267549514770508) -------------

This p@1 result is slightly worse than the 96.86 presented
in the xml-cnn paper. Do you think the reason is because
of the slightly different tokenization?
Indeed I think as yours used more tokens, more information is used
and results should be as good?

Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.