Git Product home page Git Product logo

pte2asc's Introduction

PTE2ASC

This is a word embedding resource built by ourselves with PTE which is a semisupervised representation learning tool proposed by [Tang et al., 2015]. This tool could leverage both labeled and unlabeled data to build a large-scale heterogeneous network and use the network to train the word vectors. In our implementation, on one hand, the labeled data is collected from Amazon by [McAuley et al., 2015]. Specifically, we pick 6 domains, i.e., Books, CDs, Clothing, Electronics, Restaurant and Health and each review is automatically assigned with a positive category if its rating score is 4 or 5 and a negative category if its rating score is 1 or 2. On the other hand, the unlabeled data is the data from SemEval-2015 Task [Pontiki et al., 2015]. The vocabulary size is about 1.2 million and the dimensionality of word vector is 300.

[Tang et al., 2015] Jian Tang, Meng Qu, and Qiaozhu Mei. PTE: predictive text embedding through large-scale heterogeneous text networks. In Proceedings of SIGKDD2015, pages 1165–1174, 2015.

[McAuley et al., 2015] Julian J. McAuley, Rahul Pandey, and Jure Leskovec. Inferring networks of substitutable and complementary products. In Proceedings of SIGKDD2015, pages 785–794, 2015.

[Pontiki et al., 2015] Maria Pontiki, Dimitris Galanis, Haris Papageorgiou, Suresh Manandhar, and Ion Androutsopoulos. Semeval-2015 task 12: Aspect based sentiment analysis. In Proceedings of NAACL-HLT-2015, pages 486–495, 2015.

The word embedding resource is released at https://pan.baidu.com/s/1Z7BxJ2rf0XlFlPfg7dEf4Q.

Discourse Segmentation Tool

Owing to some unknown reasons, the orginal url http://alt.qcri.org/tools/discourse-parser/ could not be accessed. Now, you can download the same discourse segmentation tool from the new address https://github.com/jjwangnlp/codra-rst-parser.

Usuage

python tests_demo.py

Prerequisition

python version >= 2.7

Citation

Jingjing Wang, Jie Li, Shoushan Li, Yangyang Kang, Min Zhang, Luo Si. Aspect Sentiment Classidication with both Word-level and Clause-level Attention Networks. In Proceeding of IJCAI-2018.

pte2asc's People

Contributors

jjwangnlp avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

pte2asc's Issues

ASC-QA

hello,excuse me, this issue is not related to the work on this page, but it still matters to me. Your team's 《Aspect Sentiment Classification Towards Question-Answering with Reinforced Bidirectional Attention Network》 work Github address has been 404 not found. I really want to learn about the dataset of this work. Could you please update Github or provide me with your contact information? Please send it to me privately. Thank you very much! Finally, I wish you success in your work and good health!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.