nobodywhu / gpudmm
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Hi,
I was trying to test GPUDMM on my dataset.
The source file for GSDMM seems to be missing.
Could you please update the repository or share the file?
Thanks and best regards,
raz
What is the format of the file named qa_data.txt?
Is each line a document?
Thank you for your time!
What is the format of word_similarity.txt and qa_word2id.txt? Thank you very much. @NobodyWHU @duanyu
Hi,
Thanks for sharing your work.
I get compilation errors when I try to compile RatioGPUDMM.java:
$ javac RatioGPUDMM.java
RatioGPUDMM.java:17: error: cannot find symbol
import RatioGPUDMM.Document;
^
symbol: class Document
location: package RatioGPUDMM
RatioGPUDMM.java:26: error: cannot find symbol
public ArrayList<Document> docList;
^
symbol: class Document
location: class RatioGPUDMM
RatioGPUDMM.java:60: error: cannot find symbol
public RatioGPUDMM(ArrayList<Document> doc_list, int num_topic, int num_iter, int save_step, double beta,
^
symbol: class Document
location: class RatioGPUDMM
RatioGPUDMM.java:445: error: cannot find symbol
Document doc = docList.get(i);
^
symbol: class Document
location: class RatioGPUDMM
RatioGPUDMM.java:473: error: variable topic is already defined in method init_GSDMM()
int topic = rg.nextInt(numTopic);
^
RatioGPUDMM.java:721: error: cannot find symbol
ArrayList<Document> doc_list = new Document.LoadCorpus("corpus.txt");
^
symbol: class Document
location: class RatioGPUDMM
RatioGPUDMM.java:721: error: package Document does not exist
ArrayList<Document> doc_list = new Document.LoadCorpus("corpus.txt");
^
7 errors
My version of java:
$ java -version
java version "1.8.0_111"
Java(TM) SE Runtime Environment (build 1.8.0_111-b14)
Java HotSpot(TM) 64-Bit Server VM (build 25.111-b14, mixed mode)
I'm not really a Java expert (I mostly use Python), and I haven't been able to find the bug.
Could you help me, please?
Thanks, mat
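Of the seven errors above, the one at line 473 ("variable topic is already defined") does not involve the Document class. javac reports it when a local variable is declared a second time in a scope where that name is already declared. The sketch below reproduces only that pattern. DuplicateLocalDemo and drawTopic are hypothetical names, and this is not the repository's init_GSDMM(); how the real method should be changed depends on code not shown in this report.

```java
import java.util.Random;

// Illustration only: NOT the repository's init_GSDMM(), just the same error pattern.
final class DuplicateLocalDemo {

    // Draws a topic index in [0, numTopic) twice, reusing one local variable.
    static int drawTopic(Random rg, int numTopic) {
        int topic = rg.nextInt(numTopic);      // first (and only) declaration
        // int topic = rg.nextInt(numTopic);   // a second declaration here fails with
        //                                     // "variable topic is already defined"
        topic = rg.nextInt(numTopic);          // plain reassignment compiles
        return topic;
    }

    public static void main(String[] args) {
        int topic = drawTopic(new Random(42), 10);
        System.out.println("topic within range: " + (topic >= 0 && topic < 10));
    }
}
```

The remaining "cannot find symbol" errors all name the class Document, which suggests the compiler could not see that class when RatioGPUDMM.java was compiled on its own.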
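First of all, thank you for your contribution.
I ran into a problem while building qa_data.txt: I found the web snippet dataset you cite through another paper, but its format looks like this: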
manufacture manufacturer directory directory china taiwan products manufacturers directory- taiwan china products manufacturer direcory exporter directory supplier directory suppliers business
empmag electronics manufacturing procurement homepage electronics manufacturing procurement magazine procrement power products production essentials data management business
......
When you built your own qa_data.txt, for an entry such as business, did you collect all the words that appear before business and place them after "0 business|"?
@NobodyWHU @duanyu
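The conversion described in the question can be sketched as follows. This is only an illustration of the asker's guess, not a confirmed description of qa_data.txt: the class SnippetToQaData, the method toQaLine, the meaning of the leading number, and the "|" separator are all assumptions taken from the wording of the question.

```java
// Hypothetical helper: the output layout is the asker's guess, not a confirmed format.
final class SnippetToQaData {

    /**
     * Converts one snippet line of the form "word word ... label" into
     * "<id> <label>|word word ...". The numeric id and the '|' separator
     * are assumptions based on the "0 business|" example in the question.
     */
    static String toQaLine(int id, String snippetLine) {
        String[] tokens = snippetLine.trim().split("\\s+");
        if (tokens.length < 2) {
            throw new IllegalArgumentException("expected at least one word and a label");
        }
        // The last token is treated as the class label.
        String label = tokens[tokens.length - 1];
        // Every token before the label is kept, in order, as the document text.
        StringBuilder words = new StringBuilder();
        for (int i = 0; i < tokens.length - 1; i++) {
            if (i > 0) {
                words.append(' ');
            }
            words.append(tokens[i]);
        }
        return id + " " + label + "|" + words;
    }

    public static void main(String[] args) {
        // Prints: 0 business|empmag electronics manufacturing
        System.out.println(toQaLine(0, "empmag electronics manufacturing business"));
    }
}
```

Whether the leading number is a document index, a label index, or something else is exactly what the question asks, so the sketch takes it as a parameter rather than deriving it.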