Comments (5)
Sorry for the late response, I have been busy recently.
In my experiments, DSHRED-WA performed best, but it also takes a long time to converge. I recommend following GPT-2 instead, which looks much more promising going forward. I also plan to release a package for transformer-based dialog models in about a month.
from multiturndialogzoo.
Thanks for the response! I find GPT-2 pretty good, but I wanted to know if an RNN model the same size could potentially beat it.
Lol, that is also the motivation for this repo. But transformer-based models seem more powerful than RNN-based models. If you have ideas, we can discuss how to improve the RNN-based models.
But was there a fair comparison (model with the same number of parameters as GPT-2, trained on the same data)?
Hi, I compared against the GPT-2 model (from transformers), training it from scratch on the data.
GPT-2 achieves a better distinct score (better diversity), but its BLEU and embedding-based scores are similar to those of the models in this repo. I may use human annotation to measure performance in the future.
Sorry for the late response 😅
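For readers unfamiliar with the distinct score mentioned above: it is commonly computed as the number of unique n-grams divided by the total number of n-grams across all generated responses, so repetitive, generic replies score low. The sketch below illustrates that common definition only. The function name `distinct_n` and the toy responses are made up for illustration, and the repo's own evaluation code may tokenize or aggregate differently.

```python
from collections import Counter


def distinct_n(responses, n):
    """Ratio of unique n-grams to total n-grams over all responses.

    `responses` is a list of token lists. This is a hypothetical helper
    for illustration, not the evaluation code used in the repo.
    """
    ngrams = Counter()
    for tokens in responses:
        # Slide a window of size n over each tokenized response.
        for i in range(len(tokens) - n + 1):
            ngrams[tuple(tokens[i:i + n])] += 1
    total = sum(ngrams.values())
    # Return 0.0 when there are no n-grams, to avoid dividing by zero.
    return len(ngrams) / total if total else 0.0


# Toy data (made up): a model that repeats itself vs. one that varies.
generic = [["i", "do", "not", "know"], ["i", "do", "not", "know"]]
varied = [["i", "do", "not", "know"], ["maybe", "try", "the", "park"]]

print(distinct_n(generic, 1), distinct_n(varied, 1))
print(distinct_n(generic, 2), distinct_n(varied, 2))
```

A higher value means more diverse output, which is the sense in which GPT-2 is described as better here. Distinct-n says nothing about relevance, which is why it is reported alongside BLEU and embedding-based scores.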