Comments (3)
Hi, what task did you train ESIM on? The model is designed specifically for natural language inference, so depending on your task it might not be appropriate. The issue might also lie in the format of the data you fed to the model: it has to follow the same format as the SNLI and MNLI data sets.
from esim.
Hi Aurelien, many thanks for your feedback. My task is a natural language inference task framed as binary classification. The input consists of a context (facts + rules) and a question, and the output is true/false. With RoBERTa and BERT I can train, validate, and test and reach a test accuracy above 90%, but with ESIM I only get 50% test accuracy, which seems quite strange to me. Here is the dataset format.
##Input feature (Context + Question)
##Context (Facts + Rules):
##Facts:
Red things are big. Anne is big. Erin is green. Anne is red. All red things are round.
##Rules:
If something is green and cold then it is furry. Anne is green. Anne is cold. Erin is round. If something is rough then it is furry. If something is cold then it is furry. Anne is rough. Anne is furry. All furry things are round. Erin is furry. Erin is cold. If something is round then it is rough. All rough things are cold.
##Question:
'Anne is red.'
##Output (1/0) True: 1, False: 0
##Correct label:
1
##Predicted label:
1
Sorry for the (very) late reply. To process data in a format other than that of the SNLI or MNLI data sets, you'll need to modify the training and testing scripts quite significantly. In particular, you'll need to rewrite the preprocessing steps so that your data is formatted in a way the model can read. If your classification task is binary, you also need to change the data's labeldict so that it contains only two output labels instead of three.
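As a rough illustration of the kind of preprocessing change described above, here is a minimal sketch in plain Python. It is not the repository's actual preprocessing code: the record keys (`context`, `question`, `label`), the helper names, the naive tokenizer, and the `_PAD_`/`_OOV_` tokens are all assumptions made for this example. It treats the context as the premise and the question as the hypothesis, and uses a two-entry label dictionary instead of the three NLI labels.

```python
from collections import Counter

# Hypothetical two-label mapping (the SNLI/MNLI setup uses three labels).
LABELDICT = {"0": 0, "1": 1}


def to_pair(record):
    """Map one (context, question, label) record to (premise, hypothesis, label)."""
    premise = record["context"].strip()
    # The question in the example above is wrapped in single quotes; drop them.
    hypothesis = record["question"].strip().strip("'")
    return premise, hypothesis, LABELDICT[str(record["label"])]


def tokenize(text):
    # Naive tokenizer (an assumption): lowercase, split periods off, split on spaces.
    return text.lower().replace(".", " .").split()


def build_worddict(pairs, num_words=None):
    """Build a word -> index mapping from all premises and hypotheses."""
    counts = Counter()
    for premise, hypothesis, _ in pairs:
        counts.update(tokenize(premise))
        counts.update(tokenize(hypothesis))
    # Indices 0 and 1 are reserved for padding and out-of-vocabulary tokens.
    worddict = {"_PAD_": 0, "_OOV_": 1}
    for word, _ in counts.most_common(num_words):
        worddict[word] = len(worddict)
    return worddict


def encode(text, worddict):
    """Turn a sentence into a list of word indices."""
    return [worddict.get(tok, worddict["_OOV_"]) for tok in tokenize(text)]


# Example record; the context is shortened to the facts for brevity.
record = {
    "context": "Red things are big. Anne is big. Erin is green. "
               "Anne is red. All red things are round.",
    "question": "'Anne is red.'",
    "label": 1,
}

pairs = [to_pair(record)]
worddict = build_worddict(pairs)
premise_ids = encode(pairs[0][0], worddict)
hypothesis_ids = encode(pairs[0][1], worddict)
```

The index lists and integer labels produced this way would still need to be batched and padded before being passed to a model, and the model's output layer would need two classes to match the two-entry label dictionary.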