Comments (2)
Thank you for your interests in our work.
We extracted CLIP feature using the code from FrozenBiLM.
You can refer to 'Video Feature Extraction' part from the above repository.
Also, if your own dataset does not have enough temporal dynamics in the video, qav_loss
may not decrease.
If you have any questions, please let me know.
from flipped-vqa.
Thank you very much for your reply! I will try it.
from flipped-vqa.
Related Issues (19)
- Error when training with TVQA dataset: AttributeError in DataLoader worker process HOT 1
- How to use a trained checkpoint to make inference on validation set and resume from checkpoint. HOT 5
- From where to download LLaMA-v1 model? HOT 3
- Checkpoints HOT 1
- ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 3 (pid: 55662) of binary: /usr/bin/python3 HOT 3
- What is the function of the parameter `max_feats`? HOT 1
- A question about QAV task in the code HOT 2
- How many GPUs are needed to train the model? HOT 2
- Concerns and Clarifications Regarding MCQ to Generation Task Conversion HOT 3
- Not getting the reported number. HOT 4
- Cannot reproduce the result HOT 2
- finetuned using lamma-13B HOT 3
- Number of frames and its use in code and max_feats10 for video feature HOT 1
- meaning of qav loss HOT 1
- about self.gate2 HOT 1
- need llama-13B finetuned checkpoints
- What are the average stats being reported at the end of every epoch in training? HOT 2
- How was the STAR dataset preprocessed for this code HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from flipped-vqa.