Comments (1)
Hi @jhj7905,
Thanks for your interest in our work and for your question about our data generation methods.
You're correct in your estimation. The dataset is indeed composed of 30% human-annotated data and 70% semi-automatic annotations. As you pointed out, creating a fully human-annotated dataset is costly and time-consuming, so we complemented the human annotations with semi-automatic methods.
For the semi-automatic portion of the dataset, we developed a method that combines predictions from state-of-the-art (SOTA) models to extract relevant cues, and then applied dedicated models to filter out noisy or irrelevant context. This filtering step helps the data retain accuracy and relevance even though it is not fully human-annotated.
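As a rough illustration of the workflow described above (not the actual Video-ChatGPT implementation), a semi-automatic annotation step might merge cue predictions from several off-the-shelf models and then drop low-confidence cues. All model names, thresholds, and data structures here are hypothetical:

```python
# Hypothetical sketch: combine cues predicted by multiple models,
# de-duplicate them, and filter out low-confidence ("noisy") cues.
# Sources like "captioner"/"detector" and the 0.5 threshold are
# illustrative assumptions, not the authors' actual pipeline.
from dataclasses import dataclass


@dataclass
class Cue:
    text: str
    confidence: float
    source: str  # which model produced the cue


def combine_predictions(model_outputs):
    """Merge cue lists from multiple models, keeping the
    highest-confidence copy of each distinct cue text."""
    seen = {}
    for source, cues in model_outputs.items():
        for text, conf in cues:
            if text not in seen or conf > seen[text].confidence:
                seen[text] = Cue(text, conf, source)
    return list(seen.values())


def filter_noisy(cues, threshold=0.5):
    """Drop cues below a confidence threshold -- a stand-in for the
    noise-removal models mentioned in the comment."""
    return [c for c in cues if c.confidence >= threshold]


def build_annotation(video_id, model_outputs, threshold=0.5):
    """Produce one semi-automatic annotation record for a video."""
    cues = filter_noisy(combine_predictions(model_outputs), threshold)
    return {"video_id": video_id, "cues": sorted(c.text for c in cues)}


if __name__ == "__main__":
    outputs = {
        "captioner": [("a dog runs on grass", 0.92), ("blurry frame", 0.20)],
        "detector": [("dog", 0.88), ("grass", 0.81)],
    }
    print(build_annotation("vid_001", outputs))
```

The key idea is that agreement across models plus a confidence filter substitutes for a human pass on this portion of the data.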
I hope this clarifies the methodology we used for data generation. Please feel free to reach out if you have any more questions.
from video-chatgpt.
Related Issues (20)
- Question about VideoInstruct_Dataset.json HOT 1
- Question of training utilizing A6000 HOT 6
- Inho jake park inho jake park where are you?
- ActivityNet few shot videos can't be downloaded HOT 7
- Issue running offline demo HOT 1
- Can you provide the original video caption generating the instruction qa? HOT 1
- The download of dataset HOT 1
- Is Zeroshot evaluation on validation or test datasets HOT 1
- Fail to run Video-ChatGPT Demo Offline HOT 2
- The dataset all generated from ChatGPT, it doesn't looked video actually, how to make sure the dataset is right? HOT 3
- Llava Weights not available for running demo offline HOT 2
- Can I deal with Chinese video? HOT 1
- Clarification Needed on the Role of **QA_GT_caption_based_noisy** in the Dataset HOT 1
- Why is OpenAI used for Activitynet-QA Evaluation? HOT 3
- About zero-shot test on TGIF-QA HOT 1
- Fail to execute Video-ChatGPT offline demo (when click "Upload Video") HOT 7
- Code for Dataset Generation HOT 1
- Train errors: some videos in train_video_ids.txt do not appear in activity_clip-14L_spatio_temporal_356 HOT 2
- What is the formal name of the benchmark? HOT 3
- GPT Judge does not work for the Prompt