Comments (5)
is this for allowing lower resolution and sizing up?
from video2dataset.
from video2dataset.
@rom1504 couldn't find any way of specifying "closest format according to resolution" but we could manually do it in python by finding:
- worst format >= target_resolution
- best format <= target_resolution
and comparing which is closer using metadata in the return formats
problem with this is not sure if we want to take on the extra overhead of playing with formats again. The speedup we recently got was from not playing with formats and just trusting yt-dlp to give us the best format. To that end I suggest 2 options, would like to hear some thoughts:
- we try the thing I suggested earlier, test it, etc.
- we implement resolution subsampler and use that on the output of data reader to get exact resolution we want and move this to v1
from video2dataset.
ChatGPT suggests using max_resolution with height and width and then combine with | best
from video2dataset.
summarizing my thoughts on this again:
we need to make it so that users have more choices about the output dimensionality than just - what YT has or EXACT video_size x video_size. To do this we need to implement the following functionality:
- try to pick the smallest video larger than the video_size
- if that's not available and all videos are smaller than take the largest video, smaller than video_size
- if some resize mode is "pad", pad the video to video_size x video_size
- otherwise just save the smaller version
from video2dataset.
Related Issues (20)
- Failed to download: 0.000 messages when downloading HOT 1
- provide a docker image
- investigate celery + redis distribution HOT 1
- Unexpected behaviour change after download worker refactor HOT 9
- list index out of range HOT 1
- Clean up tmp part files in case of d/l failure
- Question regarding slurm(+pyspark) distributed download HOT 1
- Recent regressions HOT 3
- Add efficiency test to make it unnecessary to run it manually for each major PR
- FrameSubsampler broken in version 1.3.0 HOT 10
- Add tests for all subsamplers HOT 1
- Default process group has not been initialized, please make sure to call init_process_group. HOT 1
- Example on how to handle a padded video frame in downstream network? HOT 1
- YouTube metadata is not saved HOT 3
- Example dataloader code for distributed training? HOT 3
- Clarification needed on yt-dlp video selection query
- how could I re-download those "failed_to_download"?
- HTTPSConnectionPool(host='ak.picdn.net', port=443): Read timed out.
- How can I use video2dataset to down load the specific clip of the youtube video?
- When I use video2dataset to download videos from youtube recently, I got a confirm error.How can I resolve the problem? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from video2dataset.