As I said <a href="https://github.com/TeamNewPipe/NewPipeExtractor/pull/5#issuecomment

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Remove static variables from extractors about newpipeextractor HOT 4 CLOSED

teamnewpipe commented on August 16, 2024 1

Remove static variables from extractors

from newpipeextractor.

Comments (4)

mauriciocolli commented on August 16, 2024

@theScrabi I was thinking, does the page number in channel extractor make sense?

Because there some bad things that it brings (despite that currently, it doesn't work at all):

You don't know how many pages there are
Some sites like YouTube doesn't have pages (I don't know one that have), it works as the user demand more videos (scroll all the way down)
I don't see a very clear use case for page number, maybe continue where you stop (but anyway, that's more complex/difficult to implement in services that don't have the concepts of pages, because you would have to keep loading the on demand urls, until you get to the "page")

So, wouldn't make more sense implement it that way, on demand?

from newpipeextractor.

theScrabi commented on August 16, 2024

Introduction to the problem

That page thing is indead a problem. I've tried to implement the NewPipe backend like a slingle state machine, like regular REST Api are set up. However a Website is not a single state machine, as you can see on this example: Making one and the same request will give you a different output.
This might be a flow of NewPipe, and it might also cause some trouble later on when we implement different services.

Concrete Problem

At the moment it works like this: Every request is a new Instance of the back end. So the back end does not remember anything from the last request (it does not remember it's state). This is why the page parameter has to be given to the back end, so it's possible for the front end to remember the state.

In our case the data send from youtube is not the same for every url (one time its html one time it's ajax with less information). That means that NewPipe has to simulate a stateless machine, and that's bad you are right.

What we need

Well I don't want to say it but It might be that the backed as it is designed at the moment will cause trouble. The extractor/collector thing

is bad for parallelization
requires all information to be given at every point of time

Solution

I see two solutions:

Make an instance of the back end that lives as long as the channel/video/bla is displayed
Do not collect all the data, but make the frontent reside what it needs.

What do you think?

from newpipeextractor.

mauriciocolli commented on August 16, 2024

I think the instance living as long as it's used, is good for me (at least for now, I'll think more about it).

And I'm doing a big refactor in the extractor (I implemented the "on-demand", as I discussed in the previous post), take a look at: https://github.com/mauriciocolli/NewPipeExtractor/tree/refactor-extractor

Also I want to implement the playlist asap too, but I want to, at least, get rid of these issues, as it suffers the exact same problem as the ChannelExtractor (copied the same code).

PS: Please take a look at those open pull requests.

from newpipeextractor.

theScrabi commented on August 16, 2024

"On-demand" means you are not pulling out all the data into a struct, but calling the required function when data is needed right?

from newpipeextractor.

Remove static variables from extractors about newpipeextractor HOT 4 CLOSED

Comments (4)

Introduction to the problem

Concrete Problem

What we need

Solution

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent