
linkedin-scraper's People

Contributors

akramaznakour, ayoubinoss, billydotws, sammrach

linkedin-scraper's Issues

Page does not increment

When I use this, the page never advances: the scraper scans the same page over and over again.

I don't see any code that moves to the next page, only code that clicks on the cards of a single page?

Issue and solution: the number of items found on each page is sometimes incorrect

Issue:
When a search returns a large number of results, e.g. 500, each page (except the last one) normally contains 25 items. However, the scraper sometimes finds fewer than 25 items on a page.

Root cause:
The "scroll down" operation on each page does not reach the very bottom of that page.
The HTML changes dynamically and loads more items as you scroll down. In the code, "endScrollTop" is calculated once up front, so its value is no longer valid once scrolling has loaded more content.
This is the line of code in question:
const endScrollTop = element.scrollHeight - element.clientHeight;
(content/linkedin-scraper.js, line 18)

Suggested solution:
Move the above line of code to the start of the scrollStep(timestamp) function, so that "endScrollTop" is recalculated at every step of the scroll-down operation.
You might also want to increase the timeout and duration when calling the simulateRealScrollToEnd() method.

(Forgive me for giving suggestions in words instead of code. I created a fork and added some features of my own, and because I am not good at JavaScript, my code is pretty messy.)
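
The suggestion above could be sketched roughly as follows. This is not the actual code from content/linkedin-scraper.js: only the names simulateRealScrollToEnd, scrollStep and endScrollTop come from the issue, while the parameter list, the default duration, and the injectable "raf" argument are assumptions made so the sketch is self-contained.

```javascript
// Sketch only: the real simulateRealScrollToEnd() may have a different signature.
// `duration` (ms) and `raf` are assumed parameters; `raf` defaults to the
// browser's requestAnimationFrame and can be replaced outside a browser.
function simulateRealScrollToEnd(
  element,
  duration = 2000,
  raf = (cb) => requestAnimationFrame(cb)
) {
  return new Promise((resolve) => {
    const startScrollTop = element.scrollTop;
    let startTime = null;

    function scrollStep(timestamp) {
      if (startTime === null) startTime = timestamp;

      // Recomputed on every step: scrollHeight grows when the page
      // lazily loads more items, so a value cached up front goes stale.
      const endScrollTop = element.scrollHeight - element.clientHeight;

      // Fraction of the animation that has elapsed, clamped to 1.
      const progress = Math.min((timestamp - startTime) / duration, 1);
      element.scrollTop =
        startScrollTop + (endScrollTop - startScrollTop) * progress;

      if (progress < 1) {
        raf(scrollStep);
      } else {
        resolve(element.scrollTop);
      }
    }

    raf(scrollStep);
  });
}
```

Note that items loaded by the very last step would still be missed, since the target is computed before the final scroll position is set. A longer duration, as suggested above, makes that less likely.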
