Git Product home page Git Product logo

Comments (3)

controversial avatar controversial commented on June 3, 2024

normally, that would be because the article contained a link back to itself. However, in this case, it looks like Facebook links to "face book". To try and combat duplicate nodes (sometimes cased by spelling differences), Wikipedia map removes spaces and sees these as the same article. I'll try to fix soon.

from wikipedia-map.

controversial avatar controversial commented on June 3, 2024

I think I'll look into whether Wikipedia assigns unique IDs to pages, then I could use these. The most major restriction with Wikipedia Map as a whole is that I can't perform any kind of operations on each created node, because that would slow down expansion a lot. Speed is hard to get right on this.

The easiest fix for this would be to forbid nodes linking to themselves altogether, I think. The whole ID system used now is pretty unreliable.

Maybe the MediaWiki API provides methods for getting IDs for a whole bunch of pages at once. I could fix if this was a thing.

from wikipedia-map.

controversial avatar controversial commented on June 3, 2024

“facebook” and “face book” are now treated separately, and there's a much better mechanism for resolving duplicates now. I don't think there are very many cases where a node would link to itself when it's not supposed to, besides some extreme edge cases.

from wikipedia-map.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.