Git Product home page Git Product logo

chardetng-py's Introduction

Anurag's GitHub stats

chardetng-py's People

Contributors

dependabot[bot] avatar john-parton avatar mr0grog avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

mr0grog

chardetng-py's Issues

Add build support for Windows x86 (x86_64 already built) and FreeBSD

Here's the current supported architecture/os matrix.

Linux Windows 8 and Newer FreeBSD 10 and Newer macOS Snow Leopard (10.6)
x86
x86_64
ppc64le
AArch64
ARMv7
s390x

Information about which architectures and operating systems Python supports

https://pythondev.readthedocs.io/platforms.html

Architectures

Well supported architectures:

  • Intel x86 (32-bit) and x86_64 (64-bit, aka AMD64)

Best effort support architectures:

  • ppc64le: should be well supported in practice
  • AArch64
  • ARMv7: should be well supported in practice
  • s390x

Operating systems

  • Linux
  • Windows 8 and newer for Python 3.9
  • FreeBSD 10 and newer
  • macOS Snow Leopard (macOS 10.6, 2008) and newer

Use guess_assess to determine confidence?

In compat.detect, the confidence is always 0.99:

# chardetng does not return a confidence value
# This is the value which is unconditionally returned
DEFAULT_CONFIDENCE: Final[float] = 0.99

It’s possible I’m misunderstanding things, but it seems like you could use guess_assess() instead of guess() to at least determine high vs. low confidence. From the chardetng docs:

Same as guess(), but also returns a Boolean indicating whether the guessed encoding had a higher score than at least one other candidate. If this method returns false, the guessed encoding is likely to be wrong.

On a related note, it would be nice if shortcuts.detect() returned the boolean from guess_assess, too (or had another method that did so), since the aliases buried in the shortcuts module seem pretty important, and possibly prone to change. Otherwise a user has to know about them if they want to safely use guess_assess instead. (Or the aliases could be documented and moved to a more accessible place.)

Add clearer links to documentation

Currently, can be difficult for new users to find docs. Add clearer links to documentation

  • Add link on pypi to documentation
  • Call out written documentation in readme

Originally suggested here: #109 (comment)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.