Git Product home page Git Product logo

unicode's People

Contributors

anomen-s avatar cben avatar davejagoda avatar dscorbett avatar garabik avatar julienpalard avatar mandel59 avatar mdirik avatar raylu-stripe avatar remram44 avatar sersorrel avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

unicode's Issues

Release on PyPi?

Is this package ready to be hosted on PyPI? We can try it out on https://test.pypi.org/ (under a dummy name) and if it works can we release it there?

Also, does anyone know where to find a "unicode-data" package for MacOS?

get_unihan_properties_internal raise AttributeError

When a CJK character is looked up with verbose option, AttributeError is thrown:

% ./unicode -v 夢
U+5922 CJK UNIFIED IDEOGRAPH-5922
UTF-8: e5 a4 a2 UTF-16BE: 5922 Decimal: 夢 Octal: \054442
夢
Category: Lo (Letter, Other); East Asian width: W (wide)
Unicode block: 4E00..9FFF; CJK Unified Ideographs
Bidi: L (Left-to-Right)


Traceback (most recent call last):
  File "./unicode", line 1014, in <module>
    main()
  File "./unicode", line 1011, in main
    print_characters(processed_args, options.maxcount, format_string, options.query_wikipedia, options.query_wiktionary)
  File "./unicode", line 746, in print_characters
    uhp = get_unihan_properties(c)
  File "./unicode", line 335, in get_unihan_properties_internal
    properties[key] = value.decode('utf-8')
AttributeError: 'str' object has no attribute 'decode'

Env. info:
macOS Big Sur version 11.2.1, MacBook Air (M1, 2020), Apple M1
Python 3.9.2

grep is placed in /usr/bin

Feature request: Unicode blocks

It would be useful to include Unicode block of characters (http://ftp.unicode.org/Public/UNIDATA/Blocks.txt) since name of character not always indicates which block it belongs to. For example:

U+A673 SLAVONIC ASTERISK
UTF-8: ea 99 b3 UTF-16BE: a673 Decimal: &#42611; Octal: \0123163
꙳
Block: Cyrillic Extended-B (U+A640..U+A69F)
Category: Po (Punctuation, Other)
Bidi: ON (Other Neutrals)
U+F0021  - No such unicode character name in database
UTF-8: f3 b0 80 a1 UTF-16BE: db80dc21 Decimal: &#983073; Octal: \03600041
󰀡 (󰀡)
Uppercase: F0021
Block: Supplementary Private Use Area-A (U+F0000..U+FFFFD)
Category: Co (Other, Private Use)
Bidi: L (Left-to-Right)

Request: suggest where to get UnicodeData.txt

The README says UnicodeData.txt is required and suggests the URL http://www.unicode.org/Public/. That URL presents me with 44 directories and no indication which of them might contain a file named UnicodeData.txt. I can understand that versions change and you might not want to link to a specific version, but at least could you give a hint as to which directory people should be looking at? There seems to be a file with that name in UNIDATA/ so I'll start with that, but since you didn't link directly there I'm wondering if there might be other versions in other directories (that file does seem to work).

This unicode module looks very useful, thanks.

would be nice to see version of unicode that introduced character

Hey there! I love your tool, and have been using and recommending it for years.

It would be nice to display the version of Unicode that introduced a character, e.g. "13.0" for say U+1FBF5 SEGMENTED DIGIT FIVE. I don't believe that this can be determined strictly from Block membership, but I'm not certain that it can't (i.e. I think non-full Blocks can have characters added over time). It seems reasonable to require an external data source, as the Block information does.

This is of course strictly a wishlist item. If you have no interest in doing it, but would accept a patch, feel free to assign it to me, and I'll kick one over to you when I get the time. Thanks!

unicode finds BEVERAGE but not HOT

Still there's a HOT BEVERAGE character.

"HOT" in fact matches the three letters "H", "O", "T", yet I expected to find "HOT BEVERAGE".

Speaking of ☕, I think I owe you one for this package!!! :)

ERROR: No matching distribution found for unidecode

Showing this error when I tried to install other package that depend on this one.

Building wheels for collected packages: unicode
Building wheel for unicode (setup.py) ... done
Stored in directory: C:\Users\XXXXXXXX
Successfully built unicode

I download and installed manually but I only see one file after in the site-packages folder which is unicode-2.7.dist-info, without unicode folder itself. Not sure why this happening, using win 10 python 3.6.

Need help. Thanks

Special case control chars

unicode 0x07 outputs a bell, for example

pchar is stuck right into the format string but should probably get special handling. this would allow me to easily embed the output with pchar into other apps (an irc bot that responds to "!unicode", in particular)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.