The decode method on string objects doesn't exist in Python 3.6.

decode method doesn't exist on string object about asciidoc-py HOT 16 CLOSED

asciidoc-py commented on June 8, 2024

decode method doesn't exist on string object

from asciidoc-py.

Comments (16)

elextr commented on June 8, 2024

Line 263 of what file? Link?

lfkeitel commented on June 8, 2024

The main a2x.py file.

elextr commented on June 8, 2024

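here: https://github.com/asciidoc/asciidoc-py3/blob/cd6762bd16b20e5cf19ceaffb8452df912bb5f6e/a2x.py#L263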

lfkeitel commented on June 8, 2024

Yep that's it. When I run with Python 3.6 that line throws saying decide doesn't exist on str.

lfkeitel commented on June 8, 2024

I meant decode not decide.

MasterOdin commented on June 8, 2024

Yeah, we need to detect the mode passed to read (and write) and then decode/encode appropriately within the function, so that callers only deal with strings going in and coming out.

Do you have a file and arguments you've used to run this to test?

elextr commented on June 8, 2024

Possibly the read should read bytes, not strings, if the encoding is not known, and then decode them to a Unicode string. (In Python 3, decode() moved to bytes objects.)
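
To illustrate the point about decode(), here is a minimal sketch. The sample text is made up for illustration and is not taken from a2x.py; it only shows that in Python 3 a str has encode() but no decode(), while bytes has decode().

```python
# Hypothetical sample data, chosen only to include non-ASCII characters.
raw = "naïve café".encode("utf-8")   # bytes, like open(path, 'rb').read() returns

text = raw.decode("utf-8")           # bytes -> str works in Python 3
has_decode_on_str = hasattr(text, "decode")    # str only offers encode()
has_decode_on_bytes = hasattr(raw, "decode")   # bytes offers decode()

try:
    text.decode("utf-8")             # calling decode() on a str, as in the report
except AttributeError as exc:
    error_message = str(exc)         # the message names the missing attribute
```

Reading the file in binary mode and decoding once, as suggested above, avoids ever calling decode() on a str.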

lfkeitel commented on June 8, 2024

When I get back to my computer I can give you the exact source and command I used.

lfkeitel commented on June 8, 2024

I ran into the error when building a package for newsboat on Fedora. Its documentation is generated using a2x. The current development version of Fedora uses this package instead of the original Python 2 version and was failing the doc builds. For a quick repro, from the newsboat repository run a2x -f xhtml doc/faq.txt. If you would like a specific tag I'm using r2.11.1.

elextr commented on June 8, 2024

@lfkeitel Fedora is maybe a bit early off the mark, this is still developmental, see #15 and the repository message.

lfkeitel commented on June 8, 2024

I'm well aware of that. It wasn't my decision. Fedora 29 is going with Python 3 as the default and they're trying their best to prepare. I just have to deal with it. I've already put in a comment with them about it so they may make a patch to the pre-release package until it's fixed upstream. I just thought you would like to know that I ran into the bug.

elextr commented on June 8, 2024

@lfkeitel no problem, thanks for reporting. Fedora should certainly be used to making patches to early release packages and updating them regularly; it's a fairly bleeding edge distro after all :)

MasterOdin commented on June 8, 2024

So looking at this further, it seems that we can either remove the whole encoding check and just assume UTF-8 always, or rewrite read_file slightly so that it loads the file as a byte string and reads the first line or two to check whether an encoding is declared. If one is, we decode the whole file with that encoding; otherwise we fall back to standard 'utf-8'. Let me know which route you'd like to follow @elextr.

For now, I've modified #5 to just always assume UTF-8 and that should fix @lfkeitel's problem at least.

lfkeitel commented on June 8, 2024

Thanks. This is how asciidoc3 handles it: https://github.com/asciidoc3/asciidoc3/blob/master/a2x3.py#L302. If it helps at all. They just call encode with the detected encoding.

MasterOdin commented on June 8, 2024

Well, except that you're still opening the file in your default locale (which might be UTF-8 or might not, docker alpine defaults to ASCII) so you're already doing a transformation on file load and then just encoding it later. If you're going to care about the encoding, it should be done at the file level when you're reading it in. Of course, that also just always encodes it in utf-8 if the file specifies an encoding which is...weird.
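
A small sketch of the locale point, under stated assumptions: the temporary file and its Latin-1 content are hypothetical, and the default reported by the locale varies by platform. It contrasts a locale-dependent default with reading bytes or passing an explicit encoding.

```python
import locale
import os
import tempfile

# Encoding the locale reports; open() typically falls back to this in text mode.
default_encoding = locale.getpreferredencoding(False)

# Hypothetical sample: one non-ASCII character stored as Latin-1.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write("café".encode("latin-1"))
    path = tmp.name

try:
    # Binary mode returns the bytes exactly as stored, with no locale involved.
    with open(path, "rb") as fh:
        raw = fh.read()
    # Text mode with an explicit encoding is also independent of the locale.
    with open(path, encoding="latin-1") as fh:
        explicit = fh.read()
finally:
    os.remove(path)

# Decoding with the wrong codec fails instead of silently guessing.
try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False
```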

Also, that has a bug in that it does a str() around a byte object which results in a string that starts with b' and ends with ', though that's a separate issue.
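
The str() behaviour described here is easy to reproduce. This is a minimal sketch with made-up sample bytes, not the asciidoc3 code itself:

```python
data = "héllo".encode("utf-8")      # hypothetical sample bytes

wrong = str(data)                   # repr-style text that starts with b' and ends with '
right = data.decode("utf-8")        # the actual text
also_right = str(data, "utf-8")     # str() given an encoding decodes instead
```

Passing the encoding to str(), or calling decode() directly, yields the text rather than the repr of the bytes object.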

MasterOdin commented on June 8, 2024

I think what you'd probably want to do is something like:

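import re

with open(filename, 'rb') as open_file:
    contents = open_file.read()
# Look for an XML declaration at the very start of the file (raw bytes pattern).
mo = re.search(rb'\A<\?xml.* encoding="(.*?)"', contents)
# group(1) is bytes, but decode() needs the codec name as a str.
contents = contents.decode(mo.group(1).decode('ascii') if mo else 'utf-8')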

to more properly read the file and get it as a proper unicode string without losing any characters, though I'm not sure how that would affect things in case it writes this stuff back out to a file (that's probably expecting utf-8).
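
As a rough end-to-end sketch of this idea, including the write-back question: the helper names read_text and write_text_utf8, the sample document, and the temporary file names are all hypothetical and not part of a2x.py.

```python
import os
import re
import tempfile


def read_text(path):
    """Hypothetical helper: read bytes, honour an XML encoding declaration, else UTF-8."""
    with open(path, "rb") as fh:
        raw = fh.read()
    match = re.search(rb'\A<\?xml.* encoding="(.*?)"', raw)
    # The captured codec name is bytes; decode() expects it as a str.
    encoding = match.group(1).decode("ascii") if match else "utf-8"
    return raw.decode(encoding)


def write_text_utf8(path, text):
    """Hypothetical helper: always write UTF-8, without newline translation."""
    with open(path, "w", encoding="utf-8", newline="") as fh:
        fh.write(text)


# Made-up sample document declared and stored as ISO-8859-1.
sample = '<?xml version="1.0" encoding="ISO-8859-1"?>\n<p>café</p>\n'

with tempfile.TemporaryDirectory() as tmp_dir:
    src = os.path.join(tmp_dir, "in.xml")
    dst = os.path.join(tmp_dir, "out.xml")
    with open(src, "wb") as fh:
        fh.write(sample.encode("latin-1"))
    decoded = read_text(src)          # no characters lost
    write_text_utf8(dst, decoded)
    with open(dst, "rb") as fh:
        rewritten = fh.read()         # now UTF-8 bytes on disk
```

Note that after such a round trip the XML declaration still names the original encoding while the bytes are UTF-8, which is one form of the write-back concern raised above.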
