toxicphreak / python-docx-ng

This project forked from python-openxml/python-docx

Create and modify Word documents with Python (next-gen)

License: MIT License

Python 93.58% Makefile 0.09% Gherkin 6.33%

python-docx-ng's People

Forkers

laurentp13 nezanyat

python-docx-ng's Issues

Check/Create testcases for closed pull requests

Catchup with python-docx

Any plans to refresh the alignment between this project and the original python-docx? It looks like they've added a fair few changes since the fork was done.

Include VBA handling

Use olevba.py to provide basic VBA support in Python.

Reading / decompressing VBA (nearly as it is stored in the file)
Writing OLE objects to create VBA content from scratch
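As a starting point for the reading side, a minimal sketch of how to locate the VBA container before handing it to a tool such as olevba. It relies only on the fact that a macro-enabled Word package stores its VBA project as an OLE file named vbaProject.bin (normally word/vbaProject.bin). The function name find_vba_parts is hypothetical and not part of python-docx-ng or olevba.

```python
import io
import zipfile


def find_vba_parts(docx_file):
    """Return the names of package parts that look like VBA project containers.

    `docx_file` may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(docx_file) as zf:
        # The VBA project is an OLE compound file stored inside the zip package.
        return [name for name in zf.namelist()
                if name.lower().endswith("vbaproject.bin")]


if __name__ == "__main__":
    # Build a tiny fake package in memory just to exercise the function.
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("word/document.xml", "<doc/>")
        zf.writestr("word/vbaProject.bin", b"\x00")
    buf.seek(0)
    print(find_vba_parts(buf))
```

The bytes of each part found this way are what a decompressor would then have to parse; writing a new VBA project would mean producing such an OLE file and adding it to the package, which this sketch does not attempt.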

read TOC of a document through paragraph

Hi,

I am trying to read the TOC of my document, but I cannot reach it through the paragraphs, so I use etree to get at the paragraph elements.

Here is my document. Looking at the source, the TOC seems to be wrapped in w:sdt; is that the root cause?

This is just part of the source (obtained by renaming the file to .zip and unzipping it):

<w:sdt>
<w:sdtPr>
<w:rPr>
<w:rFonts w:ascii="Tahoma" w:eastAsia="微软雅黑" w:hAnsi="Tahoma" w:cs="黑体"/>
<w:sz w:val="22"/>
<w:lang w:val="zh-CN"/>
</w:rPr>
<w:id w:val="1773670984"/>
<w:docPartObj>
<w:docPartGallery w:val="Table of Contents"/>
<w:docPartUnique/>
</w:docPartObj>
</w:sdtPr>
<w:sdtEndPr>
<w:rPr>
<w:b/>
<w:bCs/>
</w:rPr>
</w:sdtEndPr>
<w:sdtContent>
<w:p w14:paraId="0B4A64B5" w14:textId="77777777" w:rsidR="001A16C5" w:rsidRDefault="001A16C5">
<w:pPr>
<w:pStyle w:val="TOC1"/>
</w:pPr>
</w:p>
<w:p w14:paraId="400634CB" w14:textId="77777777" w:rsidR="001A16C5" w:rsidRDefault="00000000">
<w:pPr>
<w:pStyle w:val="TOC1"/>
<w:rPr>
<w:rFonts w:asciiTheme="minorHAnsi" w:eastAsiaTheme="minorEastAsia" w:hAnsiTheme="minorHAnsi" w:cstheme="minorBidi"/>
<w:kern w:val="2"/>
<w:sz w:val="21"/>
</w:rPr>
</w:pPr>
<w:r>
<w:fldChar w:fldCharType="begin"/>
</w:r>
<w:r>
<w:instrText xml:space="preserve"> TOC \o "1-3" \h \z \u </w:instrText>
</w:r>
<w:r>
<w:fldChar w:fldCharType="separate"/>
</w:r>
<w:hyperlink w:anchor="_Toc131613027" w:history="1">
<w:r>
<w:rPr>
<w:rStyle w:val="afa"/>
<w:rFonts w:ascii="黑体" w:hAnsi="黑体"/>
</w:rPr>
<w:t>摘　　要</w:t>
</w:r>
<w:r>
<w:tab/>
</w:r>
<w:r>
<w:fldChar w:fldCharType="begin"/>
</w:r>
<w:r>
<w:instrText xml:space="preserve"> PAGEREF _Toc131613027 \h </w:instrText>
</w:r>
<w:r>
<w:fldChar w:fldCharType="separate"/>
</w:r>
<w:r>
<w:t>I</w:t>
</w:r>
<w:r>
<w:fldChar w:fldCharType="end"/>
</w:r>
</w:hyperlink>
</w:p>
<w:p w14:paraId="49241CB2" w14:textId="77777777" w:rsidR="001A16C5" w:rsidRDefault="00000000">
<w:pPr>
<w:pStyle w:val="TOC1"/>
<w:rPr>
<w:rFonts w:asciiTheme="minorHAnsi" w:eastAsiaTheme="minorEastAsia" w:hAnsiTheme="minorHAnsi" w:cstheme="minorBidi"/>
<w:kern w:val="2"/>
<w:sz w:val="21"/>
</w:rPr>
</w:pPr>
<w:hyperlink w:anchor="_Toc131613028" w:history="1">
<w:r>
<w:rPr>
<w:rStyle w:val="afa"/>
<w:rFonts w:eastAsia="宋体"/>
<w:b/>
</w:rPr>
<w:t>ABSTRACT</w:t>
</w:r>
<w:r>
<w:tab/>
</w:r>
<w:r>
<w:fldChar w:fldCharType="begin"/>
</w:r>
<w:r>
<w:instrText xml:space="preserve"> PAGEREF _Toc131613028 \h </w:instrText>
</w:r>
<w:r>
<w:fldChar w:fldCharType="separate"/>
</w:r>
<w:r>
<w:t>II</w:t>
</w:r>
<w:r>
<w:fldChar w:fldCharType="end"/>
</w:r>
</w:hyperlink>
</w:p>
<w:p w14:paraId="096DC769" w14:textId="77777777" w:rsidR="001A16C5" w:rsidRDefault="00000000">
<w:pPr>
<w:pStyle w:val="TOC1"/>
<w:rPr>
<w:rFonts w:asciiTheme="minorHAnsi" w:eastAsiaTheme="minorEastAsia" w:hAnsiTheme="minorHAnsi" w:cstheme="minorBidi"/>
<w:kern w:val="2"/>
<w:sz w:val="21"/>
</w:rPr>
</w:pPr>
<w:hyperlink w:anchor="_Toc131613029" w:history="1">
<w:r>
<w:rPr>
<w:rStyle w:val="afa"/>
</w:rPr>
<w:t>第</w:t>
</w:r>
<w:r>
<w:rPr>
<w:rStyle w:val="afa"/>
</w:rPr>
<w:t>1</w:t>
</w:r>
<w:r>
331.docx
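The wrapping in w:sdt is very likely the cause: the TOC paragraphs are children of w:sdtContent rather than direct children of the body, and the paragraphs collection of a document only covers the latter. A minimal sketch of reading them directly from the package with the standard library follows. The function name toc_entries is hypothetical, and the sketch assumes the TOC is marked with a w:docPartGallery value of "Table of Contents", as in the excerpt above.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def toc_entries(docx_file):
    """Return the text of each non-empty paragraph inside a TOC content control."""
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))

    entries = []
    for sdt in root.iter(f"{{{W}}}sdt"):
        gallery = sdt.find("w:sdtPr/w:docPartObj/w:docPartGallery", NS)
        if gallery is None or gallery.get(f"{{{W}}}val") != "Table of Contents":
            continue  # some other content control, not a TOC
        for p in sdt.iterfind("w:sdtContent/w:p", NS):
            parts = []
            # Runs may sit directly in the paragraph or inside w:hyperlink.
            for run in p.iter(f"{{{W}}}r"):
                for child in run:
                    if child.tag == f"{{{W}}}t":
                        parts.append(child.text or "")
                    elif child.tag == f"{{{W}}}tab":
                        parts.append("\t")
            text = "".join(parts)
            if text.strip():
                entries.append(text)
    return entries
```

Field codes such as TOC and PAGEREF live in w:instrText, so they are skipped; each returned string holds the entry title, a tab, and the cached page number.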

Go through all "original" issues

Go through all issues: https://github.com/python-openxml/python-docx/issues

Add contributing guidelines

Hello @toxicphreAK, so glad you started a fork like this and have begun to sift through things. Thanks for your work on this.

Would you like some help? Any thoughts about how you'd like contributions to happen? I can draft some guidelines if you give me a steer, and then hopefully start contributing in other ways.

Failing to import module

Hi, thank you for bringing python-docx back to life!

I'm trying to use python-docx-ng but a basic import of from docx import Document fails for me. Am I missing something here?

STR:

python -m venv .venv
source .venv/bin/activate
echo 'python-docx-ng' > requirements.txt
pip3 install -r requirements.txt

❯ python3
Python 3.10.11 (main, May 10 2023, 11:30:20) [Clang 11.1.0 ] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from docx import Document
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/__init__.py", line 3, in <module>
    from docx.api import Document  # noqa
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/api.py", line 14, in <module>
    from docx.package import Package
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/package.py", line 9, in <module>
    from docx.opc.package import OpcPackage
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/opc/package.py", line 9, in <module>
    from docx.opc.part import PartFactory
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/opc/part.py", line 13, in <module>
    from ..oxml import parse_xml
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/oxml/__init__.py", line 334, in <module>
    from .comment import CT_Comments,CT_Com, CT_CRE, CT_CRS, CT_CRef
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/oxml/comment.py", line 8, in <module>
    from ..text.paragraph import Paragraph
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/text/paragraph.py", line 13, in <module>
    from .run import Run
  File "/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/text/run.py", line 11, in <module>
    from docx.opc.part import PackURI, Part
ImportError: cannot import name 'PackURI' from partially initialized module 'docx.opc.part' (most likely due to a circular import) (/Users/erasmas/docx-ng/.venv/lib/python3.10/site-packages/docx/opc/part.py)
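Reading the traceback, docx/opc/part.py is still being executed when docx/text/run.py asks it for PackURI, so the name does not exist yet. A minimal sketch of that pattern and of one common remedy (deferring the import until it is needed) is below. It builds a throwaway package called demo_pkg with hypothetical part.py and run.py modules; it is an illustration only and not the actual fix applied in python-docx-ng.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# part.py imports run.py before it has defined Part, like the chain in the traceback.
PART_SOURCE = "from .run import Run\n\nclass Part:\n    pass\n"

# Eager variant: run.py imports Part at module level, closing the cycle.
EAGER_RUN = "from .part import Part\n\nclass Run:\n    pass\n"

# Lazy variant: the import happens inside the method, after both modules are loaded.
LAZY_RUN = (
    "class Run:\n"
    "    def part(self):\n"
    "        from .part import Part\n"
    "        return Part()\n"
)


def try_import(run_source):
    """Create demo_pkg on disk with the given run.py and try to import demo_pkg.part."""
    with tempfile.TemporaryDirectory() as tmp:
        pkg = Path(tmp) / "demo_pkg"
        pkg.mkdir()
        (pkg / "__init__.py").write_text("")
        (pkg / "part.py").write_text(PART_SOURCE)
        (pkg / "run.py").write_text(run_source)
        sys.path.insert(0, tmp)
        try:
            importlib.invalidate_caches()
            importlib.import_module("demo_pkg.part")
            return "ok"
        except ImportError as exc:
            return f"ImportError: {exc}"
        finally:
            # Leave no trace so the function can be called again with other sources.
            sys.path.remove(tmp)
            for name in [n for n in sys.modules if n.startswith("demo_pkg")]:
                del sys.modules[name]


if __name__ == "__main__":
    print(try_import(EAGER_RUN))
    print(try_import(LAZY_RUN))
```

Which import is best deferred or moved in the real package is not something the traceback alone settles; the sketch only shows why the error appears at import time and disappears once the cycle is broken.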

Check pull request comments and evaluate pulls

Pulls:

Word Default Template

Write a macro that adds all default styles to a new document and removes the whole document content. Save the result as default.docx.
Currently this is done by creating a new document, adding all styles by hand, and then removing the content from the file, because a style is only written to the XML once it has been used (for example by adding a new heading to the document).

Remove the author from the document properties after creation.
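For the author removal, a minimal standard-library sketch that rewrites docProps/core.xml and blanks dc:creator and cp:lastModifiedBy. The function name strip_author is hypothetical. python-docx also exposes these values through Document.core_properties, but this version avoids depending on the package importing cleanly. Namespace prefixes are registered up front so that values such as xsi:type="dcterms:W3CDTF" keep pointing at a declared prefix after re-serialization.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

NAMESPACES = {
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
    "dc": "http://purl.org/dc/elements/1.1/",
    "dcterms": "http://purl.org/dc/terms/",
    "dcmitype": "http://purl.org/dc/dcmitype/",
    "xsi": "http://www.w3.org/2001/XMLSchema-instance",
}
for _prefix, _uri in NAMESPACES.items():
    ET.register_namespace(_prefix, _uri)

CORE_PART = "docProps/core.xml"


def strip_author(src, dst):
    """Copy the package `src` to `dst`, clearing author fields in the core properties.

    Both arguments may be paths or binary file-like objects.
    """
    with zipfile.ZipFile(src) as zin, \
            zipfile.ZipFile(dst, "w", zipfile.ZIP_DEFLATED) as zout:
        for item in zin.infolist():
            data = zin.read(item.filename)
            if item.filename == CORE_PART:
                root = ET.fromstring(data)
                for tag in ("dc:creator", "cp:lastModifiedBy"):
                    for node in root.findall(tag, NAMESPACES):
                        node.text = ""  # keep the element, drop the name
                data = ET.tostring(root, encoding="UTF-8", xml_declaration=True)
            # All other parts are copied through unchanged.
            zout.writestr(item, data)
```

A call such as strip_author("default.docx", "default-clean.docx") would be the intended use; the output file name here is only an example.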

Suggest to add a Homepage link on pypi

Hey toxicphreAK,
Thanks for the great work done on this project!

Just a small suggestion: it would be good to add a Homepage link to your PyPI page (https://pypi.org/project/python-docx-ng/), as other projects do, so that people can find this repo on GitHub and follow it more easily.

Thanks again!

Read commits from original repo

Next read: python-openxml#564