Comments (1)
The same parse error occurs when the web page has errors too. This can be invisible errors, that is missing closing tags, corrupt tags, or similar that the browser overcomes so that the page still renders ok. I think the browser must just ignore the error so the text still displays ok, hence the error is invisible, but the parser in save-as-ebook throws out the text so the ebook doesn't match the web page.
I have a work around for this for anyone having similar problems. The extension rewriter allows you to set up rules for rewriting a page and these rules apply to changing the html too. Rewriter seems to affect the the whole page, not just the visible text. Hence rewriter can be set to remove all <o:p> and </o:p> tags so that save-as-ebook will work ok (unless the page has other errors, which is how I found that this was another problem). Rewriter can be restricted to specific URLs so you can limit the effects to just where you need it. The matching and replacing use regex so it is very powerful if need be but replacing just the o:p tags does need anything complicated.
from save-as-ebook.
Related Issues (20)
- Capturing unneeded elements HOT 4
- Some minor grievances. HOT 2
- All saved EPUB have identical identifier
- Support defining title, author and cover HOT 1
- Need more control over the capture HOT 3
- Feature Request: Add all tabs as Chapters HOT 1
- request to get it work for thunderbird HOT 1
- Feature Request: extract H1,H2,etc. tags as chapters HOT 2
- display block missing in some classes HOT 1
- Table borders are not present in the ePub HOT 2
- replacing iframes with divs - issue concerning inline styles
- Ruby rb tags ignored by extractHtml.js HOT 1
- Feature request - button to change page direction of ebook HOT 1
- Unexpected behavior on custom stylesheet. HOT 1
- Congrats on 1001 stars! HOT 1
- Images?
- Ruby rp tags dropped by extractCss() of extractHtml.js
- Doesn't work in Reader Mode HOT 3
- Website pictures are not included on the protected websites HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from save-as-ebook.