Comments (4)
I think I have figured it out. My macOS laptop connects to this website directly, from China, but my Linux server does not. The website seems to show a "read more" button to foreign visitors only. If I force macOS to use a proxy, the resulting HTML file also contains this "read more" button.
Glad to know monolith honors the HTTP_PROXY env var, though.
from monolith.
This is very peculiar... thank you for reporting this (I love puzzles!). Thankfully I have both GNU/Linux and macOS to recreate the scenario. I'll try it now and see what I get.
After trying to reproduce the behavior, I have to say that I failed to get different outputs from monolith on macOS and GNU/Linux: the document gets saved with the rest of the article hidden on both OSs. I'd recommend saving it with the -c flag. The page will lose its styling, but as a side effect you'll be able to see all of its text (this usually works very well for articles paywalled via CSS).
To solve the mystery, could you please run diff on the two versions you get? That could give a good clue as to why it's happening.
Thank you once again for the report.
The diff is rather large. In case it helps, I've attached all the files. The diff command was: diff macos.html linux.html > diff.txt
The Linux version of monolith I used can be downloaded from here.
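Since the raw diff is large (monolith embeds assets inline, so individual lines can be very long), a trimmed diff may be easier to read. Below is a minimal sketch using Python's standard difflib. The function name summarize_diff and the max_width parameter are made up for illustration, and the file names are the ones from the diff command above. It is not part of monolith.

```python
import difflib


def summarize_diff(a_text, b_text, a_name="macos.html", b_name="linux.html",
                   max_width=200):
    """Return unified-diff lines with no context, truncating very long lines.

    max_width is an arbitrary illustrative limit, not a monolith setting.
    """
    a_lines = a_text.splitlines()
    b_lines = b_text.splitlines()
    result = []
    # n=0 drops unchanged context lines so only the differences remain.
    for line in difflib.unified_diff(a_lines, b_lines, fromfile=a_name,
                                     tofile=b_name, lineterm="", n=0):
        if len(line) > max_width:
            line = line[:max_width] + " ...[truncated]"
        result.append(line)
    return result


# Example usage (assumes both files exist in the current directory):
#   with open("macos.html", encoding="utf-8", errors="replace") as f:
#       mac = f.read()
#   with open("linux.html", encoding="utf-8", errors="replace") as f:
#       linux = f.read()
#   print("\n".join(summarize_diff(mac, linux)))
```

Searching the trimmed output for the "read more" markup should show quickly whether the button is present in only one of the two files.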