Comments (4)
Ah ok, sorry for the confusion, I didn't see where the user-agent could come into play in goquery. Well this is really just a helper function, you can't customize anything on the request as it is.
What you can do (though not tested, but along those lines) is this:
req, err := http.NewRequest("GET", url, nil)
if err != nil {
// handle error
}
req.Header.Set("User-Agent", ua)
res, err := http.DefaultClient.Do(req)
if err != nil {
// handle error
}
defer res.Body.Close()
root, err := html.Parse(res.Body)
if err != nil {
return
}
d := NewDocumentFromNode(root)
from goquery.
@raichu
hello,
Maybe this code can help you.
req, _ := http.NewRequest("GET", link, nil)
req.Header.Add("Content-Type", "application/x-www-form-urlencoded")
req.Header.Add("user-agent", "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36")
if resp, err := http.DefaultClient.Do(req); err == nil {
if doc, err := goquery.NewDocumentFromResponse(resp); err == nil {
}
}`
from goquery.
I think you're on the wrong project page, you must be referring to gocrawl https://github.com/PuerkitoBio/gocrawl ?
If so, yes you can configure the user-agent (both the user-agent of the crawler when requesting robots.txt and the user-agent used to request pages). It's on the Crawler.Options field. (https://github.com/PuerkitoBio/gocrawl#options)
from goquery.
I'm referring to this line specifically
from goquery.
Related Issues (20)
- How to count the number of words before and after a selection across a document? HOT 2
- If an id attribute contains dots tag will not be found HOT 2
- Find(selector)??????
- I am sorry, i can't find. May be some error. HOT 1
- Cant parse <tr> without <table> element. HOT 2
- how to get text 'CZ/KHN' HOT 3
- :first :first-child test failed,the expected nodes are not filtered out HOT 1
- Good
- How to query a shadow DOM? HOT 3
- Add a generic form of `Selection.Map` (requiring a more recent Go version) HOT 4
- Question about parsing nested tables and finding outer elements HOT 5
- Release Request for Current HEAD of golang.org/x/net with Security Updates HOT 5
- v1 (2).json
- Как добавить "github.com/PuerkitoBio/goquery" в свой код в визуал студио коде? HOT 1
- помогите с установкой GOPATH И GOROOT HOT 1
- Fails to remove some img elements HOT 2
- Has anyone ever tried goquery in a webbrowser with wasm? HOT 1
- Question: Is it possible support the iterator in version 1.23? HOT 3
- Bug about Text method HOT 1
- Resolve WARNING in CI of github actions. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from goquery.