jekyc / wig Goto Github PK
View Code? Open in Web Editor NEWWebApp Information Gatherer
License: BSD 2-Clause "Simplified" License
WebApp Information Gatherer
License: BSD 2-Clause "Simplified" License
x
The Wig can work 2 months ago when I knew that. But today I ran the same code, it failed, and I try to download the latest version just now and run again, either can run, the screenshot is below, thanks for answers!
python version is 3.5.0
The wig.py I jusr download is under \wig-master
Thank you so much!
python3 setup.py install
Traceback (most recent call last):
File "setup.py", line 3, in
from setuptools import setup, find_packages
ImportError: No module named 'setuptools'
It is currently not possible to run wig as a command (e.g. python3 wig.py example.com) due to packaging issues.
I am a developer for the archassualt project, and we would like to add your tool to our repo. However I noticed you are missing a license file, can you please add a license file?
I have been using your CMS/app sniffer quite frequently and are happy with its accuracy.
I have a feature request that would make this tool much better for my use cases.
Feature:
Have the option to make Wig read URL's from a list of URL and then write the result to file.
Solution suggestion
wig could read a simple .txt file with one URL on each line.
Wig could then write the result to one file for each url (sub.domainname.txt) or aggregated the output to one common result file.
I would prefer if the result was written in a structured manner like a .csv file or similar.
Example use case
Thanks for sharing this tool and keep up the good work.
in wig.py on line 87: except UnknownHostName as e:
Suggestion: commenting and returning keeps the loop going when using a list with -l
90: - sys.exit(1)
90: + return
Hello
what's problem?
# ./wig.py -l peyw.txt
wig - WebApp Information Gatherer
Scanning https://de.wordpress.com...
Traceback (most recent call last):
File "./wig.py", line 12, in <module>
wig.run()
File "/root/web_apps/wig/wig/wig.py", line 276, in run
self.scan_site()
File "/root/web_apps/wig/wig/wig.py", line 148, in scan_site
self.data['cache'].load()
File "/root/web_apps/wig/wig/classes/cache.py", line 166, in load
for cache_file in os.listdir(self.cache_dir):
AttributeError: 'Cache' object has no attribute 'cache_dir'
File "./wig.py", line 76
print(p.name, end=" \r")
^
SyntaxError: invalid syntax
(On OSX 10.9)
Please if possible make support for sites with russian language. Now any site return error
Traceback (most recent call last):
File "wig.py", line 229, in
wig.run()
File "wig.py", line 171, in run
print(outputter.get_results())
File "C:\python3\App\lib\encodings\cp866.py", line 19, in encode
return codecs.charmap_encode(input,self.errors,encoding_map)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\u2014' in position 37: character maps to
Sometimes a domain have more than one ip address. In the file "results.py", class "Results" ,function "init" ,change the "site_info.ip" to a set. In the file "discovery.py",class "DiscoverIP" ,function "run",change line "ip = socket.gethostbyname(hostname)" to "ip = socket.gethostbyname_ex(hostname)[2]".
Please make this package available in http://pypi.python.org/ so that users can install it using:
pip install wig
I can help you out with the packaging and automatic deployment, if you are interested...
Hi!
First of all, WIG is awesome!!!
Second, I don't know how to post code, so, I'm opening an issue, but that isn't really an issue.
I've wrote a code to call WIG like a function that returns the results array like the obtained with de "get_results()" function.
The code it's below:
import sys
import argparse
from wig import Wig
def wigcall(url):
parser = argparse.ArgumentParser()
args = parser.parse_args()
args.input_file = None
args.url = url
args.user_agent = 'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2049.0 Safari/537.36'
args.proxy = None
args.verbosity = 0
args.run_all = False
args.match_all = False
args.stop_after = 1
args.no_cache_load = False
args.no_cache_save = False
args.output_file = None
w = Wig(args)
w.options['interactive'] = False
w.reset()
oldprint = print
oldstdout = sys.stdout
sys.stdout = None
w.scan_site()
sys.stdout = oldstdout
return w.data['results'].get_results()
And you can call it with a code like this:
from wigcall import wigcall
results = wigcall('www.google.com')
print(results)
PD: I've think that the code to supress stdout can be deleted or improved, I've just appended that to simplify the utilization of the code.
Regards! :)
Hello.
Is there any support for detecting a CMS bitrix?
Need to check "response" for "NoneType"
Traceback (most recent call last):
File "./wig.py", line 318, in
wig.run()
File "./wig.py", line 242, in run
self.scan_site()
File "./wig.py", line 156, in scan_site
DiscoverInteresting(self.options, self.data).run()
File "/home/ubuntu/wig/classes/discovery.py", line 265, in run
redirected = response.md5_404 in self.error_pages
AttributeError: 'NoneType' object has no attribute 'md5_404'
python wig.py anywebsite.domain
results in
Traceback (most recent call last):
File "wig.py", line 317, in wig.run()
File "wig.py", line 244, in run self.scan_site()
File "wig.py", line 158, in scan_site DiscoverMore(self.options, self.data).run()
File "C:\Users\x\Desktop\wig-0.5.2\classes\discovery.py", line 385, in run parser = LinkExtractor(strict=False)
File "C:\Users\x\Desktop\wig-0.5.2\classes\discovery.py", line 333, in init
super().init(strict=strict)
TypeError: init() got an unexpected keyword argument 'strict'
I tried the 'strict' fix commit to no avail, I'm then getting this:
Traceback (most recent call last):
File "wig.py", line 317, in wig.run()
File "wig.py", line 244, in run self.scan_site()
File "wig.py", line 125, in scan_site title = DiscoverTitle(self.options, self.data).run()
File "C:\Users\x\Desktop\wig-0.5.2\classes\discovery.py", line 769, in run
self.printer.print_debug_line('Getting title ...', 1)
AttributeError: 'Printer' object has no attribute 'print_debug_line'
When i try to get wig to parse i file with links it fails.
Command:
python3 wig.py -l sites.txt
I get this error mesage
Traceback (most recent call last):
File "wig.py", line 383, in <module>
wig = Wig(args)
File "wig.py", line 62, in __init__
'url': args.url.lower(),
AttributeError: 'NoneType' object has no attribute 'lower'
Using wig for singular url work fine
python3 wig.py domain.com
I have tried url's with and without http/www
My environment
Ubuntu 12.04
editor NANO
Both python 2.7.3 and python3.2.3 installed
sites.txt --> file permission 777
wig git pull --> up to date
I have used url from list before
Hello,
When using wig against a target from the .com.br domain, wig shows the subdomains of .com.br and not <target>.com.br
Would it be possible to add an option to allow the user to define the location of the cache directory?
The cache storage does not work properly on Kali Linux. A temp fix has been applied, so that wig doesn't fail to run, but as a side effect the cache storage, used for data storage between wig executions, is alway disabled.
The issue might be related to the version of Python3 or requests used in Kali.
Hi,
Trying to use wig with python 2.7 I'm getting this error:
"nsck (most recent call last):
File "wig.py", line 29, in
import time, queue, sys, argparse
ImportError: No module named queue
"
Which queue module actually has to be installed? I'm asking that because doing a "pip search queue" I'm getting way to much related results.
Thanks!
Could you add support for HTTP headers?
i get infomation :
for cache_file in os.listdir(self.cache_dir):
AttributeError: 'Cache' object has no attribute 'cache_dir'
please help me
Is there any reason in particular that there isn't support for python 2.7?
eZ publish is a common CMS i would like to support.
I tried to implement and commit the changes myself, but got som errors.
Here is what i have done:
dictionary.json
"ezpublish": {"name": "eZ publish"},
cms/string/ezpublish.json (this is working)
`
[
{
"url": "",
"type": "string",
"match": "<meta name=\"generator\" content=\"eZ Publish",
"output": ""
}
]`
cms/regex/ezpublish.json (i got som errors on this)
`
[
{
"url": "/ezinfo/about",
"type": "regex",
"match": "eZ Publish .*: (\d.*)",
"note": "Ez publish infopage",
"weight": 100,
"output": "%s"
},
{
"url": "/ezinfo/copyright",
"type": "regex",
"match": ".*eZ Systems.*",
"note": "Ez publish copyright",
"weight": 100,
"output": ""
}
]
`
About the CMS:
https://ez.no/
https://en.wikipedia.org/wiki/EZ_Publish
https://github.com/ezsystems/ezpublish-community
New version from 6.x will change name to eZ platform
Example sites running EZ to test against:
https://www.informs.org/ezinfo/about
http://www.afasi.no/ezinfo/about
https://www.pw.edu.pl/ezinfo/about
http://www.amnh.org/
http://www.hioa.no/eng/
http://www.defense.gouv.fr/ezinfo/about
More sites using EZ can be found here: https://ez.no/Clients-Partners/Clients
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.