Unja is a fast and lightweight tool for fetching known URLs from the Wayback Machine, Common Crawl, VirusTotal, UrlScan.io, and AlienVault OTX. It runs a separate thread for each provider to maximize speed, uses the Wayback resumption key to split large scans into multiple parts, and applies filters directly on the provider APIs so that only filtered data is returned and less work is done on your system.
- Supports Wayback, Common Crawl, VirusTotal, AlienVault OTX, and UrlScan.io
- Automatically handles rate limits and timeouts
- Export results: plain text, or detailed output with status, mime, and length in JSON
- Multithreading: a separate thread for each provider fetches data simultaneously
- Filters: applied directly on the provider to avoid unnecessary data
You can install Unja with pip as follows:
pip3 install unja
or by downloading this repository and running:
python3 setup.py install
You can update Unja with pip as follows:
pip3 install unja -U
unja -h
This will display help for the tool.
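A minimal run only needs a target domain. The invocation below is a sketch (example.com is a placeholder); by default it should query the supported providers with the default filters and print discovered URLs one per line.

unja -d example.com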
Flag | Description | Example |
---|---|---|
-d | Domain | unja -d ninjhacks.com |
-f | File with a list of domains, separated by newlines | unja -f domains.txt |
--sub | Include subdomains | unja --sub |
-p | Providers (wayback,commoncrawl,otx,virustotal,urlscan) | unja -p wayback |
--wbf | Wayback filters (default: statuscode:200 ~mimetype:html) | unja --wbf statuscode:200 |
--ccf | CommonCrawl filters (default: =status:200 ~mime:.*html) | unja --ccf =status:200 |
--wbl | Wayback results per request (default: 10000) | unja --wbl 1000 |
--otxl | OTX results per request (default: 500) | unja --otxl 500 |
-r | Number of retries for the HTTP client (default: 3) | unja -r 3 |
-v | Enable verbose mode to show errors | unja -v |
-j | Enable JSON mode for detailed output in JSON format | unja -j |
-s | Silent mode: don't print the header | unja -s |
--ucci | Update the CommonCrawl index | unja --ucci |
--vtkey | Change the VirusTotal API key in config | unja --vtkey |
--uskey | Change the UrlScan API key in config | unja --uskey |
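Flags can be combined freely. The sketch below uses only flags from the table above (example.com and results.json are placeholders): it limits the scan to Wayback and OTX, includes subdomains, suppresses the banner, and writes detailed JSON output to a file.

unja -d example.com --sub -p wayback,otx -j -s > results.json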
text = (default) Output URLs only.
json = (-j) Output url, status, mime, and length in JSON format, which can help you filter results later based on those fields.
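For example, the JSON output can be post-processed with jq. This is a sketch assuming each result is a JSON object carrying the url, status, and mime fields described above; if the tool emits a single JSON array instead of one object per line, prepend '.[] |' to the jq filter, and adjust the comparison if status is stored as a string.

unja -d example.com -j -s | jq -r 'select(.status == 200) | .url'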
Filters are applied directly on the provider's API, so only useful, pre-filtered data is returned.
Wayback | Commoncrawl | Description |
---|---|---|
statuscode:200 | =status:200 | Return only URLs whose status code is 200
!statuscode:200 | !=status:200 | Return only URLs with a non-200 status code
mimetype:text/html | mime:text/html | Return only URLs whose response type is text/html
!mimetype:text/html | !=mime:text/html | Return only URLs whose response type is not text/html
~mimetype:html | ~mime:.*html | Return all URLs whose response type contains the word html
~original:unja | ~url:.*unja | Return all URLs that contain the word unja
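As a simple illustration of this syntax, the sketch below asks Wayback only for responses whose type contains the word json, and asks Common Crawl for the equivalent (target.com is a placeholder, mirroring the examples that follow):

unja -d target.com -p wayback,commoncrawl --wbf '~mimetype:json' --ccf '~mime:.*json'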
Get only URLs with parameters & status code 200
unja -s -d target.com --sub -p wayback,commoncrawl --wbf 'statuscode:200 ~original:=' --ccf '=status:200 ~url:.*=' | anew | tee output
Looking for open redirects
unja -s -d target.com --sub -p wayback,commoncrawl --wbf '~statuscode:30 ~original:=http' --ccf '~status:30 ~url:.*=http' | anew | tee output
Clean results (exclude images, CSS, JavaScript, woff & 404)
unja -s -d target.com --sub -p wayback,commoncrawl --wbf '!statuscode:404 ~!mimetype:image ~!mimetype:javascript ~!mimetype:css ~!mimetype:woff' --ccf '!=status:404 !~mime:.*image !~mime:.*javascript !~mime:.*css !~mime:.*woff' | anew | tee output
Let me know if you have any other good one-liners.