Skip to content

aedenmurray/toc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧅 toc - The Onion Crawler

The Onion Crawler is a simple, straightforward web crawler designed to traverse .onion sites.

The TOR Proxy

To access the TOR network, you must utilize the tor daemon with a SOCKS5 proxy.

This will allow toc to programatically proxy all traffic through the TOR network.

$ brew install tor
$ brew services start tor

Getting Started

$ git clone https://github.com/aedenmurray/toc && cd toc
$ go run main.go 

0 https://github.com/fastfire/deepdarkCTI/blob/main/forum.md (deepdarkCTI/forum.md at main · fastfire/deepdarkCTI · GitHub)
1 http://xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.onion (Threat Actors | Onion Forums)
1 http://xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.onion/login (Forum)
1 http://xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.onion (CryptBB)

...etc

By default, toc will print out the depth, url, & title of any sites that it sucessfully visits.

However, you may extend this functionality by utilizing the OnRequest & OnResponse hooks.

Check out main.go for an example.

Usage

  • -url - Initial URL to crawl: https://github.com/fastfire/deepdarkCTI/blob/main/forum.md
  • -skip - File containing list of domains to skip. Separated by a newline.
  • -shost - The SOCKS5 host: localhost
  • -sport - The SOCKS5 port: 9050