
Distributed web crawler engine 🌎🕷️ written in Scala. The Akka framework is used for concurrency, with data persistence on Redis.


FatCache/camelcrawler



Camel Crawler

To-Do

Web Crawler

  • Write basic crawler to get list of URLs for single domain
  • Feed base root URLs from text file
  • Convert the list of domains into a data frame/structure for analysis
  • Connect the crawler to a persistent database [MySQL]
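
The first two to-do items above can be sketched as follows. This is a minimal illustration, not the project's actual code: the object name `CrawlerSketch` and the functions `sameDomainLinks` and `parseSeeds` are hypothetical, and the regex-based link extraction is a simplifying assumption (a real crawler would more likely use an HTML parser). Fetching pages over the network is deliberately left out so the logic stays easy to test.

```scala
import java.net.URI
import scala.util.Try

object CrawlerSketch {
  // Naive href matcher (assumption): captures the target up to a quote or '#'.
  private val Href = """href\s*=\s*["']([^"'#]+)""".r

  /** Resolve every href in `html` against `base`; keep http(s) links on the same host. */
  def sameDomainLinks(base: URI, html: String): List[String] =
    Href.findAllMatchIn(html)
      .flatMap(m => Try(base.resolve(m.group(1).trim)).toOption) // skip malformed links
      .filter(u => u.getScheme != null && u.getScheme.startsWith("http"))
      .filter(u => u.getHost != null && u.getHost.equalsIgnoreCase(base.getHost))
      .map(_.toString)
      .toList
      .distinct

  /** Parse seed URLs: one per line; blank lines and '#' comment lines are ignored. */
  def parseSeeds(lines: Iterator[String]): List[URI] =
    lines.map(_.trim)
      .filter(l => l.nonEmpty && !l.startsWith("#"))
      .flatMap(l => Try(new URI(l)).toOption)
      .toList
}
```

Seeds could then be loaded with something like `CrawlerSketch.parseSeeds(scala.io.Source.fromFile("seeds.txt").getLines())`, where `seeds.txt` is a placeholder file name.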

Middleware: develop an API for the database (Redis)

  • Connect to a database - Redis
  • Create APIs...
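
One possible shape for such a middleware API is sketched below, assuming the crawler mainly needs to record which URLs it has already visited. The trait `UrlStore` and its method names are hypothetical. The in-memory implementation is only a stand-in so the example runs without a server; a Redis-backed version could map the three methods onto the Redis set commands SADD, SISMEMBER and SMEMBERS.

```scala
import scala.collection.mutable

// Hypothetical storage interface for the crawler; names are illustrative only.
trait UrlStore {
  def markVisited(url: String): Boolean // true if the URL was not seen before (cf. SADD)
  def isVisited(url: String): Boolean   // membership check (cf. SISMEMBER)
  def visited: Set[String]              // snapshot of all stored URLs (cf. SMEMBERS)
}

// In-memory stand-in for a Redis-backed store; not thread-safe.
final class InMemoryUrlStore extends UrlStore {
  private val seen = mutable.Set.empty[String]
  def markVisited(url: String): Boolean = seen.add(url)
  def isVisited(url: String): Boolean = seen.contains(url)
  def visited: Set[String] = seen.toSet
}
```

Keeping the crawler coded against the trait rather than a concrete client would let the storage backend be swapped without touching the crawl logic.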

Team Members

  • Abdusamed
  • Ming
  • Chi


Languages

  • XSLT 69.8%
  • Scala 15.7%
  • CSS 14.5%