Job Spider

Web crawling & scraping with scrapy: a job finder with MongoDB backend

In this project, I am simply demonstrating how to use scrapy spiders to crawl and scrape web pages.

The use case I chose was a job finder bot which goes and gathers those jobs that match a candidate's criteria. Prsently, only a single spider is implemented which goes through StackOverflow's job board. Adding more spiders would be a trivial matter. Just follow the model I show.

The results are stored into a MongoDB, so that is a prerequisite the way it is implemented. But of course, changing to a different DB is pretty straightforward

To run the bot as is:

python run.py

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
job_spider		job_spider
.gitignore		.gitignore
README.md		README.md
run.py		run.py
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Job Spider

Web crawling & scraping with scrapy: a job finder with MongoDB backend

About

Releases

Packages

Languages

pbrackin/Job-Spider

Folders and files

Latest commit

History

Repository files navigation

Job Spider

Web crawling & scraping with scrapy: a job finder with MongoDB backend

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages