JanLinders-Webscraper

A web scraper that collects product information from the Jan Linders supermarket website.

How to use

All scraped information is stored in a MySQL database. You can either run a database server locally with a program like XAMPP, or use a hosted MySQL service.

Open db.py and replace the credentials and database name with your own.
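The actual layout of db.py is not shown here, so the following is only a sketch of how the connection settings could be organised. The function name `load_db_config`, the `JL_DB_*` environment variable names, and the default database name `janlinders` are all hypothetical. The defaults assume a typical local install (user `root`, empty password, port 3306), which may not match your setup.

```python
import os


def load_db_config(env=os.environ):
    """Build connection settings, falling back to assumed local defaults.

    All JL_DB_* variable names are hypothetical; adapt them to db.py.
    """
    return {
        "host": env.get("JL_DB_HOST", "localhost"),
        "port": int(env.get("JL_DB_PORT", "3306")),
        "user": env.get("JL_DB_USER", "root"),
        "password": env.get("JL_DB_PASSWORD", ""),
        "database": env.get("JL_DB_NAME", "janlinders"),
    }
```

The dictionary keys are chosen to line up with the keyword arguments that common MySQL client libraries accept when opening a connection. Reading the values from the environment is one way to keep your password out of version control.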

The scraper automatically creates the following table in your database:

CREATE TABLE `products` (
  `name` varchar(100) NOT NULL,
  `price` double NOT NULL,
  `brand` varchar(45) NOT NULL,
  `weight` varchar(45) NOT NULL,
  `group` varchar(45) NOT NULL,
  PRIMARY KEY (`name`,`price`,`brand`,`weight`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1;
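As an illustration of how a scraped product might map onto this table, here is a minimal sketch. It is not taken from main.py or db.py: the helper `to_row` and the shape of the product dictionary are assumptions. It uses `INSERT IGNORE` so that a product whose (name, price, brand, weight) key already exists is skipped instead of raising a duplicate-key error.

```python
# Column order matches the CREATE TABLE statement above.
COLUMNS = ("name", "price", "brand", "weight", "group")

# `group` is a reserved word in MySQL, so every column name is backtick-quoted.
# The %s placeholders are filled in by the database driver, not by string formatting.
INSERT_SQL = "INSERT IGNORE INTO `products` ({cols}) VALUES ({vals})".format(
    cols=", ".join("`{}`".format(c) for c in COLUMNS),
    vals=", ".join(["%s"] * len(COLUMNS)),
)


def to_row(product):
    """Convert a scraped product dict (hypothetical shape) into a parameter tuple.

    Strings are trimmed and cut to the column widths declared in the table.
    """
    return (
        product["name"].strip()[:100],
        float(product["price"]),
        product.get("brand", "").strip()[:45],
        product.get("weight", "").strip()[:45],
        product.get("group", "").strip()[:45],
    )
```

With a DB-API cursor from a MySQL driver, a row could then be stored with `cursor.execute(INSERT_SQL, to_row(product))`, followed by a commit on the connection.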

Ideas

Improve UI.
Show elapsed time when done.
Scrape the offers page and store the offers in a separate table.
Add an option to choose which groups to scrape.
Build a test to check whether the site structure is still the same.
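The elapsed-time idea from the list above could be sketched as follows. This is not implemented in the project yet; the helper name `format_elapsed` is hypothetical, and the comment marks where the scraping call would go.

```python
import time


def format_elapsed(seconds):
    """Format a duration in seconds as HH:MM:SS (fractions are truncated)."""
    minutes, secs = divmod(int(seconds), 60)
    hours, minutes = divmod(minutes, 60)
    return "{:02d}:{:02d}:{:02d}".format(hours, minutes, secs)


start = time.perf_counter()
# ... the scraping run would happen here ...
print("Done in", format_elapsed(time.perf_counter() - start))
```

`time.perf_counter` is a monotonic clock, so the measurement is not affected if the system time changes during a long run.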
