WebCrawler

This project simulates how a web search engine crawls the web by following the links it finds on each page. Real search engines use sophisticated algorithms to crawl efficiently. For example, they do not follow every link equally often; content that changes often is revisited more often.

This project demonstrates how to crawl a web site using socket programming, which is fundamental to writing internet applications, together with the HTTP application-layer protocol.
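To make the socket and HTTP parts concrete, here is a minimal sketch of fetching one page over a plain TCP socket. It is not taken from crawler.c: the function names build_get_request and fetch_page, the fixed port 80, and the buffer sizes are all assumptions made for illustration, and the actual program may structure this differently.

```c
#define _POSIX_C_SOURCE 200112L
#include <assert.h>
#include <netdb.h>
#include <stdio.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: format a minimal HTTP/1.1 GET request into buf.
   Returns the request length, or -1 if buf is too small. */
int build_get_request(char *buf, size_t cap, const char *host, const char *path)
{
    assert(buf != NULL && host != NULL && path != NULL);
    int n = snprintf(buf, cap,
                     "GET %s HTTP/1.1\r\n"
                     "Host: %s\r\n"
                     "Connection: close\r\n"
                     "\r\n",
                     path, host);
    return (n < 0 || (size_t)n >= cap) ? -1 : n;
}

/* Hypothetical helper: connect to host on port 80 (assumed), send a GET
   for path, and copy the raw response (headers and body) into out.
   Returns the number of bytes stored, or -1 on error. */
long fetch_page(const char *host, const char *path, char *out, size_t cap)
{
    char req[1024];
    int req_len = build_get_request(req, sizeof req, host, path);
    if (req_len < 0 || cap == 0) return -1;

    /* Resolve the host name to one or more socket addresses. */
    struct addrinfo hints, *res, *p;
    memset(&hints, 0, sizeof hints);
    hints.ai_family = AF_UNSPEC;
    hints.ai_socktype = SOCK_STREAM;
    if (getaddrinfo(host, "80", &hints, &res) != 0) return -1;

    /* Try each address until one connection succeeds. */
    int fd = -1;
    for (p = res; p != NULL; p = p->ai_next) {
        fd = socket(p->ai_family, p->ai_socktype, p->ai_protocol);
        if (fd < 0) continue;
        if (connect(fd, p->ai_addr, p->ai_addrlen) == 0) break;
        close(fd);
        fd = -1;
    }
    freeaddrinfo(res);
    if (fd < 0) return -1;

    /* send() may write fewer bytes than requested, so loop. */
    int sent = 0;
    while (sent < req_len) {
        ssize_t k = send(fd, req + sent, (size_t)(req_len - sent), 0);
        if (k <= 0) { close(fd); return -1; }
        sent += (int)k;
    }

    /* Read until the server closes the connection or out is full. */
    size_t total = 0;
    ssize_t k;
    while (total < cap - 1 &&
           (k = recv(fd, out + total, cap - 1 - total, 0)) > 0)
        total += (size_t)k;
    out[total] = '\0';
    close(fd);
    return (long)total;
}
```

Sending Connection: close keeps the sketch simple, because the end of the response is then signalled by the server closing the socket. A fuller crawler would also read the status line and headers such as Content-Length before trusting the body.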

Program Execution

To execute the program:

Clone this repository, then compile the program from a Linux terminal with the provided Makefile by running the command: make
Run the program using the executable name crawler followed by the URL to crawl. Example: crawler http://web1.comp30023
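The URL given on the command line has to be split into a host name (for the connection) and a path (for the HTTP request). The sketch below shows one way this could be done; split_url is a hypothetical helper, not a function confirmed to exist in this repository, and it assumes plain http:// URLs with no explicit port.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical helper: split "http://host/path" into host and path.
   A URL with no path yields "/". Returns 0 on success, -1 if the URL
   does not start with http:// or a buffer is too small. */
int split_url(const char *url, char *host, size_t hcap,
              char *path, size_t pcap)
{
    assert(url != NULL && host != NULL && path != NULL);
    static const char scheme[] = "http://";
    size_t slen = sizeof scheme - 1;
    if (strncmp(url, scheme, slen) != 0) return -1;

    /* The host runs up to the first '/' or the end of the string. */
    const char *h = url + slen;
    size_t hlen = strcspn(h, "/");
    if (hlen == 0 || hlen >= hcap) return -1;
    memcpy(host, h, hlen);
    host[hlen] = '\0';

    /* Everything after the host is the path; default to the root. */
    const char *p = h + hlen;
    if (*p == '\0') p = "/";
    if (strlen(p) >= pcap) return -1;
    strcpy(path, p);
    return 0;
}
```

With the example URL above, this would give the host web1.comp30023 and the path /.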

