Skip to content

☕️ use scrapy to get web news title, news url, news abstract.

Notifications You must be signed in to change notification settings

daisenryaku/news-scrapy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

naive-scrapy

介绍

基于scrapy的爬虫,爬取门户网站新闻标题,新闻链接,新闻摘要,新闻正文,存入MongoDb或者生成csv文件

支持网站:网易,腾讯,新浪,搜狐, 凤凰,新华,中国青年, 中国,火狐中文,人民

运行

爬取一个门户,以爬取网易主页新闻为例:

scrapy crawl 163

一次全部爬取

scrapy crawlall

About

☕️ use scrapy to get web news title, news url, news abstract.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages