BanglaLLM/bd-newspaper-crawlers


bd-newspaper-crawlers

License: MIT. Contributions welcome.

A collection of crawlers for Bangla newspapers and blogs. They can be used to mine Bangla text data for Natural Language Processing (NLP) tasks. Most crawlers target Bangla sites; a few cover English and Hindi sites.
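When mining Bangla text for NLP, crawled pages often mix Bangla with Latin-script menus, ads, or English passages. The sketch below shows one possible way to keep only text that is mostly Bangla, by counting characters in the Bengali Unicode block (U+0980 to U+09FF). It is not part of this repository; the names `bangla_ratio` and `is_mostly_bangla` and the 0.5 threshold are assumptions made for illustration.

```python
def bangla_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters in the Bengali Unicode block."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    # The Bengali block spans U+0980 to U+09FF.
    bangla = sum(1 for c in chars if "\u0980" <= c <= "\u09ff")
    return bangla / len(chars)


def is_mostly_bangla(text: str, threshold: float = 0.5) -> bool:
    """Hypothetical filter: keep text whose Bangla share meets the threshold."""
    return bangla_ratio(text) >= threshold


if __name__ == "__main__":
    for sample in ["বাংলা ভাষা", "hello world", "বাংলা news"]:
        print(sample, is_mostly_bangla(sample))
```

The threshold is a tuning choice: a higher value discards more mixed-language text, while a lower value keeps more of it.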

List of Crawlers

| Sl | Site Name | Site Type | Language | Crawler |
|----|-----------|-----------|----------|---------|
| 1 | Prothom Alo - Bangla | News | Bangla | prothomalo_bn.py |
| 2 | Prothom Alo - English | News | English | prothomalo_en.py |
| 3 | Bangladesh Pratidin | News | Bangla | bdpratidin.py |
| 4 | Kalerkantho | News | Bangla | kalerkantho.py |
| 5 | Daily Inqilab | News | Bangla | inqilab.py |
| 6 | Samakal | News | Bangla | samakal.py |
| 7 | Jugantor | News | Bangla | jugantor.py |
| 8 | Ittefaq - Bangla | News | Bangla | ittefaq_bn.py |
| 9 | Ittefaq - English | News | English | ittefaq_en.py |
| 10 | The Daily Star - Bangla | News | Bangla | daily_star.py |
| 11 | Anandabazar | News | Bangla | anandabazar.py |
| 12 | Zee News - Bangla | News | Bangla | crawler_zeenews.py |
| 13 | Voice of America - Bangla | News | Bangla | crawler_voabangla.py |
| 14 | Hindustan Times - Bangla | News | Bangla | hindustantimes.py |
| 15 | The Business Standard - Bangla | News | Bangla | crawler_tbs.py |
| 16 | Dhaka Tribune | News | Bangla | dhakatribune.py |
| 17 | NTV | News | Bangla | ntvbd.py |
| 18 | Indian Express - Bangla | News | Bangla | indianexpress.py |
| 19 | Ei Samay | News | Bangla | eisamay.py |
| 20 | Amader Shomoy | News | Bangla | dainikamadershomoy.py |
| 21 | Daily Bangladesh | News | Bangla | daily_bangladesh.py |
| 22 | Sangbad Pratidin | News | Bangla | sangbadpratidin.py |
| 23 | 24 Live News | News | Bangla | 24livenews.py |
| 24 | Amra Bondhu | Blog | Bangla | amrabondhu.py |
| 25 | Bangla Blog | Blog | Bangla | banglablog.py |
| 26 | Bangla News 24 | News | Bangla | banglanews24.py |
| 27 | Biggani.org | Blog | Bangla | biggani.py |
| 28 | Biggan Blog | Blog | Bangla | bigganblog.py |
| 29 | Biggan Projukti | Blog | Bangla | bigganprojukti.py |
| 30 | Bigyan | Blog | Bangla | bigyan.py |
| 31 | Cadet College Blog | Blog | Bangla | cadetcollegeblog.py |
| 32 | cpbook by Subeen | Blog | Bangla | cpsubeen.py |
| 33 | Porjotonlipi | Blog | Bangla | crawler_porjotonlipi.py |
| 34 | Tagore Web | Blog | Bangla | crawler_tagoreweb.py |
| 35 | Dakghar | News | Bangla | dakghar.py |
| 36 | Dmp News | News | Bangla | dmpnews.py |
| 37 | hindime | Blog | Hindi | hindime.py |
| 38 | Jagran | News | Hindi | jagran.py |
| 39 | Nirbik | Blog | Bangla | nirbik.py |
| 40 | Onnodristy | News | Bangla | onnodristy.py |
| 41 | Department of Agricultural Extension | Govt. Portal | Bangla | portalgov.py |
| 42 | Sastha Bangla | Blog | Bangla | sasthabangla.py |
| 43 | Shopnobaz | Blog | Bangla | shopnobaz.py |
| 44 | Songramer Notebook | Blog | Bangla | songramernotebook.py |
| 45 | Subeen | Blog | Bangla | subeen.py |
| 46 | Tech Tunes | Blog | Bangla | techtunes.py |
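Each crawler listed above is a separate Python script for one site. As a rough illustration of the text-extraction step such a script might perform, the sketch below pulls paragraph text out of an HTML string using only the standard library. It is not taken from any script in this repository: `ParagraphExtractor` and `extract_paragraphs` are hypothetical names, and a real crawler would also need to fetch pages over HTTP and apply site-specific selection rules.

```python
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collect the text of every <p> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._depth = 0      # greater than 0 while inside a <p> element
        self._buffer = []    # text fragments of the current paragraph
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                # Join fragments and collapse runs of whitespace.
                text = " ".join("".join(self._buffer).split())
                if text:
                    self.paragraphs.append(text)
                self._buffer = []

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)


def extract_paragraphs(html: str) -> list[str]:
    """Return the cleaned text of each paragraph in an HTML string."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return parser.paragraphs


if __name__ == "__main__":
    sample = "<h1>শিরোনাম</h1><p>প্রথম  অনুচ্ছেদ</p><p>Second <b>para</b></p>"
    print(extract_paragraphs(sample))
```

Text outside `<p>` elements, such as headings and script content, is ignored, which is a simplification; many news sites keep article text in other elements.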


Languages

  • Python 100.0%