Git scraping: track changes over time by scraping to a Git repository

  • Simon Willison on gitscraping
  • git-scraping (GitHub Topics)
  • track changes over time by scraping to a Git repository | Hacker News
  • Raspando dados com o GitHub Actions e analisando com Datasette (Scraping data with GitHub Actions and analyzing it with Datasette)

Examples

git-history

Reads through the entire history of a file and generates a SQLite database reflecting changes to that file over time.

  • git-history: a tool for analyzing scraped data collected using Git and SQLite
  • git-history - a tool for Datasette
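A minimal sketch of the idea, not git-history's actual implementation: given successive snapshots of a scraped JSON file (oldest first), store one row per item version and skip versions that did not change. The `load_snapshots` function, the `item_version` table, and the `id` key are hypothetical names used for illustration; in practice the snapshots would come from walking the Git log of the file.

```python
import json
import sqlite3


def load_snapshots(conn, snapshots, id_key="id"):
    """Store one row per item version, skipping unchanged items.

    `snapshots` is an iterable of (commit_hash, commit_at, items) tuples,
    oldest first, where `items` is the parsed JSON list from that commit.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS item_version ("
        "item_id TEXT, commit_hash TEXT, commit_at TEXT, data TEXT)"
    )
    last_seen = {}  # item_id -> serialized form most recently stored
    for commit_hash, commit_at, items in snapshots:
        for item in items:
            item_id = str(item[id_key])
            # sort_keys gives a stable serialization, so key order alone
            # never counts as a change
            data = json.dumps(item, sort_keys=True)
            if last_seen.get(item_id) == data:
                continue
            conn.execute(
                "INSERT INTO item_version VALUES (?, ?, ?, ?)",
                (item_id, commit_hash, commit_at, data),
            )
            last_seen[item_id] = data
    conn.commit()
```

Querying `item_version` by `item_id` then gives the history of a single record across commits.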

datadesk/california-coronavirus-scrapers: The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.

githubocto/flat-demo-bitcoin-price: A Flat Data GitHub Action demo repo

pierrotsmnrd/flat_data_py_example: How to use Python in Github's Flat Data

https://github.com/githubocto?q=flat-demo


swyxio/gh-action-data-scraping

Instances

Structured data.gov.ro monthly dumps, versioned as dataset activity: https://data.gov.ro/dataset/activity/mecanismul-de-feed-back-al-pacientului-2023

To do: check docs with multiple versions.

Roadmap

Prometeu

  • read target into a local JSON file (in the repository)
    • cmteb
    • andnet
  • add meaningful commit messages
  • save as CSV
  • check if changed before saving
  • write to external repo
  • build to Datasette, FlatGithub
  • get historical data
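
A rough sketch of three roadmap items (check if changed before saving, save as CSV, meaningful commit messages), assuming the scraped target is a list of flat JSON records with an `id` field. All function names here are hypothetical, and fetching the target and committing to Git are left out.

```python
import csv
import json
from pathlib import Path


def serialize(records):
    """Stable JSON text, so identical data always produces identical bytes."""
    return json.dumps(records, indent=2, sort_keys=True, ensure_ascii=False) + "\n"


def save_if_changed(path, records):
    """Write records as JSON only if they differ from the file on disk.

    Returns True when the file was written, False when nothing changed.
    """
    path = Path(path)
    new_text = serialize(records)
    if path.exists() and path.read_text(encoding="utf-8") == new_text:
        return False
    path.write_text(new_text, encoding="utf-8")
    return True


def save_csv(path, records):
    """Write records as CSV, using the union of all keys as columns."""
    fieldnames = sorted({key for row in records for key in row})
    with open(path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(records)


def commit_message(old, new, id_key="id"):
    """Summarize what changed between two scrapes for the commit message."""
    old_ids = {r[id_key] for r in old}
    new_ids = {r[id_key] for r in new}
    added = len(new_ids - old_ids)
    removed = len(old_ids - new_ids)
    return f"Update data: {added} added, {removed} removed, {len(new)} total"
```

A scrape run would call `save_if_changed` first and only write the CSV and commit when it returns True, which keeps the Git history free of empty commits.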