Skip to content

WebArchives powered crawls

Compare
Choose a tag to compare
@boogheta boogheta released this 23 Sep 11:00
· 202 commits to master since this release

ChangeLog:

  • Allow to start crawls on Web Archives to browse disappeared or modified webentities in the past (#372)
  • Allow to setup advanced individual crawl settings (using a specific cookie, adjusting the depth, using a web archive...)
  • Allow to display only crawled pages in a webentity's webpages list
  • Upgraded fake user agents dependency for more recent UAs
  • Add to the API a route to collect crawled webentity's webpages content as clear text instead of zipped base64
  • Minor fixes (#397, #416, #418, 8b8f73f, 3b48755, 6aea48a, f3c1e85, e97b9d0, b05d470, 01aac8a, ...)