Skip to content

Summer 2019 (fix issues with big webentities)

Compare
Choose a tag to compare
@boogheta boogheta released this 06 Aug 10:00
· 581 commits to master since this release

Changelog:

  • Use traph 1.2.0 with paginated queries to fix issues collecting all pages and pagelinks of a single webentity at once (#293), also fasten collecting childentities and cache number of pages by entity during network computation
  • Fix broken WebEntity pages network view
  • Add number of pages per webentity to WebEntities Lists, as well as exports and network view
  • Fix creationrules missing after resetting a corpus (#320)
  • Fix password protected access to corpora
  • Always include homepage as a startpage when crawling a discovered (#322)
  • Fix various crawler errors
  • Allow editing a tag in a single API call instead of removing then adding
  • Add script to trigger backup for all existing corpora