- update experiments
- run NLP experiments
- talk about demos
- ...
/group-mounts/trec1/shared/backup/gov2-data
- 100GB compressed
- 500 GB uncompressed
- https://lintool.github.io/Ivory/docs/exp-gov2.html
/group-mounts/trec1/shared/backup/blogdata
- 16GB compressed (homepages + links)
- 108 GB uncompressed
- http://trec.nist.gov/data/blog06.html
- http://ir.dcs.gla.ac.uk/test_collections/blog06info.html
- can use NDCG