Updated Hadoop Streaming (markdown)
Updated Event specific cluster setup and job information (markdown)
Update key, username and path data
Add a few general data sources and split out Toronto-specific sources left over from last hackathon
Add entry for Common Crawl (unclear whether accessing it from S3 is feasible for the hackathon)
Add pointer to some additional Hubway bikesharing data for Boston
Updated Datasets (markdown)
Added a few links to data sets people might want to request be added to the cluster. -G
Updated Home (markdown)
Updated Amazon review dataset (markdown)
Created Amazon review dataset (markdown)