Skip to content

uiuc-ischool-scanr/WikiCSSH

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WikiCSSH

Binder Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

If you are using WikiCSSH please cite the following papers:

Han, Kanyao; Yang, Pingjing; Mishra, Shubhanshu; Diesner, Jana. 2020. “WikiCSSH: Extracting Computer Science Subject Headings from Wikipedia.” In Workshop on Scientific Knowledge Graphs (SKG 2020).

Han, Kanyao; Yang, Pingjing; Mishra, Shubhanshu; Diesner, Jana. 2020. "WikiCSSH - Computer Science Subject Headings from Wikipedia." University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0424970_V1

Download the WikiCSSH files from: https://doi.org/10.13012/B2IDB-0424970_V1 or from the data/v1 folder.

The data contains the following files:

  • WikiCSSH_categories.csv - Categories in WikiCSSH
  • WikiCSSH_category_links.csv - Links between categories in WikiCSSH
  • Wikicssh_core_categories.csv - Core categories as mentioned in the paper
  • WikiCSSH_category_links_all.csv - Links between categories in WikiCSSH (includes a dummy category called which is parent of isolates and top level categories)
  • WikiCSSH_category2page.csv - Links between Wikipedia pages and Wikipedia Categories in WikiCSSH
  • WikiCSSH_page2redirect.csv - Links between Wikipedia pages and Wikipedia page redirects in WikiCSSH

Examples of using WikiCSSH can be found in the notebooks folder:

How it was generated

Algorithm for generating WikiCSSH