Update National Assembly crawler due to webpage renewal #8

Closed
e9t opened this issue Mar 17, 2014 · 0 comments
e9t commented Mar 17, 2014

The previous crawler broke when the National Assembly's website was renewed. Update it to work with the new pages.

Input

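  1. List of current National Assembly members
  2. Information on each member

    Hovering over a member's name at the link above shows a link of the form javascript:jsMemPop(*someNumber*); the number in parentheses is that member's id in the National Assembly system. Substitute each id into the link below and fetch the resulting page.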
    http://www.assembly.go.kr/assm/memPop/memPopup.do?num=*someNumber*
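
The id extraction described above could be sketched roughly as follows. This is only an illustration: the function names are hypothetical, and the regular expression assumes the id is a plain run of digits (optionally quoted) inside `jsMemPop(...)`, which should be checked against the actual list page. Fetching the pages is left out.

```python
import re

# URL template from this issue; `num` is the member's id.
MEMBER_URL = "http://www.assembly.go.kr/assm/memPop/memPopup.do?num={num}"

# Assumption: the id is a run of digits, possibly quoted, inside jsMemPop(...).
JS_MEM_POP = re.compile(r"jsMemPop\(\s*['\"]?(\d+)['\"]?\s*\)")


def extract_member_ids(list_html):
    """Return member ids in order of first appearance, without duplicates."""
    seen = set()
    ids = []
    for num in JS_MEM_POP.findall(list_html):
        if num not in seen:
            seen.add(num)
            ids.append(num)
    return ids


def member_urls(list_html):
    """Build one detail-page URL per member id found in the list page HTML."""
    return [MEMBER_URL.format(num=num) for num in extract_member_ids(list_html)]
```

Ids are kept as strings so that any leading zeros survive the round trip into the URL.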

Output

  1. Output data format: csv + json
    • Previously only csv was kept; json will now be kept as well to make diffs easier to read.
    • For attributes, refer to the existing data files
  2. README update
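
One possible way to write both output formats is sketched below. The function name and the record keys used in the usage note are hypothetical; the real attributes should follow the existing data files. Sorting keys and using a fixed indent is a choice made here so that the json stays stable between runs and diffs cleanly.

```python
import csv
import json


def write_outputs(members, csv_path, json_path):
    """Write a list of member dicts to both csv and json."""
    # Union of all keys, sorted so the column order is stable across runs.
    fieldnames = sorted({key for member in members for key in member})

    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(members)  # missing keys are written as empty cells

    with open(json_path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps Korean text readable in the file;
        # sort_keys and indent keep line-based diffs small.
        json.dump(members, f, ensure_ascii=False, indent=2, sort_keys=True)
        f.write("\n")
```

For example, `write_outputs([{"id": "1234", "name": "A"}], "members.csv", "members.json")` would produce one csv row and a one-element json array, with the file names here being placeholders.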
e9t added the bug label Mar 17, 2014
e9t closed this as completed Apr 15, 2014