Import Malaysia case data #510

Open
iamleeg opened this issue Jul 9, 2020 · 3 comments

Comments

@iamleeg
Contributor

iamleeg commented Jul 9, 2020

https://www.outbreak.my/stats

@iamleeg iamleeg added this to the Milestone 3: Beta release milestone Jul 9, 2020
@dixitaayush8 dixitaayush8 self-assigned this Aug 31, 2020
@sratcliffe118 sratcliffe118 removed this from the Milestone 3: Beta release milestone Sep 16, 2020
@sratcliffe118 sratcliffe118 added this to the Post Launch milestone Oct 12, 2020
@calremmel calremmel removed this from the Post Launch milestone Mar 2, 2021
@joe-brilliant joe-brilliant added this to the Holding Bin milestone Sep 10, 2021
@joe-brilliant joe-brilliant removed this from the Holding Bin milestone Feb 24, 2022
@rbevansp
Collaborator

rbevansp commented Jul 22, 2022

@abhidg can I have a go at this issue using the case line list from the Ministry of Health Malaysia, https://moh-malaysia-covid19.s3.ap-southeast-1.amazonaws.com/linelist_cases.parquet ?

See data setup:
[Screenshot of the data setup, taken 2022-07-22]

Here is the documentation.

The latest cases were added earlier today, i.e. the data is up to date.

The source linked in the issue description, https://www.outbreak.my/stats, does not seem to be up to date.

@abhidg
Contributor

abhidg commented Jul 23, 2022

@rbevansp good find! The data portal does not support parquet (see #2780). Do they have a CSV format?

@rbevansp
Collaborator

@abhidg I can't find a CSV format for this data, and the documentation says:

"From the 4th of June onwards, the cases linelist is accessible as a single file via Amazon S3 in parquet format. Prior to the 4th of June, the cases linelist was split into chunks of 500,000 cases each to manage file size - we have ported it over to Amazon S3 to avoid this practice."

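If the portal can only ingest CSV, one possible workaround is to convert the parquet file to CSV before import. The following is a minimal sketch, not a tested ingestion path: it assumes pandas with a parquet engine (pyarrow or fastparquet) is installed, and `csv_name_for` and `parquet_to_csv` are hypothetical helper names, not part of the data portal.

```python
from pathlib import Path
from urllib.parse import urlparse

# URL quoted earlier in this thread.
LINELIST_URL = (
    "https://moh-malaysia-covid19.s3.ap-southeast-1.amazonaws.com/"
    "linelist_cases.parquet"
)


def csv_name_for(source: str) -> str:
    """Derive a CSV file name from a parquet URL or POSIX path (hypothetical helper)."""
    return Path(urlparse(source).path).with_suffix(".csv").name


def parquet_to_csv(source: str, out_dir: str = ".") -> Path:
    """Read a parquet file and write it back out as CSV (hypothetical helper)."""
    # Imported here so the name helper above works without pandas installed.
    import pandas as pd  # assumption: a parquet engine (pyarrow or fastparquet) is available

    frame = pd.read_parquet(source)  # accepts local paths and http(s) URLs
    destination = Path(out_dir) / csv_name_for(source)
    frame.to_csv(destination, index=False)  # drop the pandas row index
    return destination
```

Calling `parquet_to_csv(LINELIST_URL)` would download the whole line list and write `linelist_cases.csv` to the current directory. The file is likely to be large, since the documentation mentions earlier chunks of 500,000 cases each, so this loads everything into memory at once.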

7 participants