About the data preprocessing code #1

Open
yuanyuansiyuan opened this issue Nov 10, 2019 · 3 comments
Comments

@yuanyuansiyuan

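Hello author,

Could you share the data preprocessing code you used to extract the 33,765 medical cases from the 98,334 cases, or could the raw data for those 33,765 cases be made public? I would like to go a step further and extract the dosage attribute from the 33,765 cases. Thank you!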

@yao8839836
Owner

yao8839836 commented Nov 12, 2019

@yuanyuansiyuan

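Hello. To keep things tidy I deleted that Java file at the time, and I cannot find the code for now.

However, the original 98,334 records are in this file: /data/prescriptions.txt.

If you filter the 98,334 records with the symptom list /data/symptom_contains.txt and the herb list /data/herbs_contains.txt (each record must contain both one of the listed symptoms and one of the listed herbs, using simple string matching), you get the 33,765 records.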
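
The filtering described above could be sketched roughly as follows. This is not the deleted Java file, only a Python illustration under assumptions: the function names `load_terms`, `contains_any`, and `filter_records` are hypothetical, the list files are assumed to hold one term per line in UTF-8, and "contains one symptom and one herb" is read as "contains at least one of each".

```python
def load_terms(path):
    """Read a term list, assuming one term per line (UTF-8). Format is an assumption."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


def contains_any(text, terms):
    """Simple substring matching: True if any term occurs anywhere in text."""
    return any(term in text for term in terms)


def filter_records(records, symptoms, herbs):
    """Keep only records that mention at least one listed symptom AND one listed herb."""
    return [
        record
        for record in records
        if contains_any(record, symptoms) and contains_any(record, herbs)
    ]
```

With the repository files, usage might look like `filter_records(open("data/prescriptions.txt", encoding="utf-8").read().splitlines(), load_terms("data/symptom_contains.txt"), load_terms("data/herbs_contains.txt"))`, assuming one record per line; whether this reproduces exactly 33,765 records depends on details of the original code that are not stated here.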

@moon290

moon290 commented Jun 6, 2020

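If a record contains both one of the listed symptoms and one of the listed herbs, but some of its herbs are not in the herb list /data/herbs_contains.txt, are the herbs that are not in the list removed from that record?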

@yao8839836
Owner

@moon290

Hello, yes, they were removed.
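
One way to picture this behaviour: if each record is reduced to the listed terms that match it, any herb missing from the list simply never appears in the output. The sketch below is an assumption about how that could be done with simple string matching, not the original code; `extract_listed_terms` is a hypothetical name.

```python
def extract_listed_terms(text, vocabulary):
    """Return the vocabulary terms that occur in text (substring match).

    Terms in the text that are not in the vocabulary are dropped implicitly,
    because only vocabulary entries are ever emitted. The result follows
    vocabulary order, not the order in which terms appear in the text.
    """
    return [term for term in vocabulary if term in text]
```

Note that plain substring matching can also match a short herb name inside a longer one, so a list containing both would report both.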
