-
-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Presubmission inquiry: medrxivr #369
Comments
Hello @mcguinlu and many thanks for your presubmission enquiry! In general, we would consider the package in scope. However, there are couple of questions we'd like a bit more info on:
Here are also some initial comments too:
I look forward to some clarification in response to our queries. |
Hi @annakrystalli, thanks for getting back to me, and for your inital comments! Great to hear that the pacakge is provisionally in scope - I've provided responses to the four queries you raised below:
Thanks again for your inital feedback, and do let me know if I can provide any further clarifications! |
Thanks for the clarifications @mcguinlu !
Great and yes, I imagine they will be fine with it. They even be able to provide better access to the data rather than crawling their site.
Depending on the size of the files, I wonder if using pins might be a good solution for this problem. You could also consider getting feedback by posting on rOpenSci Discuss. It's quite an interesting topic.
I saw that. Nice work! Will look out for any feedback. We're pretty much satisfied with response to our comments at this stage, so let's see what the feedback is from your enquiries. |
In the interest of reducing load on reviewers and editors as we manage the COVID-19 crisis, rOpenSci is temporarily pausing new submissions for software peer review for 30 days (and possibly longer). Please check back here again after 17 April for updates. In this period new submissions will not be handled, nor new reviewers assigned. Reviews and responses to reviews will be handled on a 'best effort' basis, but no follow-up reminders will be sent. Other rOpenSci community activities continue. We express our continued great appreciation for the work of our authors and reviewers. Stay healthy and take care of one other. The rOpenSci Editorial Board |
Submitting Author: Luke McGuinness (@mcguinlu)
Repository: https://github.com/mcguinlu/medrxivr
Scope
Please indicate which category or categories from our package fit policies this package falls under: (Please check an appropriate box below.:
Explain how the and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:
medrxivr
allows users to programmatically access and manipulate a snapshot of medRxiv, a preprint respository for papers in medical, clinical, and related health sciences. The snapshot is automatically updated each morning (webscraping/cleaning script can be found here).Who is the target audience and what are scientific applications of this package?
The primary target of this package is systematic reviewers (i.e. me!), who frequently wish both to use more complicated queries (e.g. regular expresssions/Boolean combinations) when searching medRxiv than the official site currrently allows for, and who also wish to be easily able to download the full text PDFs of records matching their search.
medrxivr
helps with both of these challenges. However, anyone who wishes to find and retrieve relevant medRxiv records in R, for example to explore the distribution of preprints by subject area, will find the package useful.Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category?
As far as I am aware, no other package allows users to access medRxiv data in R.
Any other questions or issues we should be aware of?:
medrxivr
is not yet ready for submission - there is some additional functionality that I would like to incorporate into the package and I need to fix one or two issues with the webscraping/data cleaning script. However, I did have two questions it would be great to get feedback on:medrxivr
provides access to a static snapshot of the medRxiv data, which I also maintain, in addition to functions to search for and download relevant full text records. The reason this approach was taken, rather than providing tools for users to search the site dynamically, is that therobots.txt
on medRxiv forbids the scraping of thesearch/
path. In this case, providing functions that will query this path multiple times seemed like a bad idea. However, I want to check that my setup - a package providing access to a snapshot of the data, both of which are maintained by the same person - is an acceptable setup for an rOpenSci package?medrxivr
could be a useful addition tofulltext
. Is it worth contacting the maintainers of this package about intergration before or after submittingmedrxivr
to the peer review process?[Tagging my co-author @L-ENA for reference]
The text was updated successfully, but these errors were encountered: