Suggestion for discussion about conversion from result to data.frame / tibble #247
Comments
Thank you Rainer for putting this together and phrasing everything in such a kind way. As always, I appreciate your insights as a power user of openalexR! My first thoughts:
That said, I agree a major update to v2.0.0 is warranted for openalexR to be more robust to future changes in the API.
Dear Trang, always happy to help - and I really mean what I said. This sounds like a useful discussion! I re-ordered your thoughts a bit, as I think they can be grouped, which should make it easier to find a way forward. OK - the first complex concerns the aim of
I agree that the beauty of I am starting to look at the backward compatibility <-> simplicity for the user. I could envisage a function, which
But the more I think about it, the more complex it gets. Maybe it would be enough to expose robust conversion functions and an example of how a complete conversion function can be assembled (this could be the What would be essential in both cases is that the structures stay the same, so that e.g. if no authors are available, a data.frame with all fields set to NA is returned, and not Have you considered creating graphs (I love plantuml and have written a small package to create these in R: https://github.com/rkrug/plantuml/) which illustrate which fields from OpenAlex are converted to the
Sorry for not being clear here - by caching I mean, for example, that when a large number of works need to be downloaded, the individual pages are saved to disk. The conversion would then read them one after the other and save the converted pages to disk again. In a last step, these would be concatenated. I completely agree that caching the API responses is out of the scope of openalexR.
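To make the page-wise idea concrete, here is a minimal sketch in base R. It assumes each downloaded page is stored as an .rds file holding a list of flat records; the directory layout, the file names and the made-up records are all hypothetical and not something openalexR provides.

```r
# Hypothetical setup: simulate two downloaded pages saved to disk.
page_dir <- tempfile("oa_pages_")
dir.create(page_dir)
pages <- list(
  list(list(id = "W1", title = "First"), list(id = "W2", title = "Second")),
  list(list(id = "W3", title = "Third"))
)
for (i in seq_along(pages)) {
  saveRDS(pages[[i]], file.path(page_dir, sprintf("page_%03d.rds", i)))
}

# Step 1: convert one page at a time and save each converted page to disk,
# so that only a single page is held in memory during conversion.
page_files <- list.files(page_dir, pattern = "^page_[0-9]+[.]rds$",
                         full.names = TRUE)
for (f in page_files) {
  records <- readRDS(f)
  rows <- lapply(records, as.data.frame, stringsAsFactors = FALSE)
  converted <- do.call(rbind, rows)
  out_file <- file.path(dirname(f), paste0("converted_", basename(f)))
  saveRDS(converted, out_file)
}

# Step 2: concatenate the converted pages into one data.frame.
converted_files <- list.files(page_dir, pattern = "^converted_",
                              full.names = TRUE)
works <- do.call(rbind, lapply(converted_files, readRDS))
```

This only works if every converted page has the same columns, which is one more reason for the conversion to return a stable structure.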
Sorry - I do not understand your point here. Nested structures can obviously easily be created and used in R, and also saved as
I think that if this can be achieved, and some lower-level conversion functions can be exported so that one can easily build one's own conversion functions, a lot would be achieved (and there are a few other functions which should be exported, imho).
Good points, and
Oh - and I am using I would be happy to contribute vignettes.
My idea is to declare the function oa2bibliometrix() deprecated.
One point which came to mind: It might be useful to define the different outputs (list, data.frame, for authors, works, etc.) as S3 classes, which would make writing build-on functions to the package easier. S3 would be the easiest.
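As a rough illustration of what such a class could look like, here is a minimal sketch. The class name oa_works, the constructor and the summary method are hypothetical; nothing like this currently exists in openalexR as far as this thread shows.

```r
# Hypothetical constructor: tag a works data.frame with an S3 class while
# keeping it a regular data.frame for all existing code.
new_oa_works <- function(x) {
  stopifnot(is.data.frame(x))
  class(x) <- c("oa_works", class(x))
  x
}

# A method that build-on code could dispatch on.
summary.oa_works <- function(object, ...) {
  cat("OpenAlex works:", nrow(object), "record(s)\n")
  invisible(object)
}

# Made-up data, for illustration only.
works <- new_oa_works(data.frame(id = c("W1", "W2")))
summary(works)
```

Because the object still inherits from data.frame, existing code that expects a data.frame should keep working, while packages building on it could add their own methods for the class.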
Sounds great. I'm happy to start a deprecation plan for oa2bibliometrix.
I think we're overcomplicating the package a bit here. openalexR is simply an API client that helps build queries and returns a list or dataframe/tibble as a result. Therefore, I don't see build-on functions as in-scope for the package.
I do not see any complications on the openalexR by simply adding e.g.
I apologise in advance for the length of this post, but I think it deserves (and needs) the space.
Thanks a lot for a great package - I really love it and it works (most of the time - I will come back to this), is reliable, no crashes, perfect.
Thanks a lot for this and for updating it regularly.
But after the last changes on the OpenAlex side, reports of problems in the conversion of the values returned from OpenAlex into a data.frame have become more and more frequent, and for me things broke down due to broken backward compatibility.
In the same way that OpenAlex is growing more and more in data and usage, the usage of openalexR is increasing as well, and use cases that were not initially envisaged are coming up. Therefore I think that the conversion of the return values to a data.frame should be fixed in a
I think that unfortunately, this is not the case at the moment (which is perfectly understandable given the scope of the package).
My suggestion would be to start a discussion on how a rewrite of the conversion functions could be done which fulfills a list of requirements.
I would start with the following requirements which I see as important:
oa_request() (or oa_fetch_()), and necessary fields are not included in the results to fill the requested data frame, the resulting data.frame should contain NA in these fields, and issue a warning.
A conversion fulfilling these requirements would be extremely valuable and would make the support burden much lighter (I have the impression that most of the issues raised deal with the conversion).
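To illustrate the NA-plus-warning requirement, here is a minimal sketch. The helper record_to_row() is hypothetical and not part of openalexR, the record is made up, and the sketch assumes flat records with scalar fields only.

```r
# Hypothetical helper: convert one record (a named list) into a one-row
# data.frame with a fixed set of columns. Fields that are absent from the
# record are filled with NA and reported in a warning.
record_to_row <- function(record, fields) {
  missing_fields <- setdiff(fields, names(record))
  if (length(missing_fields) > 0) {
    warning(
      "Fields missing from the result, filled with NA: ",
      paste(missing_fields, collapse = ", "),
      call. = FALSE
    )
  }
  values <- lapply(fields, function(f) {
    value <- record[[f]]
    if (is.null(value)) NA else value
  })
  names(values) <- fields
  as.data.frame(values, stringsAsFactors = FALSE)
}

# Made-up record that lacks the requested publication_year field.
record <- list(id = "W1", display_name = "An example work")
row <- record_to_row(record, c("id", "display_name", "publication_year"))
```

The point is that the returned data.frame always has the requested columns, in the requested order, whatever the API response contained.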
I would be happy to help with this (at least certain aspects), but I would like to have some consensus in which direction to move, and an agreed framework in which the conversion function(s) would fit.
With your permission, I would be happy to post this in the rOpenSci Slack channel to get a broader input.
Thanks a lot for your patience of reading through this,
Rainer (a very happy user of openalexR)