-
Notifications
You must be signed in to change notification settings - Fork 492
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Indexing Dataverses in Google Scholar #2717
Comments
👍 I think that the idea of implementing schema.or dataset extension is a great one. |
Would be happy to help in any way I can with this. |
@eugene-barsky thanks for opening this issue! I like the focus on Google Scholar as a use case but I'd like to point out some related ideas:
Does anyone know how this works? Are DSpace and ePrints using sitemaps? |
Here is Google Scholar Anurag Acharya's presentation on how they index in GS - https://media.dlib.indiana.edu/media_objects/avalon:16122. It is fresh from Summer 2015. I'm not sure if Google Scholar actually indexes schema.org tags. However, Anurag goes to great detail in that presentation on what they want to see in GS... |
I just came across an interesting comment from @bnosek at https://groups.google.com/d/msg/openscienceframework/-5sOS4bH-M0/lG4NwxnUAAAJ who says, "At present, the Google Scholar team has expressed interest in only indexing manuscripts/articles, not other research products (data, materials, etc.). That may change over time as these other research products are recognized as unique intellectual contributions." |
Yeah, this is the sense that I have been getting from GS team, specifically E On Tue, Apr 12, 2016 at 9:19 AM, Philip Durbin notifications@github.com
|
It would help if dataverse would use the Dataset markup from schema.org. If you need i could try to make a first draft of it in your templates? |
That would be great, Olof. I've been pushing the idea that data repositories should use schema.org, and help schema.org define better metadata for data. Please share your first draft when you have it! Merce Sent from my iPhone
|
(Please re-direct if this isn't the right place): |
Thanks @adam3smith. We just had another request (via ticketing system) for the experimental metadata schema that you linked. I'll ask the requestor to drop in this issue to add any additional information that's valuable. I'll bring this up in my next meeting with @mcrosas in order to get an idea of where it could fit into our next few releases.
Well said :) |
Thanks @djbrooke . To the extent it's relevant (i.e. as a signal for the degree of uptake we'll see), I just heard from Figshare that they're implementing this schema before the end of the year. |
Should this issue and #2243 be combined? They seem highly related to me. |
This was posted two days ago: https://research.googleblog.com/2017/01/facilitating-discovery-of-public.html . Thanks for pointing it out, @eugene-barsky |
@eugene-barsky does the recent work on #1393 help? |
I don't think that these are related. Google Scholar made a decision not to
index stand alone data repos some time ago, and I don't think that they
have changed their mind since...
E.
…On Fri, Jun 23, 2017 at 6:10 AM, Philip Durbin ***@***.***> wrote:
@eugene-barsky <https://github.com/eugene-barsky> does the recent work on
#1393 <#1393> help?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2717 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKYX-I-XmaNJBR8LoiK-MZlZn67Trcj0ks5sG7kogaJpZM4GagWz>
.
|
The scope of the issue changed during the conversation from asking if Dataverse had been in touch with the Google Scholar team about having our repositories indexed in Google Scholar, to improving dataset discoverability in search engines in general by using schema.org metadata to describe datasets. I'm in favor of closing this issue since Google Scholar still has no plans to index data repos. More discussion and resources about using schema.org metadata is in #2243. Having metadata tags in dataset landing page html (#1393), especially the dataset PID, will help with the first-step approach in #3793, where we would add schema.org metadata to datasets using a DataCite script that needs "the DOI from the page via a "DC.identifier" meta tag." |
@jggautier I'm in favor of closing the issue since both you and @eugene-barsky seem to agree that Google Scholar still has no plans to index data repositories. @eugene-barsky what do you think? If you want, you could open a new issue if this one is getting a bit too sprawling. |
No problem, let's do that.
E.
…On Sun, Jun 25, 2017 at 5:01 PM, Philip Durbin ***@***.***> wrote:
@jggautier <https://github.com/jggautier> I'm in favor of closing the
issue since both you and @eugene-barsky <https://github.com/eugene-barsky>
seem to agree that Google Scholar still has no plans to index data
repositories.
@eugene-barsky <https://github.com/eugene-barsky> what do you think? If
you want, you could open a new issue if this one is getting a bit too
sprawling.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2717 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKYX-OYNtlHGA1a7KWGQyc6h5hd1C4PSks5sHvTbgaJpZM4GagWz>
.
|
Thanks. Closing. |
@eugene-barsky it struck me that Google really emphasized sitemaps in the video at https://www.rd-alliance.org/making-data-discoverable-web-search-engines . Thanks for putting that session on my radar! Can you please open a new issue about sitemaps? I don't have the slides but here's a screenshot from the video from about 24 minutes in: |
@eugene-barsky thanks for opening #4261! |
Our data is only good if people can find/discover it. And in academia, many people are using Google Scholar to search for research. Also, Google Scholar is a place many of our faculty go for tenure and promotion metrics.
As I was reading Google Scholar (GS) inclusion guidelines - https://scholar.google.ca/intl/en/scholar/inclusion.html, and I could see that Institutional repositories are often indexed in GS automatically (DSpace, ePrints, etc)
However, data repositories, even these issuing DOIs, seem not to be indexed. Of course, the vague and unclear scope of GS indexing does not help either. Well, I wrote about it back in 2005 - https://ejournals.library.ualberta.ca/index.php/jchla/article/viewFile/22437/16666
Therefore, I was wondering whether you had any conversations with Google Scholar team to include your Dataverses and/or other to the GS database?
Also, for instance, The National Snow and Ice Data Center implemented the schema.org dataset extension last year to enable crawlers to index their datasets. It is a small, machine-friendly chunk of code that basically tells crawlers that data live here. The nice thing about this is that, rather than actions on the search engine side, the schema.org implementation works for all crawlers... ie. so as independent data crawlers come up to speed, they will be able to see your data in addition to google.
I would be delighted to assist your team in this discoverabilty work with Google Scholar as I have collaborated with them before.
With thanks,
Eugene
The text was updated successfully, but these errors were encountered: