-
Notifications
You must be signed in to change notification settings - Fork 460
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add citations training data from crossref unstructured references #864
Conversation
Thanks a lot @miku ! |
Thanks for the review and please let me know, if there's a way to make them less tough. The set is basically a random shuffle of citation strings from crossref. |
@kermitt2 - would it be better, if I prepare another batch? |
I think you could not see my review (made 21 days ago :( ) |
<tei xmlns="http://www.tei-c.org/ns/1.0" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"> | ||
<listBibl> | ||
<bibl><author>Jasanoff, S.</author> (<date>2005</date>): <title level="m">States of Knowledge. The Co-production of Science and Social Order</title> -<pubPlace>London</pubPlace>: <publisher>Routledge</publisher>.</bibl> | ||
<bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think , 2000-2007
is part of the title.
<bibl><author>Jasanoff, S.</author> (<date>2005</date>): <title level="m">States of Knowledge. The Co-production of Science and Social Order</title> -<pubPlace>London</pubPlace>: <publisher>Routledge</publisher>.</bibl> | ||
<bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> | ||
<bibl><author>Wickel</author>: <title level="a">Über stationäre Paralyse</title>. <title level="j">Allg. Z. Psychiatr.</title> <biblScope unit="volume">71</biblScope>, <biblScope unit="issue">360</biblScope> (<date>1914</date>).</bibl> | ||
<bibl><author>Heinzel, C.</author>: <title level="m">Methoden zur Untersuchung und Optimierung der Kühlschmierung beim Schleifen</title>. <note type="report">Dissertation</note>, <publisher>University of Bremen</publisher>, <pubPlace>Bremen</pubPlace> (<date>1999</date>)</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<orgName>
-> the institution for theses or technical reports
It applies to dissertation too.
<bibl><author>Bojko Krzysztof, Magdalena Góra</author>, <title level="m">Wybrane aspekty polityki Izraela, Stanów Zjednoczonych i Unii Europejskiej wobec Palestyńskiej Władzy Narodowej</title>, 2000-2007, <publisher>Księgarnia Akademicka</publisher>, <pubPlace>Kraków</pubPlace> <date>2007</date>.</bibl> | ||
<bibl><author>Wickel</author>: <title level="a">Über stationäre Paralyse</title>. <title level="j">Allg. Z. Psychiatr.</title> <biblScope unit="volume">71</biblScope>, <biblScope unit="issue">360</biblScope> (<date>1914</date>).</bibl> | ||
<bibl><author>Heinzel, C.</author>: <title level="m">Methoden zur Untersuchung und Optimierung der Kühlschmierung beim Schleifen</title>. <note type="report">Dissertation</note>, <publisher>University of Bremen</publisher>, <pubPlace>Bremen</pubPlace> (<date>1999</date>)</bibl> | ||
<bibl><author>Benguigui Y.</author> <title level="m">Infecções Respiratórias Agudas: Fundamentos Técnicos das Estratégias de Controle</title>. Série HCT / AIEPI -8.P. <pubPlace>Washington, DC</pubPlace>, OPS; c<date>1997</date>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<title level="s">Série HCT / AIEPI</title> -<biblScope unit="volume">8</biblScope>.P.
After some Google check, 8
is the volume for sure, but I could not clarify the P
.
<bibl><author>Birkmann, J., Bach, C., Guhl, S., Witting, M., Welle, T. and Schmude, M.</author> (<date>2010</date>) '<title level="m">State of the Art der Forschung zu kritischen Infrastrukturen am Beispiel Strom/Stromausfall</title>', <title level="s">Schriftenreihe Sicherheit, Forschungsforum Öffentliche Sicherheit der FU Berlin</title>.</bibl> | ||
<bibl>/// <author>Szacki J.</author> <date>2002</date>. <title level="m">Historia myśli socjologicznej</title>, <publisher>Wydawnictwo Naukowe PWN</publisher>.</bibl> | ||
<bibl><author>C. Seaman, & V. Basili</author>, \"<title level="a">An empirical study of communication in code inspections</title>\", in <title level="m">Proceedings of the 19th International Confer- ence on Software Engineering</title>, <date>1997</date>, pp. <biblScope type="page">96-106</biblScope>.</bibl> | ||
<bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. <note>Paper presented at the Australian Association for Research in Education (AARE)</note>, Fremantle, December.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. Paper presented at the <title level="m">Australian Association for Research in Education (AARE)</title>, <pubPlace>Fremantle</pubPlace>, <date>December</date>.</bibl>
This is apparently a conference event.
<bibl><author>C. Seaman, & V. Basili</author>, \"<title level="a">An empirical study of communication in code inspections</title>\", in <title level="m">Proceedings of the 19th International Confer- ence on Software Engineering</title>, <date>1997</date>, pp. <biblScope type="page">96-106</biblScope>.</bibl> | ||
<bibl><author>Silins, H. and Mulford, W.</author> (<date>2001</date>). <title level="a">Reframing Schools: The Case for System, Teacher and Student Learning</title>. <note>Paper presented at the Australian Association for Research in Education (AARE)</note>, Fremantle, December.</bibl> | ||
<bibl><author>V. F. Stolba</author>, « <title level="a">Graffiti and Dipinti</title> », p. <biblScope type="page">229</biblScope>, H 2, pl. 150, 156.</bibl> | ||
<bibl><author>Ekingen, G.</author> <date>2004</date>. <title level="m">>A key to marine fishes of Turkey</title> <note>(in Turkish)</note>. <publisher>Mersin Üniversitesi Yayınları</publisher> No:12, Su Ürünleri Fakültesi Yayınları No:4, <pubPlace>Mersin</pubPlace>, 193 s.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tough one :)
<bibl><author>Ekingen, G.</author> <date>2004</date>. <title level="m">A key to marine fishes of Turkey</title> <note>(in Turkish)</note>. <title level="s">Mersin Üniversitesi Yayınları</title> No:<biblScope type="volume">12</biblScope>, <title level="s">Su Ürünleri Fakültesi Yayınları</title> No:<biblScope type="volume">4</biblScope>, <pubPlace>Mersin</pubPlace>, <biblScope type="page">193</biblScope> s.</bibl>
<bibl><author>A. v. Griesheim, W. Koehs, E. Pflfiger</author>, <title level="m">Beitr~ge zur Physiologie der Zeugung</title>; namentlieh die 2. Abh <author>Pflfiger, E.</author>, <title level="a">Einige Beobach- tungen fiber die das Geschlecht bestimmenden Ursachen</title>. <title level="j">Pfiiiger's Archiv</title>, Bd. <biblScope unit="volume">XXVI</biblScope>, p. <biblScope type="page">237--258</biblScope>.</bibl> | ||
<bibl><author>L. ESPOSITO and A. TUCCI</author>, in <title level="m">Proceedings of the Third European Ceramic Society Conference</title>, Madrid, 12?17 September1993, Vol. <biblScope unit="volume">3</biblScope>, edited by <editor>P. DURAN and J. F. FERNANDEZ</editor> (Faenza Editrice Iberica, Castellon de la Plana, <date>1993</date>) p. <biblScope type="page">301</biblScope>.</bibl> | ||
<bibl><author>Junge, M.</author> (<date>2007</date>). <title level="m">Simulationsgestützte Entwicklung und Optimierung einer energieeffizienten Produktionssteuerung.</title> <note type="report">Dissertation</note>. <orgName>Universität Kassel</orgName>.</bibl> | ||
<bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, <date>noviembre</date>. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum? Lectora, 9/2003</note>, <biblScope type="page">3-10</biblScope>.</bibl>
<bibl><author>L. ESPOSITO and A. TUCCI</author>, in <title level="m">Proceedings of the Third European Ceramic Society Conference</title>, Madrid, 12?17 September1993, Vol. <biblScope unit="volume">3</biblScope>, edited by <editor>P. DURAN and J. F. FERNANDEZ</editor> (Faenza Editrice Iberica, Castellon de la Plana, <date>1993</date>) p. <biblScope type="page">301</biblScope>.</bibl> | ||
<bibl><author>Junge, M.</author> (<date>2007</date>). <title level="m">Simulationsgestützte Entwicklung und Optimierung einer energieeffizienten Produktionssteuerung.</title> <note type="report">Dissertation</note>. <orgName>Universität Kassel</orgName>.</bibl> | ||
<bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> | ||
<bibl><author>Zoltman, Gerald and Burger, P h i l i p C.</author>, <title level="m">Marketing Research: Fundamentals and D y n a m i c s</title> , Hinsdale, Ill.: <publisher>The Dryden Press</publisher>, <date>1975</date>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<pubPlace>Hinsdale, Ill<pubPlace>
some OCR error for IL Illinois :)
<bibl><author>BOYNTON, R. S.</author> (<date>1999</date>) <title level="a">¿Quién necesita la filosofía? Entrevista a Martha Nussbaum.</title> <title level="j">The New York Times Magazine</title>, noviembre. <note>Traducción de Carme Castells: ¿Quién teme a Martha Nussbaum?</note> Lectora, 9/2003, <biblScope type="page">3-10</biblScope>.</bibl> | ||
<bibl><author>Zoltman, Gerald and Burger, P h i l i p C.</author>, <title level="m">Marketing Research: Fundamentals and D y n a m i c s</title> , Hinsdale, Ill.: <publisher>The Dryden Press</publisher>, <date>1975</date>.</bibl> | ||
<bibl><author>Keupp, H./Röhrle, B.</author>: <title level="m">Soziale Netzwerke</title>. <pubPlace>Frankfurt a. M.</pubPlace> <date>1987</date></bibl> | ||
<bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In: <editor>Munanga, Kabengele</editor>.
(after google check)
<bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> | ||
<bibl><author>B�ssler R</author> (<date>1974</date>) <title level="a">Pathologische Anatomie der Gallenwegserkrankungen</title>. In: <editor>Becker V</editor> (Hrsg) <title level="m">Gastroenterologie und Stoffwechsel, Aktionen und Interaktionen</title>. <publisher>Witzstrock</publisher>, <pubPlace>Baden-Baden</pubPlace></bibl> | ||
<bibl>Vgl. zu den Merkmalen der Informationsqualität <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope> sowie allgemein zu den Eigenschaften von Informationen <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff.</biblScope></bibl> | ||
<bibl><author>Goldstein</author>: <title level="m">Handbuch der inneren Medizin</title> von <author>Mohr und Staehelin</author>, Bd. <biblScope unit="volume">5</biblScope>, 1. <date>1925</date>.</bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's not easy to understand this one. Is Goldstein
an author? (he's not an author of the handbuch
. What is the 1.
?
We might better drop the example?
<bibl><author>Keupp, H./Röhrle, B.</author>: <title level="m">Soziale Netzwerke</title>. <pubPlace>Frankfurt a. M.</pubPlace> <date>1987</date></bibl> | ||
<bibl><author>SILVA, A. C. da.</author> <title level="a">A desconstrução da Discriminação no Livro didático</title>. In: <author>Munanga, Kabengele</author>. <title level="m">Superando o Racismo na escola</title>. <pubPlace>Brasília</pubPlace>: <orgName>Ministério da Educação, Secretaria de Educação continuada, Alfabetização e Diversidade</orgName>, p. <biblScope type="page">21-37</biblScope>. <date>2005</date>.</bibl> | ||
<bibl><author>B�ssler R</author> (<date>1974</date>) <title level="a">Pathologische Anatomie der Gallenwegserkrankungen</title>. In: <editor>Becker V</editor> (Hrsg) <title level="m">Gastroenterologie und Stoffwechsel, Aktionen und Interaktionen</title>. <publisher>Witzstrock</publisher>, <pubPlace>Baden-Baden</pubPlace></bibl> | ||
<bibl>Vgl. zu den Merkmalen der Informationsqualität <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope> sowie allgemein zu den Eigenschaften von Informationen <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff.</biblScope></bibl> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It would be 2 references here I think:
<bibl>Vgl. zu den <title level="m">Merkmalen der Informationsqualität</title> <author>Weißenberger</author> (<date>1997</date>). S. <biblScope type="page">35</biblScope></bibl>
<bibl>sowie allgemein zu den <title level="m">Eigenschaften von Informationen</title> <author>Wild</author> (<date>1982</date>), S. <biblScope type="page">124ff</biblScope>.</bibl>
Hi @miku ! I am preparing slowly a new Grobid release... shall I update the annotations myself and merge the PR? |
Corrected ref data from PR #864
A follow up on #854.