problem with whitespaces around = #53

HedvigS · 2022-01-14T15:56:16Z

I've discovered that when I have an entry like this:

@book{fassberg2019modern,
  title      = {Languages of the Eastern Section: Great Lakes to Indian Ocean},
author={Fassberg, Steven E},
  lgcode={west2763},
  hhtype={overview},
  pages={632652},
  year={2019},
  publisher={Routledge}
}

I get a table that looks like this from bib2df::bib2df()

CATEGORY	BIBTEXKEY	ADDRESS	ANNOTE	AUTHOR	BOOKTITLE	CHAPTER	CROSSREF	EDITION	EDITOR	HOWPUBLISHED	INSTITUTION	JOURNAL	KEY	MONTH	NOTE	NUMBER	ORGANIZATION	PAGES	PUBLISHER	SCHOOL	SERIES	TITLE	TYPE	VOLUME	YEAR	AUTHOR..FASSBERG.	LGCODE..WEST2763..	HHTYPE..OVERVIEW..	PAGES..632652..	YEAR..2019..	PUBLISHER..ROUTLEDGE.
BOOK	fassberg2019modern																					Languages of the Eastern Section: Great Lakes to Indian Ocean				Fassberg, Steven E	west2763	overview	632652	2019	Routledge

I've isolated the problem down to the lack of whitespaces before and after the equal sign at the field assignment. It's an easy fix, I basically just inserted whitespaces before and after every equal sign before a curly bracket, but it was a bit frustrating to debug. Can this be included in the documentation, or fixed?

The text was updated successfully, but these errors were encountered:

agricolamz · 2022-05-13T09:30:42Z

I've spent half an hour for figuring out that it was spaces, not the uppercase categories...

HedvigS · 2022-05-13T11:53:06Z

I've spent half an hour for figuring out that it was spaces, not the uppercase categories...

Haha oh no! I'm sorry!

nucleic-acid · 2022-06-12T13:12:11Z

Hi, could this be fixed by refining the regular expressions in bib2df_gather.R?
Would you accept a pull request on this?

HedvigS · 2022-12-06T15:10:15Z

Here's a hacky solution for desperate folks in the meantime ^^

https://hedvigsr.tumblr.com/post/702901773084524544/bib2df-bug-hacky-solution

nguyentruonglt · 2023-08-11T09:42:17Z

I have the same problem. But I have a bibtex file with 3000 citation. It's extremely exhausting to add spaces before and after equal signs (=) manually. Do you know any solution to do it automatically? Do R or any tools support us to do it?

agricolamz · 2023-08-11T11:08:55Z

The bib-files are plain texts, so you can do with it whatever you want. If I were you, I'd do something like this:

library(tidyverse)

read_lines("your_bib_file.bib") |> 
  str_replace_all("=", " = ") |> # add desired spaces
  str_replace_all("\\s{2,}", " ") |>  # remove double spaces in case you have it
  write_lines("your_bib_file.bib")

I didn't check the code on real files, but I'm pretty confident that it should work.

HedvigS · 2023-08-12T10:26:38Z

@nguyentruonglt here's my scripted solution:

Here's a hacky solution for desperate folks in the meantime ^^

https://hedvigsr.tumblr.com/post/702901773084524544/bib2df-bug-hacky-solution

HedvigS · 2023-08-12T10:27:33Z

@nguyentruonglt here's my scripted solution:

Here's a hacky solution for desperate folks in the meantime ^^
https://hedvigsr.tumblr.com/post/702901773084524544/bib2df-bug-hacky-solution

This is the function I used:

add_spaces_for_bib2df <- function(bib_fn){

new_fn <- paste0( str_replace(bib_fn, ".bib", ""), "_sep", ".bib")

  read_lines(bib_fn) %>% 
  str_replace_all(regex("\\=\\{"), regex(" \\= \\{")) %>% 
  write_lines(new_fn)
}

HedvigS · 2024-04-03T12:41:07Z

@giabaio I'd like to help by adjusting bib2df_gather and adjust one of the regexes and make a PR, like @nucleic-acid suggests. But, I'm struggling a bit with parsing the function and I'm concerned I'd cause problems unknowingly. I've made a suggesting in PR #59

HedvigS mentioned this issue Jan 14, 2022

replace bib2df SimonGreenhill/rcldf#13

Open

mattwarkentin mentioned this issue Jun 9, 2022

bib2df parsing issues nucleic-acid/namedropR#49

Open

HedvigS mentioned this issue Apr 3, 2024

Update bib2df_read.R #59

Merged

HedvigS closed this as completed Apr 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

problem with whitespaces around = #53

problem with whitespaces around = #53

HedvigS commented Jan 14, 2022 •

edited

Loading

agricolamz commented May 13, 2022

HedvigS commented May 13, 2022

nucleic-acid commented Jun 12, 2022

HedvigS commented Dec 6, 2022

nguyentruonglt commented Aug 11, 2023

agricolamz commented Aug 11, 2023

HedvigS commented Aug 12, 2023

HedvigS commented Aug 12, 2023 •

edited

Loading

HedvigS commented Apr 3, 2024

problem with whitespaces around = #53

problem with whitespaces around = #53

Comments

HedvigS commented Jan 14, 2022 • edited Loading

agricolamz commented May 13, 2022

HedvigS commented May 13, 2022

nucleic-acid commented Jun 12, 2022

HedvigS commented Dec 6, 2022

nguyentruonglt commented Aug 11, 2023

agricolamz commented Aug 11, 2023

HedvigS commented Aug 12, 2023

HedvigS commented Aug 12, 2023 • edited Loading

HedvigS commented Apr 3, 2024

HedvigS commented Jan 14, 2022 •

edited

Loading

HedvigS commented Aug 12, 2023 •

edited

Loading