fix(data): correct page and line numbers #1791
0155.json had many page number issues.
Since there are files for "source_page" 156 and 157, how does JSON 155 contain lines that go all the way to 158? The only other confusion I have is how source_line was figured out.
There is only
For which composition?
Looking at the branch it seems 156 and 157 exist:
Just wondering how any of them were done, e.g. programmatically or manually?
I just realized that you've used the SGGS PDF, but you've also made changes to multiple compositions. So instead, I wonder what the procedure was for Sri Dasam Granth, for instance? Was this done manually?
@bhajneet it's mentioned in the PR description.
How was the line number/source line obtained programmatically from a PDF? Were there newline characters or something you used?
I should clarify: it's a re-import of the page numbers from Kulbir S. Thind's original DB, which is based on the SGPC PDF (which Thind also created). As mentioned in #1763, our original source for the metadata was iGurbani. This also raises the question of whether our author ids are correctly assigned.
@sarabveer please track exploration of author ids in a new issue
Summary of PR
For SGGS, the page and line numbers were corrected according to the SGGS PDF available on SGPC.net.
I made a CSV file with 3 columns:
id, source_page, source_line
This was then imported into the DB using the following query:
Ideally, this would be done through #1388. I would be able to feed the bot the CSV file, and it could commit the changes automatically.
For Sri Dasam Granth, the changes were made manually, using the ones mentioned by @SSAT in #1763.
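For readers who want to picture the CSV import step, here is a minimal sketch in Python. It is not the query used in this PR. It assumes a SQLite database, a hypothetical table named `lines` with `id`, `source_page`, and `source_line` columns, and a CSV file with a header row matching the three columns listed above. The function name `apply_page_line_csv` is also made up for illustration.

```python
import csv
import io
import sqlite3


def apply_page_line_csv(conn, csv_file):
    """Apply source_page/source_line corrections from a CSV to a `lines` table.

    Assumptions (not taken from the PR): the table is called `lines`, the CSV
    has a header row `id,source_page,source_line`, and ids are unique.
    Returns the number of rows that were actually updated.
    """
    reader = csv.DictReader(csv_file)
    updated = 0
    # `with conn` wraps the updates in one transaction: all or nothing.
    with conn:
        for row in reader:
            cur = conn.execute(
                "UPDATE lines SET source_page = ?, source_line = ? WHERE id = ?",
                (int(row["source_page"]), int(row["source_line"]), row["id"]),
            )
            # rowcount is 0 when the id in the CSV does not exist in the table.
            updated += cur.rowcount
    return updated


if __name__ == "__main__":
    # Tiny in-memory demo with made-up ids and values.
    demo = sqlite3.connect(":memory:")
    demo.execute(
        "CREATE TABLE lines (id TEXT PRIMARY KEY, source_page INTEGER, source_line INTEGER)"
    )
    demo.execute("INSERT INTO lines VALUES ('A1', 155, 3)")
    sample = io.StringIO("id,source_page,source_line\nA1,156,1\n")
    print(apply_page_line_csv(demo, sample))
```

Comparing the returned count with the number of CSV rows gives a quick check that every id in the file matched a row in the database.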
Time spent on PR
nine_thousand_years
Linked issues
Fix #1763
Closes #1764
Closes #1765