We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
After calling pdf_text; i got the text, nevertheless some pages are clipped. Also missing data.
it's similar to the landscape problem, but not the same. As not all pages are same size. Also some data is missing
im calling the function directly on the file , no other configurations
reference issue: #7
Script
# after downloading the file and saving it as 0003_PDF198_206_articulo.pdf current_pdf <- '0003_PDF198_206_articulo.pdf' pdf_ejemplo <- paste0(current_pdf) texto_extraido <- pdf_text(pdf_ejemplo) pdf_output_file_name <- str_replace(current_pdf,".pdf",".txt") pdf_output_file <- paste0(pdf_output_file_name) write.table(x=texto_extraido,file = pdf_output_file,row.names = FALSE,col.names = FALSE,quote = FALSE,fileEncoding = 'UTF-8') pdf_output_file_name
Data
The example PDF: https://revistas.unlp.edu.ar/raab/article/view/198/206 The output of pdf_text: 0003_PDF198_206_articulo.txt
some clipped:
some missing:
Thanks in advance! Also great work with pdftools , love it :D!
The text was updated successfully, but these errors were encountered:
Copy of the pdf file: document.pdf
Sorry, something went wrong.
No branches or pull requests
After calling pdf_text; i got the text, nevertheless some pages are clipped. Also missing data.
it's similar to the landscape problem, but not the same. As not all pages are same size. Also some data is missing
im calling the function directly on the file , no other configurations
reference issue: #7
Script
Data
The example PDF: https://revistas.unlp.edu.ar/raab/article/view/198/206
The output of pdf_text: 0003_PDF198_206_articulo.txt
some clipped:
some missing:
Thanks in advance! Also great work with pdftools , love it :D!
The text was updated successfully, but these errors were encountered: