-
-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DocX Reader Bug: <w:instrText>HYPERLINK \l "bm1"</w:instrText> parsed incorrectly #9246
Comments
What does
mean? Can you point to documentation? Does |
\l means location. Either by itself as shown in my example (internal file), but I believe also as a second argument to a remote file like so: I have not been successful at finding documentation at any official sources. Snippet from docx file where in ms word the links works fine: This should have created an internal link to #CEGEIDCF, however it creates a link to "\l" Pandoc v. 3.1.9 |
Can you upload a small docx that uses this, so I can test? |
I just tested and confirmed that the issue occurs when there is no argument in-front of the location switch (\l) This works and is parsed as "http://example.com#hash" This does not work because of the missing expected parameter ("http://example.com") I think the first argument should be optional. However I'm not able to find any official documentation that supports this. An additional gotcha: I have seen cases of multiple (two) whitespaces in-front of the \l switch. Example: |
Can I please get an update on the status of this issue? Will it be fixed or ignored? |
Just a friendly remark: this is the kind of comment that can demoralize open-source maintainers, who are working on a volunteer basis. We are not the service desk of a company whose product you have purchased. If there were an update on this issue, it would have been posted here, on the issue tracker. Will the issue be ignored? No, it is not being ignored. Will it have as high a priority for your volunteer maintainer as it does for you? That's unlikely. |
Thank you, and sorry if I came across as rude. I am fully aware of the situation you are describing. |
Hi!
I have been trying to figure out why some links are converted to <a href="\l">.. when converting from docx (legacy MS Word documents) to Epub3. I have narrowed it down to the following:
Does not work:
<w:instrText>HYPERLINK \l "bm1"</w:instrText>
Works:
<w:instrText>HYPERLINK "#bm1"</w:instrText>
Also the <w:hyperlink> element works.
Relevant code:
pandoc/src/Text/Pandoc/Readers/Docx/Fields.hs
Line 108 in 5bd301e
Best regards
Rune
The text was updated successfully, but these errors were encountered: