Fix realpath on Windows #13582

malmaud · 2015-10-13T15:16:10Z

There was a small inefficiency that would usually cause the call to GetFullPathName to be made twice. See #13542 (diff).

stevengj · 2015-10-13T16:04:06Z

base/path.jl

-        if (p < buflength)
-            resize!(buf, p+1)
-            return utf8(UTF16String(buf))
+        if (p > length(buf))


Should be p+1 > length(buf), I think. According to the documentation, the return value is:

If the function succeeds, the return value is the length, in TCHARs, of the string copied to lpBuffer, not including the terminating null character.
If the lpBuffer buffer is too small to contain the path, the return value is the size, in TCHARs, of the buffer that is required to hold the path and the terminating null character.

In particular, note the inconsistency: on success the return value excludes the terminating null code unit, but when the buffer is too small it includes it.

The same correction is needed for longpath, since the docs for GetLongPathName are similar.

Actually, I don't think it's possible for p==length(buf), which would be the only case affected by this change.
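To make the return-value contract quoted above concrete, here is a minimal sketch in Python. It is a model, not the Julia code in base/path.jl: `fake_get_full_path_name` is a hypothetical stand-in that only mimics the documented return values (it does no path resolution), and `full_path` is an assumed name for the resize-and-retry loop. It assumes one code unit per character for simplicity.

```python
def fake_get_full_path_name(path, buf):
    """Hypothetical stand-in that mimics only the documented return values."""
    units = [ord(c) for c in path]   # assumption: one code unit per character
    needed = len(units) + 1          # room for the terminating NUL
    if needed > len(buf):
        return needed                # too small: required size INCLUDING the NUL
    buf[:needed] = units + [0]
    return len(units)                # success: length EXCLUDING the NUL


def full_path(path, initial_size=4):
    """Call once, and call again only if the buffer was too small."""
    buf = [0] * initial_size
    calls = 0
    while True:
        calls += 1
        p = fake_get_full_path_name(path, buf)
        if p > len(buf):             # failure: p is the size to allocate
            buf = [0] * p
            continue
        # success: p < len(buf), so p == len(buf) never occurs
        return "".join(chr(u) for u in buf[:p]), calls
```

In this model a success always returns at most `len(buf) - 1` and a failure always returns more than `len(buf)`, which is why `p > len(buf)` and `p + 1 > len(buf)` pick out the same cases.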

stevengj · 2015-10-13T16:49:29Z

In fact, I see a lot of utf8(UTF16String(buf)) calls. Probably we should just have a utf8(x::Vector{UInt16}, len=length(x)) to convert from a UInt16 buffer. If the buffer is NUL-terminated, you would call utf8(x, length(x)-1).
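As a rough illustration of the proposed helper, here is a Python sketch of a conversion from a buffer of UTF-16 code units to UTF-8 bytes with an optional length. The name `utf8_from_utf16_units` and the little-endian packing are assumptions made for this example; the function proposed in the comment above is the Julia method `utf8(x::Vector{UInt16}, len=length(x))`.

```python
def utf8_from_utf16_units(units, length=None):
    """Transcode the first `length` UTF-16 code units to UTF-8 bytes.

    `units` is a list of integers in 0..0xFFFF. The name and signature are
    hypothetical; they only mirror the shape of the proposed helper.
    """
    if length is None:
        length = len(units)
    # Pack each code unit as two little-endian bytes, then decode as UTF-16.
    raw = b"".join(u.to_bytes(2, "little") for u in units[:length])
    return raw.decode("utf-16-le").encode("utf-8")


# A NUL-terminated buffer: pass len(buf) - 1 to drop the terminator.
buf = [0x0043, 0x003A, 0x005C, 0x00E9, 0x0000]
path_bytes = utf8_from_utf16_units(buf, len(buf) - 1)
```

Passing the length explicitly avoids copying or resizing the buffer just to strip the terminator before conversion.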

malmaud · 2015-10-13T17:32:48Z

I'm not sure it's a great idea for utf8 to assume that the data is intended to be interpreted as UTF-16 just because it's of type Vector{UInt16}. That might muddy the waters between encoding scheme and backing storage. But certainly all these utf8(UTF16String(.)) constructs are unfortunate.

stevengj · 2015-10-13T18:12:14Z

@malmaud, utf8 already assumes a Ptr{UInt8} is UTF-8 encoded, and utf16 assumes a Ptr{UInt16} is UTF-16 encoded. And utf16(a::Vector{UInt16}) already exists and assumes a is UTF-16 data. So, utf8(::Vector{UInt16}) would be consistent. It would also be really weird to pass a Vector{UInt16} to a utf* conversion function and expect it to be treated as any encoding other than UTF-16.

malmaud · 2015-10-13T18:13:36Z

Ok, I'm convinced. I'll make a PR for that.

vtjnash · 2015-10-30T05:14:44Z

However, utf32(a::Vector{UInt8}) also already exists, and it does not mean that a is UTF-8 data. It means that a is UTF-32 data stored in a byte vector (with unknown endianness): utf32(UInt8['A',0,0,0,'B',0,0,0]) => "AB"

it's best to stick with utf8(UTF16String(data)) because it is then clear that the encoding and transcoding operations are independent.
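The distinction between interpreting stored bytes and transcoding them can be sketched in Python, where the two steps are always separate calls. This is only an analogy to the Julia functions discussed here; the little-endian byte order is an assumption chosen to match the example bytes.

```python
# The same eight bytes as in the utf32 example above.
data = bytes([ord("A"), 0, 0, 0, ord("B"), 0, 0, 0])

# Step 1: state the encoding of the stored bytes (UTF-32, little-endian assumed).
text = data.decode("utf-32-le")

# Step 2: transcode to the target encoding as a separate operation.
utf8_bytes = text.encode("utf-8")

# Interpreting the same bytes as UTF-8 instead yields a different string,
# with the zero bytes kept as NUL characters.
as_utf8 = data.decode("utf-8")
```

Keeping the source encoding explicit at the call site is the property that utf8(UTF16String(data)) preserves.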

ScottPJones · 2015-10-31T21:41:43Z

If you recall, I'd already tried to add more of these conversions, and was made to remove them.
See if Tony will finally let them in.

tkelman · 2015-11-10T16:49:45Z

superseded by #12819?

vtjnash · 2015-11-10T17:01:49Z

yes -- i was thinking of this PR as I made those changes, I just couldn't remember where I had seen this.

Fix realpath on Windows

5af6596

stevengj reviewed Oct 13, 2015
Fix bug in realpath and longpath

d227a6c

malmaud mentioned this pull request Oct 13, 2015

Define more conversions to UTF8String #13588

Closed

vtjnash closed this Nov 10, 2015

tkelman deleted the jmm/windows_realpath branch March 22, 2016 12:31
