Silent downcast overflow #21

crepererum · 2022-01-20T13:11:58Z

When using VarintReader with types smaller than 64bits, the current implementation first reads a 64bit integer:

Lines 40 to 58 in 14cd007

    
           impl VarIntProcessor { 
        
               fn push(&mut self, b: u8) -> Result<()> { 
        
                   if self.i >= 10 { 
        
                       return Err(io::Error::new( 
        
                           io::ErrorKind::InvalidData, 
        
                           "Unterminated varint", 
        
                       )); 
        
                   } 
        
                   self.buf[self.i] = b; 
        
                   self.i += 1; 
        
                   Ok(()) 
        
               } 
        
               fn finished(&self) -> bool { 
        
                   self.i > 0 && (self.buf[self.i - 1] & MSB == 0) 
        
               } 
        
               fn decode<VI: VarInt>(&self) -> Option<VI> { 
        
                   Some(VI::decode_var(&self.buf[0..self.i])?.0) 
        
               } 
        
           }

and then casts it to a smaller integer:

integer-encoding-rs/src/varint.rs

Lines 74 to 77 in 14cd007

    
           fn decode_var(src: &[u8]) -> Option<(Self, usize)> { 
        
               let (n, s) = u64::decode_var(src)?; 
        
               Some((n as Self, s)) 
        
           }

integer-encoding-rs/src/varint.rs

Lines 90 to 93 in 14cd007

    
           fn decode_var(src: &[u8]) -> Option<(Self, usize)> { 
        
               let (n, s) = i64::decode_var(src)?; 
        
               Some((n as Self, s)) 
        
           }

Now imagine a data stream w/ 9 bits 0xff followed by 0x00, which would be a pretty large 64bit integer. If you use VarIntReader::read_varint::<i32>(...) to decode that, it will sucessfully read the 64bit varint, consume all the bytes and during the conversion to 32bit will silently truncate the result. What it should do instead is to read only the number of bytes that are at max required for 32bit (5?) and then fail with "Unterminated varint".

The text was updated successfully, but these errors were encountered:

dermesser · 2022-01-22T19:29:45Z

Yes this is an unsatisfactory state. However, with an unterminated varint error, there would be a fragment being left in the input stream which will give a nonsense number on next read (at least as nonsensical as a truncated int).

Now one could write documentation that the next integer after such an error must be ignored... but that also doesn't seem optimal to me.

In any case, I'm open for debate on this topic.

crepererum · 2022-01-24T08:45:04Z

I think if your read / deserialize data from a stream and get an error, you have to stop reading or re-synchronize the stream (the latter one is mostly only possible for low-level network protocols). I think a user cannot expect a parser to automatically recover from broken data, because there is no way to know which part was broken (in the example case above was it one byte that was broken or was a 32bit value serialized as 64bit? or did we mess up before reading the varint?). It's not only about the "next integer" btw. because likely the protocol the user is trying to deserialize consists of other data types before and after the varint in question.

With your argument, I think you should NEVER return an "unterminated varint error" because you could always argue that there could in theory be a 128bit, 256bit, ... wide varint.

dermesser · 2022-01-25T13:10:41Z

That's a good point, I will look into it.

dermesser · 2022-01-28T21:26:49Z

@crepererum would you mind reviewing the change in #22?

Better size limit for #21

dermesser · 2022-06-23T04:40:40Z

I believe that this issue can be closed with the fix implemented.

dermesser added a commit that referenced this issue Jan 28, 2022

for #21: Respect size limits of varints of different sizes

422691d

dermesser added a commit that referenced this issue Feb 22, 2022

Merge pull request #22 from dermesser/better-size-limit

51be5aa

Better size limit for #21

dermesser closed this as completed Jun 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Silent downcast overflow #21

Silent downcast overflow #21

crepererum commented Jan 20, 2022

dermesser commented Jan 22, 2022

crepererum commented Jan 24, 2022

dermesser commented Jan 25, 2022

dermesser commented Jan 28, 2022 •

edited

Loading

dermesser commented Jun 23, 2022

Silent downcast overflow #21

Silent downcast overflow #21

Comments

crepererum commented Jan 20, 2022

dermesser commented Jan 22, 2022

crepererum commented Jan 24, 2022

dermesser commented Jan 25, 2022

dermesser commented Jan 28, 2022 • edited Loading

dermesser commented Jun 23, 2022

dermesser commented Jan 28, 2022 •

edited

Loading