Improve the pb_decode_varint implementations.

Results on ARM: execution time -4%, code size +1%
1 file changed