arch: arm: cortex_m: make reading tls pointer faster on v7m and v8m.main

Encoding T3 allows for an offset of up to 12bits in size allowing for a
single instruction instead of 3.

Signed-off-by: Wilfried Chauveau <wilfried.chauveau@arm.com>
1 file changed