> The main difference to other compression algorithms, such as Huffman encoding, which have been proposed to produce a variable-length encoding of words for NMT (Chitnis and DeNero, 2015), is that our symbol sequences are still interpretable as subword units, and that the network can generalize to translate and produce new words (unseen at training time) on the basis of these subword units.
I don't see why Huffman encoding wouldn't give you that same interpretability?
Actually, the algorithm for producing a Huffman tree is very similar to that for BPE:
> The process begins with the leaf nodes containing the probabilities of the symbol they represent. Then, the process takes the two nodes with smallest probability, and creates a new internal node having these two nodes as children. The weight of the new node is set to the sum of the weight of the children. We then apply the process again, on the new internal node and on the remaining nodes (i.e., we exclude the two leaf nodes), we repeat this process until only one node remains, which is the root
(from https://en.m.wikipedia.org/wiki/Huffman_coding)
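For concreteness, here's a minimal sketch of that procedure in Python (the function name `huffman_code` and the toy frequencies are mine; I'm using `heapq` to repeatedly pop the two smallest-weight nodes, as in the quoted description):

```python
import heapq
import itertools

def huffman_code(freqs):
    """Build a Huffman code from a {symbol: frequency} dict."""
    counter = itertools.count()  # tie-breaker so heapq never compares trees
    heap = [(weight, next(counter), symbol) for symbol, weight in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        # Take the two nodes with the smallest weights...
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        # ...and create a new internal node whose weight is their sum.
        heapq.heappush(heap, (w1 + w2, next(counter), (left, right)))
    # Walk the remaining root to read off a prefix-free bit string per symbol.
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):          # internal node: (left, right)
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:                                # leaf: an original symbol
            codes[node] = prefix or "0"
    walk(heap[0][2], "")
    return codes

print(huffman_code({"a": 5, "b": 2, "c": 1, "d": 1}))
# e.g. {'b': '00', 'c': '010', 'd': '011', 'a': '1'}
```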
I guess the issue is that Huffman requires the alphabet (and its probabilities) to be fixed up front, whereas BPE "discovers" it as it goes along: every merge adds a brand-new symbol to the vocabulary.
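To make the contrast concrete, here's a similarly minimal sketch of BPE merge learning (again, the function name and toy word frequencies are mine; the `</w>` marker is one common way to encode the paper's end-of-word symbol). Note the inversion relative to Huffman: BPE merges the *most* frequent adjacent pair, and the merged pair becomes a new symbol available to later merges, whereas Huffman merges the two *least* frequent nodes and the symbol alphabet never grows.

```python
from collections import Counter

def learn_bpe(word_freqs, num_merges):
    """Learn BPE merge operations from a {word: frequency} dict."""
    # Start from characters, with a word-end marker so merges
    # can't cross word boundaries.
    vocab = {tuple(word) + ("</w>",): f for word, f in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        # Count all adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)   # the MOST frequent pair
        merges.append(best)
        merged = best[0] + best[1]         # a brand-new symbol is created
        # Replace every occurrence of the pair with the merged symbol.
        new_vocab = {}
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(merged)
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_vocab[tuple(out)] = freq
        vocab = new_vocab
    return merges

print(learn_bpe({"low": 5, "lower": 2, "newest": 6, "widest": 3}, 4))
```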