Notes on BitNet and the slow rise of ternary weight models
Ternary weight models promise a step change in local inference economics. Microsoft's bitnet.cpp is the most concrete piece of infrastructure to date. A practitioner level read on where it is at and what still needs to happen.