My Local LLM On A 32 GB M1 Pro

Sun, 14 Jun 2026 00:00:00 +0000

My pi setup has a fast mode: a slot I wired into my own config for the small, high-volume questions, pointed at Bedrock’s Haiku 4.5. It is a good model. Cheap, responsive, and it answers the kind of small, contained question I throw at it twenty times a day without complaining. It is also not mine. It runs on hardware I do not own, it can be deprecated or quietly swapped the week I have come to lean on it, and every one of those small questions hands a slice of my code to a third party.

Pi on ⎨ Saurabh Kumar ⎬

My Local LLM On A 32 GB M1 Pro