Minions: embracing small LMs, shifting compute on-device, and cutting cloud costs in the process

A protocol from Together AI for AI orchestration.

https://youtu.be/L-WfRaSPE2A?si=FPgevz7fNtm9gyok