Why three layers?

Most AI products use one big model for everything. That's expensive for simple questions and often not enough for hard ones. We use a small, fast model for most questions and escalate to a larger reasoning model only when it's needed.