📦 Full runnable example: github.com/sm1ck/honeychat/tree/main/tutorial/02-routing (docker compose up exposes POST /complete on localhost:8000). Every snippet below is pulled from that repo.

Most introductory "chat with AI" tutorials pick one model and call it a day.

That works in a toy. It stops being enough in production, where users have different price sensitivity, different conversation sty