Discussion about this post

Neural Foundry

The 75% vs 25% breakdown is spot on. Most projects I've seen hit a wall not because of the model choice but because retrieval is garbage or they're not syncing knowledge bases properly. That MCP connection reduction from N×M to N+M is elegant; it reminds me of how message brokers simplified distributed systems years ago. The short-term memory point about ordering is underrated too: I've seen teams stuff everything at the bottom and wonder why context gets lost.
