Discussion about this post

User's avatar
Neural Foundry's avatar

Super clean breakdown of the voyage-4 architecture. The shared vector space idea is lowkey genius because re-indexing has always been such a pain point when trying to upgrade embeddings in prodution. We ran into this exact problem last quarter when migrating from ada to text-embedding-3 and had to re-embed like 2M docs. Having that MoE architecture with 40% cost reduction while keeping accuracy is prettycrazy timing for teams scaling RAG systems.

No posts

Ready for more?