3 Comments

You could retrain the two embedding models jointly by minimizing the L2 distance between the embeddings of the same input and maximizing it for different inputs.

This training can be achieved using a contrastive loss.
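A minimal sketch of that idea, assuming PyTorch and hypothetical model names (model_a, model_b): matching rows from the two models are treated as positive pairs whose L2 distance is minimized, while mismatched rows are pushed apart up to a margin.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(z_a, z_b, margin=1.0):
    """z_a, z_b: (batch, dim) embeddings of the SAME batch of inputs,
    one from each embedding model. Diagonal pairs are positives;
    all off-diagonal pairs are negatives."""
    dists = torch.cdist(z_a, z_b, p=2)                      # (batch, batch) L2 distances
    pos = torch.eye(z_a.size(0), dtype=torch.bool, device=z_a.device)

    pos_loss = dists[pos].pow(2).mean()                     # pull same-input embeddings together
    neg_loss = F.relu(margin - dists[~pos]).pow(2).mean()   # push different inputs apart, up to margin
    return pos_loss + neg_loss

# Hypothetical joint fine-tuning loop over the two embedding models:
# opt = torch.optim.Adam(list(model_a.parameters()) + list(model_b.parameters()), lr=1e-5)
# for batch in loader:
#     loss = contrastive_loss(model_a(batch), model_b(batch))
#     opt.zero_grad(); loss.backward(); opt.step()
```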


That's an interesting point... Thank you, AVI CHAWLA!


Would you do dimensionality reduction, say PCA, before or after concatenation?
