[Answer.AI] Small but Mighty: Introducing answerai-colbert-small

Say hello to answerai-colbert-small-v1, a tiny ColBERT model that punches well above its weight.

August 13, 2024 · Ben

[Answer.AI] JaColBERTv2.5🇯🇵: Optimising Retrieval Training for Lower-Resources Languages

Introducing JaColBERTv2.5🇯🇵, the new best Japanese retrieval model. Through this release, we present a thorough analysis to better understand what helps in training a good multi-vector retrieval model.

August 2, 2024 · Ben

[Answer.AI] A little pooling goes a long way for multi-vector representations

Blog post offering a quick overview of ColBERT and how it works, and introducing an efficient pooling trick to alleviate the issues it faces.

April 8, 2024 · Ben

Questions & Answer(s): thoughts and joining Answer.AI

If you’re old school, you can find a raw HTML version of this post here This is a fairly long, stream-of-thought post about how I currently (Sunday Feb 4, 2024) feel about the broader ML/NLP/IR ecosystem and its future. Everything here’s on-the-fly opinion and can & will change. In summary: I think ML developments are incredibly exciting, and we need to continue to work on bridging the gap between ML-as-a-commodity-for-ML-practitioners to ML-as-a-commodity-for-everyone....

February 6, 2024 · Ben

Fio

日本語版は近日公開予定です(日本語を勉強中なので、間違いはご容赦ください!) Welcome to this stream-of-thoughts report on fio-base-japanese-v0.1, the first version of Fio, a family of Japanese sentence embeddings. These are all notes I took while training the models, vaguely ordered in relevant categories! I hope that they can be useful to anyone interested in Japanese embeddings. In short: Fio-v0.1 is currently (18/12/2023) the best similarity sentence embeddings model for Japanese, as well as the best overall monolingual model (similarity + retrieval)....

December 18, 2023 · Ben