GitHub topics: embeddings
etalab-ia/mediatech
Collection of public datasets from the French administration, vectorized and ready to use in AI projects.
Language: Python - Size: 340 KB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 4 - Forks: 1
VIGINUM-FR/D3lta
A Python implementation of the D3lta algorithm for duplicated textual content detection
Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 52 - Forks: 8
ina-foss/twembeddings
Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora
Language: Jupyter Notebook - Size: 40.9 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 31 - Forks: 5
France-Travail/embcompare
A simple python tool for embedding comparison
Language: Python - Size: 27.9 MB - Last synced at: about 10 hours ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0