An open API service providing repository metadata for many open source software ecosystems.

Topic: "embeddings"

VIGINUM-FR/D3lta

A Python implementation of the D3lta algorithm for duplicated textual content detection

Language: Jupyter Notebook - Size: 20.8 MB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 59 - Forks: 8

ina-foss/twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora

Language: Jupyter Notebook - Size: 40.9 MB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 33 - Forks: 5

etalab-ia/mediatech

Collection of public datasets from the French administration, vectorized and ready to use in AI projects.

Language: Python - Size: 486 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 4

France-Travail/embcompare 📦

A simple python tool for embedding comparison

Language: Python - Size: 27.9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

lias-laboratory/asos-crm

Automated semantic scoping for CIDOC CRM using multilingual embeddings and OWL validation for LLM-driven RDF extraction.

Language: Jupyter Notebook - Size: 286 KB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0