An open API service providing repository metadata for many open source software ecosystems.

Package Usage: pypi: gensim

Python framework for fast Vector Space Modelling
91 versions
Latest release: presque 2 ans ago
426 dependent packages
4 453 375 downloads last month

View more package details: https://packages.ecosystem.code.gouv.fr/registries/pypi.org/packages/gensim

Dependent Repos 32

florilege-team/florilege-website
Florilège est un projet français d'annotation participative de RELs (Ressource Educatives Libres).

Last synced: 10 mois ago - Pushed: environ un an ago

x5gon/lamapi
X5GON Learning Analytics Machine (LAM) API

Last synced: 10 mois ago - Pushed: environ un an ago

connes-v/order_inference

Last synced: plus d'un an ago - Pushed: environ un an ago

entrepreneur-interet-general/tf-han

Size: 260 ko - Last synced: 6 jours ago - Pushed: plus de 2 ans ago

medialab/motel
Exploration tool for word embeddings.

Size: 27,3 ko - Last synced: 5 jours ago - Pushed: plus de 6 ans ago

agora-gouv/agora-nlp

Size: 112 ko - Last synced: environ 24 heures ago - Pushed: 20 jours ago

pass-culture/data-gcp
Repo pour la team data sur GCP

Size: 19,6 Mo - Last synced: 5 jours ago - Pushed: 5 jours ago

etalab-ia/ami-ia
Recense les ressources utiles au programme AMI IA

Size: 36,4 Mo - Last synced: 2 jours ago - Pushed: presque 4 ans ago

medialab/DeFacto 📦
Tools to enrich De Facto's database

Size: 16,9 Mo - Last synced: 5 jours ago - Pushed: plus de 2 ans ago

guigue/recsys

Last synced: plus d'un an ago - Pushed: environ un an ago

abes-esr/labo-indexation-ai

Size: 102 Mo - Last synced: 5 jours ago - Pushed: 5 mois ago

ecolabdata/2021-NLP_AE

Size: 204 Mo - Last synced: 5 jours ago - Pushed: presque 4 ans ago

etalab-ia/pseudonymisation_decisions_ce 📦
Temporary repo to split the pseudo livrable

Size: 25,8 Mo - Last synced: 2 jours ago - Pushed: plus de 5 ans ago

ina-foss/twembeddings
Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora

Size: 40,9 Mo - Last synced: 6 jours ago - Pushed: 23 jours ago

medialab/chatgpt-study
Exploratory research on discourses around ChatGPT and AI.

Size: 79,1 ko - Last synced: 5 jours ago - Pushed: presque 2 ans ago

21901956/deep-learning-voiture-autonome-projet-m1

Last synced: environ un an ago - Pushed: environ un an ago

remy.decoupes/covid19-tweets-mood-tetis
Extract terms from tweets about covid19 : 1/ Extact and Preprocess tweets from https://github.com/echen102/COVID-19-TweetIDs 2/ Index in elasticsearch and build H-TFIDF 3/ Analyse resultst

Last synced: environ un an ago - Pushed: environ un an ago

etalab-ia/ami-ia-dgs
Dépôt de code pour le projet AMI IA 2 de la DGS.

Size: 38,1 Mo - Last synced: 2 jours ago - Pushed: plus de 2 ans ago

adesacy/pymotifs

Last synced: plus d'un an ago - Pushed: environ un an ago

equipebd/atem
ATEM is a novel framework for studying topic evolution in scientific archives. It is based on dynamic topic modeling and dynamic graph embedding techniques that explore the dynamics of content and citations of documents within a scientific corpus. ATEM explores a new notion of contextual emergence for the discovery of emerging interdisciplinary research topics based on the dynamics of citation links in topic clusters. Our experiments show that ATEM can efficiently detect emerging cross-disciplinary topics within the DBLP archive of over five million computer science articles.

Last synced: 10 mois ago

deep-learning-applied-on-web-and-iot-security/concatenation-deep-learning-detector
The Concatenation Detector helps you to build deep learning models to detect statically web vulnerability - especially Cross-Site Scripting XSS - based on Natural Language Processing (NLP)

Last synced: 10 mois ago

E182295X/ontology-project-2023

Last synced: plus d'un an ago - Pushed: environ un an ago

advanse/glimpse-med

Last synced: 10 mois ago

cedar/statstical_mentions
Extracting statistical mentions from textual claims to provide trusted content

Last synced: 10 mois ago

cedar/excel-search

Last synced: 10 mois ago

ecrinum/graph-ta-recherche

Last synced: 10 mois ago

magnet/wordnet-embeddings
Repository for message passing based NNs over Wordnet

Last synced: 10 mois ago

cedar/connection-studio
ConnectionStudio integrates highly heterogeneous data into graphs, enriched with extracted entities. Studio users can discover the entities in their data, navigate across connections between datasets, explore and query the data in many ways. The Studio currently supports: CSV, JSON, XML, RDF, text, property graphs, all Office formats, and PDF datasets. For more information, see: https://connectionstudio.inria.fr The scientific publications behind the platform: https://team.inria.fr/cedar/connectionlens/

Last synced: 10 mois ago

mgillelevenson/tei_collator
Outil de collation automatisée TEI > TEI produit dans le cadre de ma thèse de doctorat

Last synced: plus d'un an ago - Pushed: environ un an ago

mgillelevenson/lemmatisation_xml_tei
Lemmatisation d'un fichier xml-tei avec Pie (latin médiéval), Freeling (castillan/castillan médiéval) ou CLTK (latin classique).

Last synced: plus d'un an ago - Pushed: environ un an ago

magnet/GLE_emnlp

Last synced: 10 mois ago

almanach/time-us/corpora-prep
Scripts to prepare and clean corpora.

Last synced: 10 mois ago

Inist-CNRS/ezmaster-apps
Repository containing all ezmaster applications (monorepo)

Size: 26,7 Mo - Last synced: 6 jours ago - Pushed: 4 mois ago

Inist-CNRS/web-services
Web services at Inist-CNRS

Size: 30,1 Mo - Last synced: 6 jours ago - Pushed: 6 jours ago

France-Travail/embcompare
A simple python tool for embedding comparison

Size: 27,9 Mo - Last synced: 7 jours ago - Pushed: plus d'un an ago

VIGINUM-FR/D3lta
A Python implementation of the D3lta algorithm for duplicated textual content detection

Size: 20,8 Mo - Last synced: 5 jours ago - Pushed: 17 jours ago