An open API service providing repository metadata for many open source software ecosystems.

Package Usage: pypi: smart-open

Utils for streaming large files (S3, HDFS, GCS, Azure Blob Storage, gzip, bz2...)
62 versions
Latest release: plus d'un an ago
248 dependent packages
32 367 417 downloads last month

View more package details: https://packages.ecosystem.code.gouv.fr/registries/pypi.org/packages/smart-open

Dependent Repos 29

florilege-team/florilege-website
Florilège est un projet français d'annotation participative de RELs (Ressource Educatives Libres).

Last synced: 9 mois ago - Pushed: environ un an ago

pub/ecolab/liriae/liriae-form

Last synced: 8 mois ago - Pushed: environ un an ago

aaristov/multichip-snakemake
Process the droplet chips using snakemake pipelines.

Last synced: environ un an ago - Pushed: environ un an ago

umr-tetis/mood/mood_tweets_ner_content

Last synced: 9 mois ago - Pushed: environ un an ago

entrepreneur-interet-general/tf-han

Size: 260 ko - Last synced: 5 jours ago - Pushed: plus de 2 ans ago

medialab/keyfayqua
Qui fait quoi ? : NLP tools to detect subject-object-verb triples in French and English

Size: 111 ko - Last synced: 5 jours ago - Pushed: presque 2 ans ago

medialab/spsm-database

Size: 2,46 Mo - Last synced: 5 jours ago - Pushed: plus d'un an ago

agora-gouv/agora-nlp

Size: 102 ko - Last synced: 7 jours ago - Pushed: environ un an ago

diplomatiegouvfr/bna
Baromètre Numérique de l’Agent

Size: 81,4 Mo - Last synced: 6 jours ago - Pushed: plus de 2 ans ago

pass-culture/data-gcp
Repo pour la team data sur GCP

Size: 19,4 Mo - Last synced: 5 jours ago - Pushed: 6 jours ago

medialab/DeFacto 📦
Tools to enrich De Facto's database

Size: 16,9 Mo - Last synced: 5 jours ago - Pushed: environ 2 ans ago

ecolabdata/2021-NLP_AE

Size: 204 Mo - Last synced: 5 jours ago - Pushed: presque 4 ans ago

etalab-ia/pseudonymisation_decisions_ce 📦
Temporary repo to split the pseudo livrable

Size: 25,8 Mo - Last synced: 1 jour ago - Pushed: environ 5 ans ago

medialab/chatgpt-study
Exploratory research on discourses around ChatGPT and AI.

Size: 79,1 ko - Last synced: 5 jours ago - Pushed: presque 2 ans ago

21901956/deep-learning-voiture-autonome-projet-m1

Last synced: environ un an ago - Pushed: environ un an ago

remy.decoupes/covid19-tweets-mood-tetis
Extract terms from tweets about covid19 : 1/ Extact and Preprocess tweets from https://github.com/echen102/COVID-19-TweetIDs 2/ Index in elasticsearch and build H-TFIDF 3/ Analyse resultst

Last synced: environ un an ago - Pushed: environ un an ago

etalab-ia/ami-ia-dgs
Dépôt de code pour le projet AMI IA 2 de la DGS.

Size: 38,1 Mo - Last synced: 1 jour ago - Pushed: plus de 2 ans ago

aphp/eds-pseudo
EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports

Size: 4,78 Mo - Last synced: 5 jours ago - Pushed: 3 mois ago

almanach/lectaurep/ner
Un projet Gitlab pour rassembler le travail fait sur le NER dans le cadre de Lectaurep.

Last synced: 9 mois ago

herelles/herelles-corpora-builder
Automatic protocol for the constitution of spatio-temporal and thematic corpora for the Herelles project.

Last synced: 9 mois ago - Pushed: environ un an ago

deep-learning-applied-on-web-and-iot-security/concatenation-deep-learning-detector
The Concatenation Detector helps you to build deep learning models to detect statically web vulnerability - especially Cross-Site Scripting XSS - based on Natural Language Processing (NLP)

Last synced: 9 mois ago

cedar/statstical_mentions
Extracting statistical mentions from textual claims to provide trusted content

Last synced: 9 mois ago

cedar/excel-search

Last synced: 9 mois ago

ecrinum/graph-ta-recherche

Last synced: 9 mois ago

cedar/connection-studio
ConnectionStudio integrates highly heterogeneous data into graphs, enriched with extracted entities. Studio users can discover the entities in their data, navigate across connections between datasets, explore and query the data in many ways. The Studio currently supports: CSV, JSON, XML, RDF, text, property graphs, all Office formats, and PDF datasets. For more information, see: https://connectionstudio.inria.fr The scientific publications behind the platform: https://team.inria.fr/cedar/connectionlens/

Last synced: 9 mois ago

mgillelevenson/tei_collator
Outil de collation automatisée TEI > TEI produit dans le cadre de ma thèse de doctorat

Last synced: environ un an ago - Pushed: environ un an ago

mgillelevenson/lemmatisation_xml_tei
Lemmatisation d'un fichier xml-tei avec Pie (latin médiéval), Freeling (castillan/castillan médiéval) ou CLTK (latin classique).

Last synced: environ un an ago - Pushed: environ un an ago

methal/corpus-methal-all
Travaux du mémoire M2 de Heng Yang sur l'extraction de règles de variation orthographique dans les dialectes alsaciens sur la base d'un corpus de pièces de théâtre

Last synced: 9 mois ago