GitHub topics: embeddings
etalab-ia/mediatech
Collection of public datasets from the French administration, vectorized and ready to use in AI projects.
language: Python - size: 340 kB - last synced: about 22 hours ago - registered: 3 days ago - stars: 4 - forks: 1
VIGINUM-FR/D3lta
A Python implementation of the D3lta algorithm for duplicated textual content detection
language: Jupyter Notebook - size: 20.8 MB - last synced: 7 days ago - registered: 4 months ago - stars: 52 - forks: 8
ina-foss/twembeddings
Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora
language: Jupyter Notebook - size: 40.9 MB - last synced: 6 days ago - registered: 4 months ago - stars: 31 - forks: 5
France-Travail/embcompare
A simple Python tool for embedding comparison
language: Python - size: 27.9 MB - last synced: 7 days ago - registered: more than a year ago - stars: 7 - forks: 0