Document Embedding (DocEm)

This repository contains the code and notebooks for tutorials on:

Extracting embedding features from COCO pictures using the ResNeXt model developed by Facebook AI.
Visualizing the picture embedding vectors in 3D space using PCA and t-SNE.
Finding the nearest neighbors of each picture based on cosine distance.
Reducing the dimensionality of the embedding space with UMAP while preserving manifold structure.
Finding the optimal number of GMM clusters using the BIC elbow method and silhouette analysis.
Visualizing the pictures closest to each centroid to identify the cluster topic.
Applying an adapted version of the p-SIF (partition averaging) algorithm to produce document embeddings from the bag-of-words model and the original picture embedding vectors.
Testing the effectiveness of the proposed method against baseline document-averaging methods (weighted averaging and TF-IDF).
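
As a rough illustration of the nearest-neighbor step, the sketch below ranks vectors by cosine distance using only the Python standard library. The function names (cosine_distance, nearest_neighbors) and the toy vectors are assumptions made for this example; they are not taken from the notebook, which works on real ResNeXt embeddings and may rely on a library implementation instead.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cos(u, v) for two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def nearest_neighbors(embeddings, k=1):
    """For each vector, return the indices of its k closest other vectors."""
    result = []
    for i, query in enumerate(embeddings):
        # Exclude the query itself, then sort the rest by distance to it.
        others = [j for j in range(len(embeddings)) if j != i]
        others.sort(key=lambda j: cosine_distance(query, embeddings[j]))
        result.append(others[:k])
    return result


# Toy 3-dimensional "embeddings" (made-up values, not ResNeXt output).
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.9, 0.1],
]
neighbors = nearest_neighbors(toy_embeddings, k=1)
print(neighbors)
```

This brute-force version compares every pair of vectors, which is fine for a toy example but scales quadratically; for a full image collection an indexed or vectorized search would be the more practical choice.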

Overview of the p-SIF algorithm

Algorithm overview diagram (p-sif-overview.png).
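
The partition-averaging idea can be sketched in a few lines: each item's embedding is scaled by its importance weight and by its membership probability in each cluster, the per-cluster sums are concatenated, and the result is averaged over the document. This is a minimal sketch under stated assumptions, not the notebook's adapted algorithm: the function name partition_average, its arguments, and the toy data are all hypothetical, and any further post-processing used in the full method is omitted.

```python
def partition_average(doc_items, vectors, memberships, weights):
    """Build one document vector from K concatenated per-cluster averages.

    doc_items:   list of item ids in the document (the bag of words)
    vectors:     item id -> embedding (list of d floats)
    memberships: item id -> list of K cluster probabilities (e.g. from a GMM)
    weights:     item id -> scalar importance weight (e.g. a TF-IDF-style weight)
    """
    first = doc_items[0]
    dim = len(vectors[first])
    num_clusters = len(memberships[first])
    doc_vector = [0.0] * (num_clusters * dim)
    for item in doc_items:
        for k in range(num_clusters):
            # Contribution of this item to the k-th cluster block.
            scale = weights[item] * memberships[item][k]
            for j in range(dim):
                doc_vector[k * dim + j] += scale * vectors[item][j]
    return [x / len(doc_items) for x in doc_vector]


# Hypothetical toy data: two items, two clusters, 2-dimensional embeddings.
vectors = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
memberships = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
weights = {"cat": 1.0, "dog": 1.0}

doc_vector = partition_average(["cat", "dog"], vectors, memberships, weights)
print(doc_vector)
```

With K clusters and d-dimensional embeddings the output has K × d dimensions. In the toy case, plain averaging would blend both items into a single 2-dimensional vector, whereas the partitioned version keeps each cluster's contribution in its own block.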

You can view and run the development notebook (Deep_picture_embedding_features_COCO_ResNext.ipynb) in Colab.
