🔍 Interactive Multimodal Embedding Explorer

This project is a Streamlit-based visualization tool built as part of Google Summer of Code 2025 under the Intel Corporation (OpenVINO Toolkit).
It helps you visualize and compare CLIP-based embeddings for images and their captions.

📌 Features

Upload a dataset of images and captions
Extract CLIP embeddings (image, text, or both)
Project embeddings using PCA, UMAP, or t-SNE
Visualize Image, Text, and Joint embeddings
Explore with interactive Plotly scatter plots
Download embeddings and projections
Supports multiple joint projection methods:
- Concatenated Projection
- Averaged Pairs
- Aligned Projections
- Connected Pairs
Smart .npy-based caching to skip recomputation if dataset remains unchanged

🛠 How to Run

Clone the repository:

git clone https://github.com/your-username/embedding-explorer.git
cd embedding-explorer

Install dependencies:
```
pip install -r requirements.txt
```

3.Make sure your image dataset is placed inside data/images and the captions are in data/captions1.json.

4.Run the app:

streamlit run app.py

FOLDER STRUCTURE

.
├── app.py
├── utils/
│   └── embedding_utils.py
├── data/
│   ├── images/
│   └── captions1.json
├── embeddings/
├── projections/
├── requirements.txt
└── README.md

💡 Tech Stack

Python
Streamlit
CLIP (via HuggingFace Transformers)
PCA / UMAP / t-SNE for projection
Plotly for interactive plots

🚀 Project Motivation

This tool was built to provide researchers and practitioners a simple way to explore how multimodal embeddings behave—both separately and jointly. It was inspired by the need to visually debug and compare image and text representations.

🙋‍♀️ Author

Aarya Pandey

GSoC 2025 Contributor @ Intel (OpenVINO Toolkit)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔍 Interactive Multimodal Embedding Explorer

📌 Features

🛠 How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
embeddings		embeddings
projections		projections
utils		utils
.gitignore		.gitignore
OpenVino_GSoC2025_Proposal.pdf		OpenVino_GSoC2025_Proposal.pdf
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🔍 Interactive Multimodal Embedding Explorer

📌 Features

🛠 How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages