GitHub - THPengL/SMART

Official PyTorch Implementation of SMART

Liang Peng†, Yixuan Ye†, Cheng Liu*, Hangjun Che, Fei Wang, Zhiwen Yu, Si Wu, and Hau-San Wong. "SMART: Semantic Matching Contrastive Learning for Partially View-Aligned Clustering". IEEE TCSVT.

Abstract

Multi-view clustering has been empirically shown to improve learning performance by leveraging the inherent complementary information across multiple views of data. However, in real-world scenarios, collecting strictly aligned views is challenging, and learning from both aligned and unaligned data becomes a more practical solution. Partially View-aligned Clustering (PVC) aims to learn correspondences between misaligned view samples to better exploit the potential consistency and complementarity across views, including both aligned and unaligned data. However, most existing PVC methods fail to leverage unaligned data to capture the shared semantics among samples from the same cluster. Moreover, the inherent heterogeneity of multi-view data induces distributional shifts in representations, leading to inaccuracies in establishing meaningful correspondences between cross-view latent features and, consequently, impairing learning effectiveness. To address these challenges, we propose a Semantic MAtching contRasTive learning model (SMART) for PVC. The main idea of our approach is to alleviate the influence of cross-view distributional shifts, thereby facilitating semantic matching contrastive learning to fully exploit semantic relationships in both aligned and unaligned data. Specifically, we mitigate view distribution shifts by aligning cross-view covariance matrices, which enables the inference of a semantic graph for all data. Guided by the learned semantic graph, we further exploit semantic consistency across views through semantic matching contrastive learning. After the optimization of the above mechanisms, our model smoothly performs semantic matching for different view embeddings instead of the cumbersome view realignment, which enables the learned representations to enjoy richer category-level semantics and stronger robustness. Extensive experiments on eight benchmark datasets demonstrate that our method consistently outperforms existing approaches on the PVC problem.

Requirements

numpy==1.26.1
torch==1.12.1+cu116
tqdm==4.66.1
logging==0.5.1.2

Demo

Train a model with default settings.

python run.py

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
datasets		datasets
LICENSE		LICENSE
MyLogger.py		MyLogger.py
README.md		README.md
clustering.py		clustering.py
configure.py		configure.py
datasets.py		datasets.py
evaluation.py		evaluation.py
model.py		model.py
run.py		run.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Official PyTorch Implementation of SMART

Abstract

Requirements

Demo

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Official PyTorch Implementation of SMART

Abstract

Requirements

Demo

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages