A theoretical framework for self-supervised contrastive learning for continuous dependent data

  • 2025-09-02 09:59:25
  • Alexander Marusov, Aleksandr Yugay, Alexey Zaytsev
  • 0

Abstract

Self-supervised learning (SSL) has emerged as a powerful approach to learningrepresentations, particularly in the field of computer vision. However, itsapplication to dependent data, such as temporal and spatio-temporal domains,remains underexplored. Besides, traditional contrastive SSL methods oftenassume \emph{semantic independence between samples}, which does not hold fordependent data exhibiting complex correlations. We propose a novel theoreticalframework for contrastive SSL tailored to \emph{continuous dependent data},which allows the nearest samples to be semantically close to each other. Inparticular, we propose two possible \textit{ground truth similarity measures}between objects -- \emph{hard} and \emph{soft} closeness. Under it, we derivean analytical form for the \textit{estimated similarity matrix} thataccommodates both types of closeness between samples, thereby introducingdependency-aware loss functions. We validate our approach, \emph{DependentTS2Vec}, on temporal and spatio-temporal downstream problems. Given thedependency patterns presented in the data, our approach surpasses modern onesfor dependent data, highlighting the effectiveness of our theoreticallygrounded loss functions for SSL in capturing spatio-temporal dependencies.Specifically, we outperform TS2Vec on the standard UEA and UCR benchmarks, withaccuracy improvements of $4.17$\% and $2.08$\%, respectively. Furthermore, onthe drought classification task, which involves complex spatio-temporalpatterns, our method achieves a $7$\% higher ROC-AUC score.

 

Quick Read (beta)

loading the full paper ...