Publications

You can also find my articles on my Google Scholar profile.

Click here for an overview of my research.
My PhD research was at the intersection of Mathematics and Machine Learning for time series analysis. More precisely, my publications are on symbolic representations and distances between univariate and multivariate time series. My PhD manuscript [T1] sums up my research and contains 2 literature reviews, 2 major contributions (ASTRIDE [IC3] and $d_{symb}$ [IC1]), 1 interactive tool on Streamlit (the $d_{symb}$ playground [IC2]) and 1 open-access data set (the arm-CODA data set [J1]). The abstract of my manuscript is as follows.

The objectives of this thesis are to define novel symbolic representations and distance measures that are suited for time series that can be multivariate and non-stationary. In addition, they should preserve the time information, be interpretable, and fast to compute. We review symbolic representations of time series (that transform a real-valued series into a shorter discrete-valued series), as well as distance measures on time series, strings, and symbolic sequences (that result from a symbolization process).

We propose two contributions: ASTRIDE for a data set of univariate time series, and $d_{symb}$ for a data set of multivariate time series. We also developed the $d_{symb}$ playground, an online interactive tool that allows users to apply $d_{symb}$ to their uploaded data. ASTRIDE and $d_{symb}$ are data-driven as they use change-point detection for the segmentation step, then either quantiles or a $K$-means clustering algorithm for the quantization step. Finally, they apply the general edit distance with custom costs between the resulting symbolic sequences.

We show the performance of ASTRIDE compared to 4 other symbolic representations on reconstruction and, when applicable, on classification tasks. For $d_{symb}$, experiments show how interpretable the symbolization is. Moreover, compared to 9 elastic distances on a clustering task, $d_{symb}$ achieves a competitive performance while being several orders of magnitude faster.

PhD manuscript

T1Sylvain Combettes. Symbolic representations of time series. Machine Learning [cs.LG]. Université Paris-Saclay, 2024. English. NNT : 2024UPASM002. tel-04573912.
Links: theses.fr / HAL / manuscript PDF / defense slides PDF.

International conference papers

IC3S. W. Combettes, C. Truong, and L. Oudre. "Symbolic representation for time series." To appear in Proceedings of the European Signal Processing Conference (EUSIPCO), Lyon, France, 2024.
Method name: ASTRIDE.
Links: paper / paper PDF / poster PDF / GitHub / EUSIPCO 2024.
IC2S. W. Combettes, P. Boniol, C. Truong, and L. Oudre. "d_{symb} playground: an interactive tool to explore large multivariate time series datasets." In Proceedings of the International Conference on Data Engineering (ICDE), Utrecht, Netherlands, 2024.
Method name: $d_{symb}$ playground.
Links: paper / paper PDF / poster PDF / GitHub / Streamlit app / 4 min YouTube video / ICDE 2024 / ICDE 2024 demos.
IC1S. W. Combettes, C. Truong, and L. Oudre. "An Interpretable Distance Measure for Multivariate Non-Stationary Physiological Signals." In Proceedings of the International Conference on Data Mining Workshops (ICDMW), Shanghai, China, 2023.
Method name: $d_{symb}$.
Links: paper / paper PDF / slides PDF / poster PDF / GitHub / ICDM 2023 / AI4TS workshop.

National conference papers

NC1S. W. Combettes, C. Truong, and L. Oudre. "SAX-DD: une nouvelle représentation symbolique pour séries temporelles" In Proceedings of the Groupe de Recherche et d’Etudes En Traitement Du Signal et Des Images (GRETSI), Nancy, France, 2022.
Method name: SAX-DD.
Links: paper PDF / GRETSI 2022.

Journal papers

J1S. W. Combettes, P. Boniol, A. Mazarguil, D. Wang, D. Vaquero-Ramos, M. Chauveau, L. Oudre, N. Vayatis, P.-P. Vidal, A. Roren, and M.-M. Lefèvre-Colau. "Arm-CODA: A Data Set of Upper-limb Human Movement During Routine Examination." Image Processing On Line (IPOL), 14:1-13, 2024.
Data set name: Arm-CODA.
Links: paper / paper PDF / demo.

Preprints

PR1S. W. Combettes, C. Truong, and L. Oudre. "ASTRIDE: Adaptive Symbolization for Time Series Databases." arXiv, abs/2302.04097, 2023.
Method name: ASTRIDE.
Links: paper / paper PDF / GitHub.