Facial Spatiotemporal Graphs: Leveraging the 3D Facial Surface for Remote Physiological Measurement

1Australian National University, Canberra, Australia 2Data61, Commonwealth and Scientific Industrial Research Organization, Canberra, Australia 3University of Turku, Turku, Finland

Abstract

Facial remote photoplethysmography (rPPG) methods estimate physiological signals by modeling subtle color changes on the 3D facial surface over time. However, existing methods fail to explicitly align their receptive fields with the 3D facial surface—the spatial support of the rPPG signal. To address this, we propose Facial Spatiotemporal Graphs (STGraphs), a novel representation that encodes facial color and structure using 3D facial mesh sequences-enabling surface-aligned spatiotemporal processing. We introduce MeshPhys, a lightweight spatiotemporal graph convolutional network that operates on the STGraph to estimate physiological signals. Across four benchmark datasets, MeshPhys achieves state-of-the-art or competitive performance in both intra- and cross-dataset settings. Ablation studies show that constraining the model’s receptive field to the facial surface acts as a strong structural prior, and that surface-aligned, 3D-aware node features are critical for robustly encoding facial surface color. Together, STGraphs and MeshPhys constitute a novel, principled modeling paradigm for facial rPPG, enabling robust, interpretable, and generalizable estimation.

BibTeX


      @article{cantrill2025facialstgraph,
        author    = {Cantrill, Sam and Ahmedt-Aristizabal, David and Petersson, Lars and Suominen, Hanna and Armin, Mohammad Ali},
        title     = {Facial Spatiotemporal Graphs: Leveraging the 3D Facial Surface for Remote Physiological Measurement},
        year      = {2026},
      }