MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation

1University of Bonn 2Lamarr Institute for Machine Learning and Artificial Intelligence 3Khalifa University
ICCV 2025
Project teaser image

The current state-of-the-art method employs standard Mamba blocks for stochastic dense action anticipation. In contrast, we introduce the MixMamba block, where each block dynamically selects an appropriate A matrix based on the input sequence.

Abstract

We present MixANT, a novel architecture for stochastic long-term dense anticipation of human activities. While recent State Space Models (SSMs) like Mamba have shown promise through input-dependent selectivity on three key parameters, the critical forget-gate (A matrix) controlling temporal memory remains static. We address this limitation by introducing a mixture of experts approach that dynamically selects contextually relevant A matrices based on input features, enhancing representational capacity without sacrificing computational efficiency. Extensive experiments on the 50Salads, Breakfast, and Assembly101 datasets demonstrate that MixANT consistently outperforms state-of-the-art methods across all evaluation settings. Our results highlight the importance of input-dependent forget-gate mechanisms for reliable prediction of human behavior in diverse real-world scenarios.

Poster

TODO

BibTeX

@inproceedings{wasim2024mixant,
        author    = {Syed Talal Wasim and Hamid Suleman and Olga Zatsarynna and Muzammal Naseer and Juergen Gall},
        title     = {MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation},
        booktitle = {ICCV},
        year      = {2025},
    }