Conference Proceedings
Partners in crime: Multi-view sequential inference for movie understanding
N Papasarantopoulos, Lea Frermann, M Lapata, SB Cohen
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing | Association for Computational Linguistics | Published: 2019
DOI: 10.18653/v1/D19-1212
Abstract
Multi-view learning algorithms are powerful representation learning tools, often exploited in the context of multimodal problems. However, for problems requiring inference at the token level of a sequence (that is, a separate prediction must be made for every time step), it is often the case that single-view systems are used, or that multiple views are fused in a simple manner. We describe an incremental neural architecture paired with a novel training objective for incremental inference. The network operates on multi-view data. We demonstrate the effectiveness of our approach on the problem of predicting perpetrators in crime drama series, for which our model significantly outperforms ..