[personal profile] dmm
arxiv.org/abs/1909.10893

Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are only updated at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
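To make the "only updated at time steps where they are most relevant" part concrete, here is a toy sketch in plain Python. It is not the paper's implementation. The names `step`, `top_k_active`, and `queries` are made up for illustration, the relevance score is a bare dot product standing in for the paper's input attention, and the `tanh` transition is a placeholder for a real recurrent cell. The attention-based communication between modules is left out entirely.

```python
import math


def dot(a, b):
    """Plain dot product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def top_k_active(scores, k):
    """Return the indices of the k highest scores, in ascending index order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def step(states, queries, x, k):
    """One toy time step (an assumption-laden sketch, not the paper's update).

    states  : list of per-module hidden states (lists of floats)
    queries : one hypothetical query vector per module, used to score the input
    x       : the current input vector
    k       : how many modules are allowed to update at this step
    """
    # Score how relevant the input is to each module.
    scores = [dot(q, x) for q in queries]
    active = top_k_active(scores, k)

    new_states = []
    for i, h in enumerate(states):
        if i in active:
            # Placeholder transition; a real model would use its own recurrent cell here.
            new_states.append([math.tanh(hj + xj) for hj, xj in zip(h, x)])
        else:
            # Inactive modules keep their state unchanged.
            new_states.append(list(h))
    return new_states, active


states = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
queries = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
new_states, active = step(states, queries, [2.0, 1.0], k=2)
```

With these made-up inputs the scores are 2, 1, and -3, so modules 0 and 1 update and module 2 keeps its state. The point of the sketch is only the gating pattern: each module has its own state, and at each step most of them can be left untouched.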



After the first two pages, particularly interesting tidbits are in Section 2.3, Section 3, and Appendix A.




Dataflow matrix machines (by Anhinga anhinga)
