JZC-002 [Paper] Learning to Reason with Third-Order Tensor Products Oct 4, 2024 Transformer 用三维张量作为RNN的hidden state,意图在于记忆节点之间的关系;update的时候分为三步骤,依次做write, move, backlink.