← 返回方法列表

Sequence-to-sequence models

英文原文文件：seq2seq.md

计算解释

Encoder-decoder 序列建模，在 Transformer 令 dense attention 成为主流之前，利用 GPU 并行加速的 recurrent 工作负载。

支撑阅读卡

Neural Machine Translation by Jointly Learning to Align and Translate (2014, single_gpu_deep_learning)
Sequence to Sequence Learning with Neural Networks (2014, single_gpu_deep_learning)
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (2016, multi_gpu_dense_training)

后续计算范式下过时或退居次要的内容

仅通过已链接的阅读卡追踪，不将本方法页视为独立证据来源。