Sequence-to-sequence models
英文原文文件:seq2seq.md
计算解释
Encoder-decoder 序列建模,在 Transformer 令 dense attention 成为主流之前,利用 GPU 并行加速的 recurrent 工作负载。
支撑阅读卡
- Neural Machine Translation by Jointly Learning to Align and Translate (2014,
single_gpu_deep_learning) - Sequence to Sequence Learning with Neural Networks (2014,
single_gpu_deep_learning) - Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (2016,
multi_gpu_dense_training)
后续计算范式下过时或退居次要的内容
仅通过已链接的阅读卡追踪,不将本方法页视为独立证据来源。