Neural Machine Translation and Sequence-to-sequence Models: A Tutorial (arxiv.org)
110 points by tim_sw on March 7, 2017 | 6 comments



This is a good paper for anyone interested in how modern machine translation works, at the level of detail you might get from a well-written text for a college-level CS course (which I believe is where this comes from). The paper starts with background on statistical machine translation and then works through the newer sequence-to-sequence approach to translation, including word replacement and attention mechanisms. It's a good overview.
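
(If it helps to see the attention idea in code: here's a rough, minimal sketch of dot-product attention over encoder states in plain numpy. The shapes and names are made up for illustration; it's not taken from the paper or any particular implementation.)

  import numpy as np

  def softmax(x):
      e = np.exp(x - x.max())
      return e / e.sum()

  def attention_context(decoder_state, encoder_states):
      # decoder_state: (d,), encoder_states: (T, d)
      scores = encoder_states @ decoder_state   # one score per source position
      weights = softmax(scores)                 # attention weights summing to 1
      context = weights @ encoder_states        # weighted sum of encoder states
      return context, weights

  rng = np.random.default_rng(0)
  enc = rng.normal(size=(5, 8))   # 5 source positions, hidden size 8
  dec = rng.normal(size=(8,))     # current decoder hidden state
  context, weights = attention_context(dec, enc)
  print(weights.round(3), context.shape)

The decoder repeats this at every output step, so each translated word can "look at" a different part of the source sentence.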

But if you are looking for a higher-level introduction that covers the same big ideas in ~10 minutes for a more general audience, here's my take: https://medium.com/@ageitgey/machine-learning-is-fun-part-5-...


This is one of my favorite general, high-level introductions to this area. Most people I've shared it with understand it even without deep technical knowledge. Great material!


Adam, I've been using Machine Learning Is Fun Part 1 at work to introduce non-technical business leaders to supervised machine learning concepts. Thanks for the great series!


Stephen Merity of MetaMind has a nice visual tutorial here as well: http://smerity.com/articles/2016/google_nmt_arch.html


I was lucky enough to study in the same lab as the author while he was doing his doctorate. Graham has a real talent for explaining complicated concepts in a way that's easy to understand. He's also strongly committed to putting as much of his code and data as he can online so that anyone can play around with it, including people who aren't academics.


If you're interested in a more introductory talk, I gave one a few weeks ago that covers the basics of deep learning and how TensorFlow works internally.

https://www.youtube.com/watch?v=DYlHnxfrrZY



