Not the parent, but the imperative interface enabled by PyTorch's dynamic graph approach is much nicer.
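For anyone who hasn't tried it, here's a toy sketch (my own, not from any particular tutorial) of what "imperative" means in practice: ordinary Python control flow and print() work inside the computation, because the graph is rebuilt on every forward pass.

```python
import torch

x = torch.randn(3, requires_grad=True)

# A loop whose trip count depends on runtime values -- awkward to
# express in a static-graph framework, trivial in an imperative one.
y = x * 2
while y.norm() < 10:
    y = y * 2

print(y)            # inspect intermediates with a plain print()
y.sum().backward()  # autograd traces the dynamic loop just fine
print(x.grad)
```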
Additionally, in my opinion TensorFlow is often too low-level and Keras often too high-level for the things I'm trying to do in research. You can of course jump between the two, but I think PyTorch hits a much more natural middle ground with its API (rough sketch below).
TensorFlow/Keras is making improvements in these areas with eager execution, and is still great for putting models into production, but I think PyTorch is much better for doing research or toying with new concepts.
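To make the "middle ground" point concrete, here's a minimal training loop (a toy sketch of mine, with made-up sizes): the loop itself is explicit, unlike Keras's fit(), but autograd and torch.optim hide the graph and gradient plumbing.

```python
import torch
import torch.nn as nn

# Toy model and random data, purely illustrative.
model = nn.Linear(10, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()
x, y = torch.randn(64, 10), torch.randn(64, 1)

for step in range(100):
    opt.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()  # gradients are computed for you...
    opt.step()       # ...but you control the loop itself
```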
No hard benchmarks, just personal experience. Note that I'm not saying regular TensorFlow is slower than PyTorch (in fact I've found them to be roughly the same), just eager mode.
Edit: Just realized this might be a good thing to write a blog post about. I’ll get back to you after finals :)
Seconded. Their courses are superb, and they have their own library built on top of PyTorch that makes creating high-quality models even easier.
You start with their library, and they gradually teach you the techniques it uses, so the easy black box you begin with becomes more transparent over time. It's a hands-on, code-first approach.
This is an interesting introduction to writing your own neural network models from scratch in PyTorch.
I don't think it's a great way to learn it, though - almost no one writes their own RNN models from scratch.
Almost all the time you want to use one of the pre-written RNN modules (nn.RNN, nn.LSTM, nn.GRU), since they're optimized, debugged, and do things like use cuDNN where available.
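For concreteness, this is roughly what that looks like (a minimal sketch with made-up sizes; on a GPU, nn.LSTM dispatches to cuDNN kernels automatically):

```python
import torch
import torch.nn as nn

# Built-in LSTM instead of a hand-rolled recurrent cell.
lstm = nn.LSTM(input_size=50, hidden_size=128, batch_first=True)

batch = torch.randn(32, 20, 50)   # (batch, seq_len, features)
output, (h_n, c_n) = lstm(batch)  # output: (32, 20, 128)
print(output.shape)
```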
It's an educational post. Calling something "pretty trivial" doesn't reduce its value for people who don't know what you know and want to learn it.
That wasn't my point. How can a post teach recurrent networks if the "recurrent" part of it is redundant and the network would work perfectly well without it?