LSTM: Move Wx matrix multiplication out of the loop in forward by antihutka · Pull Request #187 · jcjohnson/torch-rnn

antihutka · 2017-04-28T09:57:00Z

Move one of the addmm calls (the multiplication by Wx) out of the per-timestep loop in forward and do it in a single call across all timesteps. This should give a significant speedup when running with a small batch_size.
I measured a 10-20% speedup with batch_size=8 on CPU, but I can't test it on GPU at the moment.
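
To illustrate the idea, here is a minimal sketch in plain Python (not the Lua/Torch code this PR touches). It assumes the input projection `x_t * Wx + b` does not depend on the previous hidden state, so it can be computed for every timestep in one stacked multiplication. The recurrent term would still have to stay inside the loop. The names `input_proj_per_step` and `input_proj_hoisted`, the time-major list layout, and the toy sizes are all hypothetical and chosen only for illustration.

```python
import random


def matmul(a, b):
    """Multiply an (n x k) matrix by a (k x m) matrix, both stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def add_bias(m, bias):
    """Add a bias vector to every row of a matrix."""
    return [[v + b for v, b in zip(row, bias)] for row in m]


def input_proj_per_step(x, Wx, b):
    """One matrix multiplication per timestep (the in-loop version).

    x is a list of T timesteps, each an (N x D) matrix.
    """
    return [add_bias(matmul(x_t, Wx), b) for x_t in x]


def input_proj_hoisted(x, Wx, b):
    """One matrix multiplication for all timesteps (the hoisted version).

    Stacks the T blocks of N rows into a single (T*N x D) matrix, multiplies
    once, then splits the result back into T blocks of N rows.
    """
    T, N = len(x), len(x[0])
    flat = [row for x_t in x for row in x_t]
    out = add_bias(matmul(flat, Wx), b)
    return [out[t * N:(t + 1) * N] for t in range(T)]


# Toy sizes (assumptions): T timesteps, batch N, input size D, gate width G.
T, N, D, G = 5, 2, 3, 8
rng = random.Random(0)
x = [[[rng.uniform(-1, 1) for _ in range(D)] for _ in range(N)] for _ in range(T)]
Wx = [[rng.uniform(-1, 1) for _ in range(G)] for _ in range(D)]
b = [rng.uniform(-1, 1) for _ in range(G)]

per_step = input_proj_per_step(x, Wx, b)
hoisted = input_proj_hoisted(x, Wx, b)

# Largest absolute difference between the two results.
max_diff = max(
    abs(p - h)
    for ps, hs in zip(per_step, hoisted)
    for prow, hrow in zip(ps, hs)
    for p, h in zip(prow, hrow)
)
```

Both versions do the same arithmetic. The hoisted one replaces T small multiplications with a single larger one, which is where a gain at small batch sizes would be expected to come from, since each small call carries fixed overhead.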

dgcrouse · 2017-04-28T15:27:42Z

I can test GPU execution on CUDA this weekend. Can someone check OpenCL?

LSTM: Move Wx matrix multiplication out of the loop in forward

1fbdc5b

antihutka mentioned this pull request Jun 2, 2017

Any info for tweaking training settings for those with little background in LSTMs? #196

Open

antihutka wants to merge 1 commit into jcjohnson:master from antihutka:lstm_speedup
