
Conversation

@xiw9 commented May 21, 2015

A fast unrolled LSTM layer, similar to BVLC/caffe#1873.
It needs two inputs: the input sequence nodes_in[0] and the corresponding sequence labels nodes_in[1].
nodes_in[0] size: [batch_size][1][1][input_width]
nodes_in[1] size: [batch_size][1][1][1]
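For reference, a minimal NumPy sketch of the two input blobs (the array names are illustrative; batch_size and input_width are taken from the example config below):

import numpy as np

batch_size, input_width = 512, 4096

# nodes_in[0]: one frame of the input sequence per batch element
seq_input = np.zeros((batch_size, 1, 1, input_width), dtype=np.float32)

# nodes_in[1]: the corresponding label, one scalar per batch element
seq_label = np.zeros((batch_size, 1, 1, 1), dtype=np.float32)
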
Example config file:

data = train
iter = csv
  filename = "...."
  has_header = 0
iter = attachtxt
  txtfilename = "...."
iter = threadbuffer
  buffer_size = 4
iter = end

eval = val
iter = csv
  filename = "...."
  has_header = 0
iter = attachtxt
  txtfilename = "...."
iter = threadbuffer
  buffer_size = 4
iter = end

extra_data_shape[0] = 1,1,1
extra_data_num = 1

netconfig=start
layer[in,in_1->2] = lstm:lstm1
  nhidden = 1024
  parallel_size = 8
layer[2,in_1->3] = lstm:lstm2
  nhidden = 512
  parallel_size = 8
layer[3->4] = fullc:fc1
  nhidden = 51
layer[4->4] = softmax:softmax1
netconfig=end

# evaluation metric
metric = error

max_round = 40
num_round = 40

# input shape not including batch
input_shape = 1,1,4096

batch_size = 512
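
For intuition, here is a minimal NumPy sketch of the forward pass the netconfig above describes (lstm1 with 1024 hidden units, lstm2 with 512, fc1 to 51 classes, then softmax). It is an illustrative re-implementation, not the cxxnet layer itself: the weight names are made up and parallel_size (the batched-BPTT unroll) is not modeled.

import numpy as np

def lstm_step(x, h, c, W, b):
    # One LSTM time step: input, forget, output gates and candidate from [x, h].
    z = np.concatenate([x, h], axis=1) @ W + b            # [batch, 4 * nhidden]
    i, f, o, g = np.split(z, 4, axis=1)
    i, f, o = 1 / (1 + np.exp(-i)), 1 / (1 + np.exp(-f)), 1 / (1 + np.exp(-o))
    g = np.tanh(g)
    c = f * c + i * g
    h = o * np.tanh(c)
    return h, c

def softmax(x):
    e = np.exp(x - x.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

batch_size, input_width, n_classes = 512, 4096, 51
h1 = np.zeros((batch_size, 1024), dtype=np.float32); c1 = h1.copy()
h2 = np.zeros((batch_size, 512), dtype=np.float32); c2 = h2.copy()

rng = np.random.default_rng(0)
W1 = rng.standard_normal((input_width + 1024, 4 * 1024)).astype(np.float32) * 0.01
b1 = np.zeros(4 * 1024, dtype=np.float32)
W2 = rng.standard_normal((1024 + 512, 4 * 512)).astype(np.float32) * 0.01
b2 = np.zeros(4 * 512, dtype=np.float32)
Wfc = rng.standard_normal((512, n_classes)).astype(np.float32) * 0.01
bfc = np.zeros(n_classes, dtype=np.float32)

x = rng.standard_normal((batch_size, input_width)).astype(np.float32)  # one input frame
h1, c1 = lstm_step(x, h1, c1, W1, b1)    # lstm1, nhidden = 1024
h2, c2 = lstm_step(h1, h2, c2, W2, b2)   # lstm2, nhidden = 512
probs = softmax(h2 @ Wfc + bfc)          # fc1 + softmax1, 51 classes

Each call advances the recurrent state by one frame, which is how an unrolled LSTM with truncated BPTT steps through a sequence.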

@xiw9 (Author) commented

Conflict between the txt iter and the csv iter:

data = train
iter = csv
  filename = "............"
  has_header = 0
iter = attachtxt
  txtfilename = "............."
iter = threadbuffer
  buffer_size = 4
iter = end
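
For clarity, a hypothetical sketch of the two files the csv + attachtxt iterator chain consumes: a headerless CSV with one input_width-wide row per frame, and a txt file with one label per line. The file names, sizes, and label format here are assumptions for illustration, not part of the PR.

import numpy as np

n_frames, input_width = 4, 8            # tiny sizes just for illustration

# features.csv: has_header = 0, one frame of input_width values per row
features = np.random.rand(n_frames, input_width).astype(np.float32)
np.savetxt("features.csv", features, delimiter=",", fmt="%.6f")

# labels.txt: one label per row, aligned with the rows of features.csv
labels = np.random.randint(0, 51, size=n_frames)
np.savetxt("labels.txt", labels, fmt="%d")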

@xiw9 changed the title from "unrolled LSTM layer with batched BPTT" to "unrolled LSTM layer with batch BPTT" on May 21, 2015
@antinucleon (Contributor) commented
Thanks for your PR. I am still working on the general case of LSTM, which will be able to run CNN + LSTM together. In that branch we have special text IO for sequences. However, I still have some bugs to fix. Would you be interested in working together on that branch?
