Tensorflow: Getting All States From A Rnn
Solution 1:
tf.nn.dynamic_rnn(also tf.nn.static_rnn) has two return values; "outputs", "state" (https://www.tensorflow.org/api_docs/python/tf/nn/dynamic_rnn)
As you said, "state" is the final state of RNN, but "outputs" are all hidden states of RNN(which shape is [batch_size, max_time, cell.output_size])
You can use "outputs" as hidden states of RNN, because in most library-provided RNNCell, "output" and "state" are same. (except LSTMCell)
Solution 2:
I've already created a PR here and it might help you deal with simple cases
Let me briefly explain my implementation, so you can write your own version if you need. The main part is the modification of the _time_step
function:
def _time_step(time, output_ta_t, state, *args):
The parameters remain the same except an extra *args
is passed in. But why args
? That's because I want to support tensorflow's customary behavior. You are able to return the final state only by simply ignoring the args
parameter:
if states_ta is not None:
# If you want to return all states, set `args` to be `states_ta`
loop_vars = (time, output_ta, state, states_ta)
else:
# If you want the final state only, ignore `args`
loop_vars = (time, output_ta, state)
How to make use of it?
if args:
args = tuple(
ta.write(time, out) for ta, out in zip(args[0], [new_state])
)
In fact this is just a modification of the following (original) codes:
output_ta_t = tuple(
ta.write(time, out) for ta, out in zip(output_ta_t, output)
)
Now the args
should contain all the states you want.
After all the works done above, you can pick up the states (or the final state) with following codes:
_, output_final_ta, *state_info = control_flow_ops.while_loop( ...
and
if states_ta isnotNone:
final_state, states_final_ta = state_info
else:
final_state, states_final_ta = state_info[0], None
Although I haven't tested it in complicated cases, it should work under 'simple' condition (here's my test cases)
Post a Comment for "Tensorflow: Getting All States From A Rnn"