Modern accelerator such as GPU, when executes a forward pass on neural network, will it execute layer by layer? That is, will it finish ALL work of the previous layer, then starts to execute the next layer? I think the answer should also depend on software, but could someone just share some thoughts on this?
2.1m questions
2.1m answers
60 comments
57.0k users