1

ENAS implementation here

I'm trying to understand how the parameter sharing works in ENAS. The first two questions are there partially to answer the third main question.

  1. Are all nodes only used ONCE during macro search?
  2. For macro search, will all the nodes definitely link to its previous node?
  3. How are the parameters shared? Does each operations have their own weights, which are always loaded when called? If this is the case, then which weight to update and memorize during training, assuming multiple instances of the same operation is used. Or are there weights for each unique connection, e.g. Node1 to Node3 (W13) has one weight set, Node2 to Node3 (W23) has another weight set. If so, then how does it handle cases when there are skip connections (e.g. Node1 and Node2 are concatenated, which are then passed to Node 3. Will it have W12-3?)?
4

1 回答 1

0

我已经阅读了很多次代码,所以我想我会自己回答这些,以防将来有人看到。

  1. 不,它们可以出现多次。
  2. 是的。
  3. 存储每一层中所有可能节点的权重,在初始化新网络时从中提取权重。
于 2018-05-23T02:12:42.303 回答