Any researching on keeping Weights linear independent during training? I am thinking that no matter how you transform features on high dimensional space, if these features are independent, the vector representation should also be linear independent and these vectors span the problem space.
发自「今日水木 on iPhone 8」
--
FROM 114.250.31.*