fix init weights issue for critic/reward model #983
+3 −1
Merged