Hello Dr.Zhong, thank you for your excellent work. I'm very interested in what you mentioned in section 3.3
we update the running mean μ and variance σ and yet fix the learnable linear transformation parameters α and β for better normalization in Stage-2.
But, I cannot find the implementation in your code. If you are available, can you tell me the exact location?
Wish you good health and success in your studies!