I would like to change the network structure in your code, for example by adding or removing MLPs or adding branches. The input dimension would change as well, for example from (N, 3) to (N, 3, 3). Which parts of the CUDA code would I need to modify?