I find loss weight become biger when time step is smaller. But flow matching method usually train more on big time step. is it a bug or you design it?
shift as 3. Red point's time index is 900, means timestep value is small because timestep is inverse sorted.
