You almost certainly can formally believe the equivalence of the two formulations if you have seen the proof of the Green's identity, right? However, when you "balance the derivatives" of the solution $u$ and the test function $v$ as you have done (in the beginning $u$ has two derivatives and $v$ zero, and in the weak formulation both have one derivatives and are, thus, balanced out), you actually end up requiring less smoothness from the solution $u$.

This implies that solutions to the weak problem are not, in general, equivalent to the solutions of the strong problem. The weak solution is equivalent to the strong solution only if $q \in L^2(\Omega)$ and the boundary of the domain satisfies some smoothness criteria. See, e.g., the wonderful book of Lawrence Evans, Partial Differential Equations, for more information on this matter.

But why do we have to change it? Why can't we solve the system as originally stated?

This is something which you will realize when implementing the finite element solver. It is much more convenient to have only single derivative in the weak formulation than two derivatives since then the finite element is only required to be $H^1$-conforming. (Intuitively, second derivative of piecewise-linear function is zero just as you mentioned and therefore the stiffness matrix would be zero.) You could solve the problem directly with $H^2$-conforming elements but those are much more harder to construct.

Furthermore, if the weak formulation is symmetric then you end up having a symmetric stiffness matrix which is faster to solve.