Right now Wikipedia defines it as:
Let $E\rightarrow M$ be a smooth vector bundle over a differentiable manifold $M$. Let $\Gamma(E)$ be the space of all smooth sections. A connection on $E$ is a $\mathbb{R}$-linear map $\nabla:\Gamma(E)\rightarrow\Gamma(E\otimes T^*M)$ such that $$\nabla(\sigma f)=\nabla(\sigma)f+\sigma \otimes df$$ for all smooth functions $f$ on $M$ and all sections $\sigma$.
What I don't understand is how $\sigma f$ is defined. If $\sigma\in\Gamma(E)$ and $f\in C^\infty(M)$, why would $\sigma f$ be in $\Gamma(E)$? For that matter, I also don't understand how $\nabla(\sigma)f$ is defined.