First, note that when you prove something, you usually stated that a statement is proveably from something. Suppose $\Lambda$ is a set of axioms. $\Lambda \vdash \varphi$ denotes a proof $\varphi$ using $\Lambda$. You are looking the most elegant proof of $\varphi$ from $\Lambda$. Therefore, your second point is not very meaningful since a proof of the statement from itself (thought may be short and elegant) is not a proof using $\Lambda$ is $\varphi$ is not a statement in $\lambda$. 
However, I can try to address some ways that you may be able to measure some of the characteristics that you brought up. 
1) It is possible to measure the length of the proof. Assuming you are working in some formals system (like propositional logic or first order logic) and trying to prove something from a set of axioms $\Lambda$, then you can count the length of the proof. A proof is really a sequence of statements each of which is axioms or follows from previous statements using the logical deduction rule of your formal system. The length of the proof would be the number of steps used in the proof. Note that a proof can get arbitrarily long since you can always add unnessary steps to a proof. 
2) If you are trying to prove a statement using some axioms $\Lambda$, you can also make sense of using the least number of axiom. Of course the statement $\varphi$, you are proving is already a statement in $\Lambda$, then that single axiom sufficies. However, given a proof, you can always analyze the proof to see exactly which axioms were necessary. This sort of idea is used very often by logician, especially set theory. You must have seen statement like $ZF$ proves cartesian product of sets exists, $ZFC$ proves every set can be well ordered; $ZF$ + the axiom of determinacy proves every subset of $\mathbb{R}$ is measureable.  Moreover using technique of logic, you may be able to determine if some of your axions of redundant. A well known result is that the axiom of choice is not proveable from $ZF$. Knowing which axioms are necessary for a proof can be helpful for understanding the limits of provability; however, when all the axioms of well-accepted, the proof using less axioms may be more difficult. For example, every vector has a basis can be proven using $ZF$ plus the well-ordering principle and without the power set axiom; however, the more common Zorn Lemma approach require the power set axiom. 
Your other points are somewhat subjective. These other aspect are somewhat phycological. Some people may find that a proof of a result that most people would consider to be part of Algebra or Analysis is easier to understand if it is proven using results of algebra or analysis, respectively. It would be reasonable to expect that if bunch of statement are equivalent, the form closest to statement you are trying to prove would give the easiest proof. Regarding this, there is a program in logic called reverse math that attempts to classify theorem of mathematics over very weak base system according to their logical equivalence. Some result in this areas have shown that combinatorial result, topological theorem, and algebriac theorem are equivalent over weak system of arithmetics. Though these results may be equivalent, the most evident proof would likely use the result closest to field you are working in.
Again other people may like proof that applies techniques from other areas. These results may be surprising and yield new and potentially useful connections to other fields. 
Also it is hard to say that a proof is elegant if it is understandable by particular people. Depending on background and point of view proof may be more understandable or more appealing. For instance, some results may have less general forms that are understandable by high schoolers but the proof is very long or intricate. A good example may be the intermediate and extreme value theorem. The statement of the result and proof of the result using general topology ideas like continuous function, connectedness, and compactness is much cleaner after the appropriate definition and lemmas are given. These results are then more applicable for other areas of mathematics.