Suppose there are $n$ points in $\mathbb R^2$. So we have ${n\choose 2}$ line segments joining pairs of these points. How do I find a straight line such that the angles between this straight line and all these line segments are not very high?
The question, as you can see, is not very clear, and I cannot improve it, since this is verbatim what was asked in an admissions interview. Of course, one can ask what I mean by "not very high". I apologise; I do not know what it means.
A hint was given: "Think about projections, inner product, etc."
Still, as with fitting a least-squares regression line, one can try to make sense of it. Here is what I have thought of so far:
- One can try to minimise $\sum_i\tan^2(\theta_i)$, where $\theta_i$ is the angle between our desired line and the $i$-th line segment. I know the slopes of my line segments, so minimising this quantity amounts to solving for the slope of my line. This may give me a line, but it is probably not a standard one.
- I somehow feel that our desired line will be parallel to the regression line, although I can't prove it. My intuition is that minimising the angles effectively means minimising the components of the segments orthogonal to my line. I am not at all sure about this, but it is the idea closest to "projections, inner product", I guess.
- How about the principal component analysis line? I do not know the theory, but would eigenvalues be of any help to me?
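
To make the last two guesses concrete, here is a minimal sketch under one possible reading of "not very high". That reading is my own assumption, not part of the interview question: choose the unit direction $u$ that maximises $\sum_i (d_i\cdot u)^2$, where $d_i$ is the vector along the $i$-th segment. Since $(d_i\cdot u)^2=\|d_i\|^2\cos^2\theta_i$, this is the same as minimising $\sum_i\|d_i\|^2\sin^2\theta_i$, and a maximiser is a top eigenvector of $S=\sum_i d_i d_i^{T}$. The function name `best_direction` and the `normalize` flag are hypothetical. With `normalize=True` every segment is scaled to unit length first, so the criterion becomes the unweighted $\sum_i\sin^2\theta_i$; this assumes the points are distinct.

```python
from itertools import combinations

import numpy as np


def best_direction(points, normalize=False):
    """Unit vector u maximising sum_i (d_i . u)^2 over all segment vectors d_i.

    Hypothetical helper: one possible formalisation, not the interview's answer.
    """
    pts = np.asarray(points, dtype=float)
    # One difference vector per unordered pair of points: n choose 2 rows.
    diffs = np.array(
        [pts[j] - pts[i] for i, j in combinations(range(len(pts)), 2)]
    )
    if normalize:
        # Scale each segment to unit length (requires distinct points).
        diffs = diffs / np.linalg.norm(diffs, axis=1, keepdims=True)
    # S = sum_i d_i d_i^T is symmetric positive semidefinite (2 x 2 here).
    S = diffs.T @ diffs
    # eigh returns eigenvalues in ascending order; take the last eigenvector.
    _, eigvecs = np.linalg.eigh(S)
    u = eigvecs[:, -1]
    # Fix the sign so the result is reproducible (a line has no orientation).
    return -u if u[0] < 0 else u


pts = [(0, 0), (1, 2), (2, 4), (3, 6)]  # collinear points on y = 2x
u = best_direction(pts)
print(np.round(u, 3))  # [0.447 0.894], i.e. the direction (1, 2) / sqrt(5)
```

In the unnormalised case, $\sum_{i<j}(x_i-x_j)(x_i-x_j)^{T}=n\sum_i(x_i-\bar x)(x_i-\bar x)^{T}$, so this direction coincides with the first principal component of the points. It need not be parallel to the ordinary least-squares line, which minimises vertical rather than orthogonal distances.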