Scalable probabilistic PCA for large-scale genetic variation data