Abstract
We consider the problem of predicting an individual's identity fromaccelerometry data collected during walking. In a previous paper we introducedan approach that transforms the accelerometry time series into an image byconstructing its complete empirical autocorrelation distribution. Predictorsderived by partitioning this image into grid cells were used in logisticregression to predict individuals. Here we: (1) implement machine learningmethods for prediction using the grid cell-derived predictors; (2) deriveinferential methods to screen for the most predictive grid cells; and (3)develop a novel multivariate functional regression model that avoidspartitioning of the predictor space into cells. Prediction methods are comparedon two open source data sets: (1) accelerometry data collected from $32$individuals walking on a $1.06$ kilometer path; and (2) accelerometry datacollected from six repetitions of walking on a $20$ meter path on two separateoccasions at least one week apart for $153$ study participants. In the$32$-individual study, all methods achieve at least $95$% rank-1 accuracy,while in the $153$-individual study, accuracy varies from $41$% to $98$%,depending on the method and prediction task. Methods provide insights into whysome individuals are easier to predict than others.