This software uses a statistical model to predict protein molecular function for unannotated proteins using functional annotations from a set of homologous proteins, described in
B.E. Engelhardt, M.I. Jordan, J.R. Srouji, and S.E. Brenner (2010) Genome-scale phylogenetic function annotation of large and diverse protein families. Submitted.
Engelhardt BE, Stephens M (2010) "Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis." PLoS Genetics 6(9):e1001117.
Download C++ code and instructions for SFA 1.0 and further documentation for the SFA model.