Knowles, D and Ghahramani, Z *Nonparametric Bayesian sparse factor models with application to gene expression modeling.* Annals of Applied Statistics, 5. pp. 1534-1552. (Unpublished)

## Abstract

A nonparametric Bayesian extension of Factor Analysis (FA) is proposed where observed data $\mathbf{Y}$ is modeled as a linear superposition, $\mathbf{G}$, of a potentially infinite number of hidden factors, $\mathbf{X}$. The Indian Buffet Process (IBP) is used as a prior on $\mathbf{G}$ to incorporate sparsity and to allow the number of latent features to be inferred. The model's utility for modeling gene expression data is investigated using randomly generated data sets based on a known sparse connectivity matrix for E. Coli, and on three biological data sets of increasing complexity.

Item Type: | Article |
---|---|

Uncontrolled Keywords: | stat.AP stat.AP cs.AI stat.ML |

Subjects: | UNSPECIFIED |

Divisions: | Div F > Computational and Biological Learning |

Depositing User: | Cron Job |

Date Deposited: | 18 May 2016 17:41 |

Last Modified: | 24 Jul 2016 02:11 |

DOI: | 10.1214/10-AOAS435 |