Replication data for: Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics (doi:10.7910/DVN/OSQPDV)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Replication data for: Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics

Identification Number:

doi:10.7910/DVN/OSQPDV

Distributor:

Harvard Dataverse

Date of Distribution:

2007-11-28

Version:

1

Bibliographic Citation:

David M. Seo; Pascal J. Goldschmidt-Clermont; and Mike West, 2007, "Replication data for: Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics", https://doi.org/10.7910/DVN/OSQPDV, Harvard Dataverse, V1

Study Description

Citation

Title:

Replication data for: Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics

Identification Number:

doi:10.7910/DVN/OSQPDV

Authoring Entity:

David M. Seo (Duke University)

Pascal J. Goldschmidt-Clermont (University of Miami)

and Mike West (Duke University)

Date of Production:

2007

Distributor:

Harvard Dataverse

Distributor:

Institute for Mathematical Statistics

Date of Deposit:

2007-10-01

Date of Distribution:

2007

Holdings Information:

https://doi.org/10.7910/DVN/OSQPDV

Study Scope

Keywords:

Animal–human extrapolation, atherosclerosis risk factors, gene-environment interactions, gene expression signatures, multivariate anova, latent factor models, sparse statistical modeling

Abstract:

In high-throughput genomics, large-scale designed experiments are becoming common, and analysis approaches based on highly multivariate regression and anova concepts are key tools. Shrinkage models of one form or another can provide comprehensive approaches to the problems of simultaneous inference that involve implicit multiple comparisons over the many, many parameters representing effects of design factors and covariates. We use such approaches here in a study of cardiovascular genomics. The primary experimental context concerns a carefully designed, and rich, gene expression study focused on gene-environment interactions, with the goals of identifying genes implicated in connection with disease states and known risk factors, and in generating expression signatures as proxies for such risk factors. A coupled exploratory analysis investigates cross-species extrapolation of gene expression signatures—how these mouse-model signatures translate to humans. The latter involves exploration of sparse latent factor analysis of human observational data and of how it relates to projected risk signatures derived in the animal models. The study also highlights a range of applied statistical and genomic data analysis issues, including model specification, computational questions and model-based correction of experimental artifacts in DNA microarray data.

Notes:

Subject: STANDARD DEPOSIT TERMS 1.0 Type: DATAPASS:TERMS:STANDARD:1.0 Notes: This study was deposited under the of the Data-PASS standard deposit terms. A copy of the usage agreement is included in the file section of this study.;

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

Related Publications

Citation

Title:

David M. Seo, Pascal J. Goldschmidt-Clermont and Mike West. 2007. "Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics." Ann. Appl. Statist. Volume 1, Number 1 (2007), 152-178. <a href="http://projecteuclid.org/DPubS/Repository/1.0/Disseminate?view=body&amp;id=pdfview_1&amp;handle=euclid.aoas/1183143733" target= "_new">article available here</a>

Bibliographic Citation:

David M. Seo, Pascal J. Goldschmidt-Clermont and Mike West. 2007. "Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics." Ann. Appl. Statist. Volume 1, Number 1 (2007), 152-178. <a href="http://projecteuclid.org/DPubS/Repository/1.0/Disseminate?view=body&amp;id=pdfview_1&amp;handle=euclid.aoas/1183143733" target= "_new">article available here</a>

Other Study-Related Materials

Label:

Human.factorlanalysis.info.zip

Text:

Zip file containing raw data and model parameters files for the human factor analyses

Notes:

application/zip

Other Study-Related Materials

Label:

mice-anovaregn-info.zip

Text:

Zip file containing raw data and model parameters files for the mice factor analyses

Notes:

application/zip

Other Study-Related Materials

Label:

supplement3.pdf

Text:

SPARSE STATISTICAL MODELLING IN CARDIOVASCULAR GENOMICS — SUPPLEMENTARY FIGURES AND TABLES

Notes:

application/pdf