Statistical research spans an enormous range from direct subject-matter collaborations to pure mathematical theory. The Annals of Applied Statistics, the newest journal from the IMS, is aimed at papers in the applied half of this range. Published quarterly in both print and electronic form, our goal is to provide a timely and unified forum for all areas of applied statistics.
Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 10 of 11 Results
Jan 3, 2008
Lev Klebanov; Andrei Yakovlev, 2008, "Replication data for: Diverse Correlation Structures in Microarray Gene Expression Data", https://doi.org/10.7910/DVN/Z6UE8D, Harvard Dataverse, V1
It is well-known that correlations in microarray data represent a serious nuisance deteriorating the performance of gene selection procedures. This paper is intended to demonstrate that the correlation structure of microarray data provides a rich source of useful information. We discuss distinct correlation substructures revealed in microarray gene...
Nov 27, 2007
Qing Zhou; Wing Hung Wong, 2007, "Replication data for: Coupling of Hidden Markov Models for the Discovery of Cis-Regulatory Modules in Multiple Species", https://doi.org/10.7910/DVN/P1FC4F, Harvard Dataverse, V1
Cis-regulatory modules (CRMs) composed of multiple transcription factor binding sites (TFBSs) control gene expression in eukaryotic genomes. Comparative genomic studies have shown that these regulatory elements are more conserved across species due to evolutionary constraints. We propose a statistical method to combine module structure and cross-sp...
Nov 27, 2007
Brian James Reich; Montserrat Fuentes, 2007, "Replication data for: A multivariate semiparametric Bayesian spatial modeling framework for hurricane surface wind fields", https://doi.org/10.7910/DVN/PMF6PG, Harvard Dataverse, V1
Storm surge, the onshore rush of sea water caused by the high winds and low pressure associated with a hurricane, can compound the effects of inland flooding caused by rainfall, leading to loss of property and loss of life for residents of coastal areas. Numerical ocean models are essential for creating storm surge forecasts for coastal areas. Thes...
Nov 27, 2007
Michael A. Newton; Fernando A. Quintana; ohan A. den Boon; Srikumar Sengupta; and Paul Ahlquist, 2007, "Replication data for: Random-set Methods Identify Distinct Aspects of the Enrichment Signal in Gene-set Analysis", https://doi.org/10.7910/DVN/ZVHDGA, Harvard Dataverse, V1
A prespecified set of genes may be enriched, to varying degrees, for genes that have altered expression levels relative to two or more states of a cell. Knowing the enrichment of gene sets defined by functional categories, such as gene ontology (GO) annotations, is valuable for analyzing the biological signals in microarray expression data. A commo...
Nov 27, 2007
Galit Shmueli; Ralph P. Russo; and Wolfgang Jank, 2007, "Replication data for: The BARISTA: A Model for Bid Arrivals in Online Auctions", https://doi.org/10.7910/DVN/FQ3CCH, Harvard Dataverse, V1
The arrival process of bidders and bids in online auctions is important for studying and modeling supply and demand in the online marketplace. A popular assumption in the online auction literature is that a Poisson bidder arrival process is a reasonable approximation. This approximation underlies theoretical derivations, statistical models, and sim...
Nov 27, 2007
David M. Seo; Pascal J. Goldschmidt-Clermont; and Mike West, 2007, "Replication data for: Of Mice and Men: Sparse Statistical Modeling in Cardiovascular Genomics", https://doi.org/10.7910/DVN/OSQPDV, Harvard Dataverse, V1
In high-throughput genomics, large-scale designed experiments are becoming common, and analysis approaches based on highly multivariate regression and anova concepts are key tools. Shrinkage models of one form or another can provide comprehensive approaches to the problems of simultaneous inference that involve implicit multiple comparisons over th...
Nov 27, 2007
Anita M. Araneda; Stephen E. Fienberg; Pontificia Universidad Católica de Chile, 2007, "Replication data for: A Statistical Approach to Simultaneous Mapping and Localization for Mobile Robots", https://doi.org/10.7910/DVN/TYUUPP, Harvard Dataverse, V1
Mobile robots require basic information to navigate through an environment: they need to know where they are (localization) and they need to know where they are going. For the latter, robots need a map of the environment. Using sensors of a variety of forms, robots gather information as they move through an environment in order to build a map. In t...
Nov 27, 2007
Peter D. Hoff, 2007, "Replication data for: Extending the rank likelihood for semiparametric copula estimation", https://doi.org/10.7910/DVN/G4WZFP, Harvard Dataverse, V1, UNF:3:IA0sBg0nAMB7CZi0YV10ig== [fileUNF]
Quantitative studies in many fields involve the analysis of multivariate data of diverse types, including measurements that we may consider binary, ordinal and continuous. One approach to the analysis of such mixed data is to use a copula model, in which the associations among the variables are parameterized separately from their univariate margina...
Nov 27, 2007
David M. Blei; John D. Lafferty, 2007, "Replication data for: A Correlated Topic Model of Science", https://doi.org/10.7910/DVN/12VGO7, Harvard Dataverse, V1
Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical analysis of document collections and other discrete data. The LDA model assumes that the words of each document arise from a mixture of topics, each of which is a distribution over the vocabulary. A limitation of LDA is the inability to model topic corr...
Nov 27, 2007
Clifford Spiegelman; William A. Tobin; William D. James; Simon J. Sheather; Stuart Wexler; and D. Max Roundhill, 2007, "Replication data for: Chemical and forensic analysis of JFK assassination bullet lots: Is a second shooter possible?", https://doi.org/10.7910/DVN/6B4CXH, Harvard Dataverse, V1
The assassination of President John Fitzgerald Kennedy (JFK) traumatized the nation. In this paper we show that evidence used to rule out a second assassin is fundamentally flawed. This paper discusses new compositional analyses of bullets reportedly to have been derived from the same batch as those used in the assassination. The new analyses show...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.