Tags: Colloquium Series

The Statistics Department hosts weekly colloquia on a variety of statistcal subjects, bringing in speakers from around the world.

Statistical Analysis of Big Data and Structured Data with Application to Neuroscience

Tue, 07/19/2016 - 12:40pm

In this talk, we consider two types of data from neuroscience: neuromorphology data and neuron activity data. First, we focus on data extracted from brain neuron cells of rodents and model each neuron as a data object with topological and geometric properties characterizing the branching structure, connectedness and orientation of a neuron. We define the notions of topological and geometric medians as well as quantiles based on newly-…

Extreme Quantile Determination for Manufacturing Process Parameters

Tue, 07/19/2016 - 12:39pm

Monitoring the control and capability of process parameters is a continual and mammoth task for today’s manufacturers. The importance of simple, efficient, and automated approaches cannot be overstated. Paramount in this endeavor is the determination of extreme quantiles. I will review approaches for determining these quantile from the last 25 years of literature, as well as current usage at Eli Lilly and Company. A number of candidate…

Statistical issues and challenges in biomedical studies

Tue, 07/19/2016 - 12:37pm

In this talk, I will present statistical issues and challenges that I have encountered in my biomedical collaborative studies of item selection in disease screening, comparison and identification of biomarkers that are more informative to disease diagnosis, and estimation of weights on relatively importance of exposure variables on health outcome. After a discussion on the issues and challenges with real examples, I will review available…

Distribution-free Multiple Testing

Tue, 07/19/2016 - 12:36pm

We study a stylized multiple testing problem where the test statistics are independent and assumed to have the same distribution under their respective null hypotheses. We first show that, in the normal means model where the test statistics are normal Z-scores, the well-known method of (Benjamini and Hochberg, 1995) is optimal in some asymptotic sense. We then show that this is also the case of a recent distribution-free method proposed by…

Testing of Regression Functions with Data Missing at Random

Tue, 07/19/2016 - 12:34pm

This talk includes two testing problems of regression functions with responses missing at random. One problem is minimum distance model checking. The proposed lack-of-fit tests are based on a class of minimum integrated square distances between a kernel type estimator of a regression function and the parametric regression function being fitted. These tests are shown to be consistent against a large class of fixed alternatives. The corresponding…

Slicing Public Opinion for State and National Research

Tue, 07/19/2016 - 12:33pm

In this paper, we develop and release measures of public ideology in 2010 for the 50 American states, 435 congressional districts, and state legislative districts. We do this using the geospatial statistical technique of kriging, which uses the locations of survey respondents, as well as population covariate values, to predict ideology for simulated citizens in districts across the country. In doing this, we improve on past research …

On Surrogate Variable Analysis for High Dimensional Genetics and Genomics Data

Wed, 06/29/2016 - 9:10am

Unwanted variation in hidden variables often negatively impacts analysis of high-dimensional data, leading to high false discovery rates, and/or low rates of true discoveries. A number of procedures have been proposed to detect and estimate the hidden variables, including principal component analysis (PCA). However, empirical data analysis suggests that PCA is not efficient in identifying the hidden…

Confidence Inference Function in Big Data

Thu, 06/23/2016 - 9:24am

Statistical inference along with the strategy of divide-and-combine for Big Data analysis has been little studied. As an effective inferential tool, confidence distribution (CD) has attracted a surge of renewed attention. The essence in constructing confidence distribution pertains to the availability of suitable pivotal quantities, which are usually obtained from the (asymptotical) distribution of point maximum…

Functional and very high dimension reduction

Mon, 06/20/2016 - 8:42am

The talk has two components. In the first component, to study the relation between a univariate response and multiple functional covariates, we propose a functional single index model that is semiparametric. The parametric part of the model integrates the linear regression modeling for functional data and the sucient dimension reduction structure. The nonparametric part of the model further allows the response-index dependence or the link…

Visualizing chance in introductory probability

Fri, 06/17/2016 - 10:30am

Research in the last thirty years has documented the challenges and difficulties in teaching probability and the many misconceptions prevalent in people’s reasoning. There is now a call to reform the approach to teaching probability from a traditional mathematical base to include more emphasis on modeling andinvestigations. Within this spirit of reform we undertook a two-part exploratory study. In the first part we interviewed seven…

Subscribe to Colloquium Series

Slideshow

Tags: Colloquium Series

Support us