Tags: Colloquium Series

The Statistics Department hosts weekly colloquia on a variety of statistcal subjects, bringing in speakers from around the world.

Jung Ae Lee

Thu, 08/29/2013 - 3:11pm

This dissertation consists of two parts for the topic of sample integrity in high dimensional data. The first part focuses on batch effect in gene expression data. Batch bias has been found in many microarray studies that involve multiple batches of samples. Currently available methods for batch effect removal are mainly based on gene-by-gene analysis. There has been relatively little development on multivariate approaches to batch adjustment…

Eric Vance

Thu, 08/29/2013 - 3:09pm

LISA and its partners will educate and train statisticians from developing countries to communicate and collaborate with non-statisticians and then support these statisticians to create statistical collaboration laboratories in their home countries to help researchers, government officials, local industries, and NGOs apply statistical thinking and data science to make better decisions through data. At LISA and elsewhere, we will unlock the…

Xiaotong Shen

Thu, 08/29/2013 - 3:07pm

Personalized information filtering extracts the information specifically relevant to a user, based on the opinions of users who think alike or the content of the items that a specific user prefers. In this talk, we discuss latent models to utilize additional user-specific and content-specific predictors, for personalized prediction. In particular, we factorize a user-over-item preference matrix into a product…

Yao Xie

Thu, 08/29/2013 - 3:06pm

How do we quickly detect small solar flares in a large video stream generated by NASA satellites? How do we improve detection by efficient representation of high-dimensional data that is time-varying? Besides astronomical imaging, high-dimensional change-point detection also arises in many other applications including computer network intrusion detection, sensor networks, medical imaging, and epidemiology. In these problems, each dimension…

Eric Kolaczyk

Thu, 08/29/2013 - 3:03pm

Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution -- an object of fundamental interest -- of a true underlying network from its sampled network. In…

Chris McMahan

Thu, 08/29/2013 - 3:02pm

Screening for sexually transmitted diseases has benefitted greatly from the use of group testing (pooled testing) to lower costs. With the development of assays that detect multiple infections, screening practices now involve testing pools of individuals for multiple infections simultaneously. Building on the research for single infection group testing procedures, we examine the performance of group testing for multiple infections. Our work is…

Sun-Young Hwang

Thu, 08/29/2013 - 2:58pm

Various estimation methods in time series are reviewed in a unified framework via martingale estimating functions. In particular, maximum likelihood and quasi-likelihood are discussed in the context of asymptotic optimality within certain estimating functions. Both ergodic and non-ergodic processes are considered. To illustrate the main results, various parameter estimates for GARCH processes, bifurcating and explosive AR processes,…

Heping Zhang

Thu, 08/29/2013 - 2:55pm

In psychiatric and behavioral research, about six out of ten people with a substance use disorder suffer from another form of mental illness as well, making it necessary to consider multiple conditions as we study the etiologies of these conditions. The occurrence of multiple disorders in the same patient is referred to as comorbidity. Identifying the risk factors for comorbidity is an important yet difficult topic in psychiatric research. The…

Tianxi Cai

Thu, 08/29/2013 - 2:53pm

Clinical trials that evaluate treatment benefit focus primarily on estimating the average benefit. However, a treatment reported to be effective may not be beneficial to all patients. For example, the benefit of giving chemotherapy prior to hormone therapy with Tamoxifen in the adjuvant treatment of postmenopausal women with lymph node negative breast cancer depends on the ER-status. Due to the toxicity of chemotherapy, it is crucial to identify…

Jianhua Hu

Thu, 08/29/2013 - 2:52pm

For high dimensional genetic data, an important problem is to search for associations between genetic variables and a phenotype---typically, a discrete variable (diseased versus normal). A conventional solution is to characterize such relationships through regression models in which a phenotype is treated as the response variable and genetic variables are treated as the covariates. Not surprisingly, such a way incurs the challenging problem of…

Subscribe to Colloquium Series

Slideshow

Tags: Colloquium Series

Support us