Tags: Colloquium Series

The Statistics Department hosts weekly colloquia on a variety of statistcal subjects, bringing in speakers from around the world.

Immunohistochemical (IHC) staining is widely used in the diagnosis of cancer. Current methods for IHC scoring rely on a number of subjective evaluations.  A pathologist, for example, has to assess the overall level of staining of the tumor as negative, weak, intermediate, or strong, and then make a secondary subjective assessment of the percentage of cells that fall into each of these staining intensity categories.  Current inter- and…
Preliminary results are presented from an ongoing study of the development of tertiary students’ reasoning in a one-semester college-level statistics course. The modeling and simulation-based course relies on randomization and bootstrap methods for inference. Students in the statistics course learn to use TinkerPlots® to create "just by chance" models that form the basis of simulated distributions of sample statistics in order to draw an…
In model-based survey sampling Hierarchical Bayesian (HB) methods have gained immense popularity. One of the major reasons for this popularity remains the convenience in implementation of HB models using MCMC methods even when the models are complex. An inevitable part of this approach is elicitation of the priors for the parameters involved in the model. Authentic expert information can be incorporated by assigning suitable subjective prior…
Space-filling designs are widely used for emulating computer simulators. Over the last three decades, a wide spectrum of Latin hypercube designs (LHDs) has been proposed with different space-filling criteria like minimum correlation among factors, maximin inter-point distance, and orthogonality among the factors. In this talk I will present a new class of space-filling designs. These designs are derived from randomization defining contrast…
Large genome-wide association studies are now consistently pointing towards an extremely polygenic model for complex diseases. Such models may involve thousands of susceptibility markers, each conferring only a modest risk, but collectively they could be explaining substantial variation in disease-risks in populations. Further, a few large studies of gene-environment interactions indicate that genetic and environmental risk-factors may broadly…
JMP includes comprehensive capabilities for statistical and graphical analysis of data for every academic field and most research needs. Easily perform basic statistics to the most complex analyses a PhD student may encounter. JMP runs on Windows and Macintosh operating systems and also functions as an easy, point-and-click interface to SAS®, R, MATLAB and Excel.  JMP is visual and interactive. It drives you through the statistical analysis…
This dissertation undertakes the theory and methods of sufficient dimension reduction in the content of Hilbert-Schmidt Independence Criterion (HSIC). The proposed estimation methods enjoy model free property and require no link function to be smoothed or estimated. Two tests: Permutation test and Bootstrap test, are investigated to examine the true underlying dimension of data considered. Sampling distribution of our estimator is established in…
Sufficient Dimension Reduction (SDR) is a dimension reduction paradigm for reducing the dimension of the predictor vector without losing regression information. Classical inverse regression based SDR methods, though successfully used in many applications and have attractive computational properties, require inverting the predictor vector covariance matrix. This has hindered their use in contemporary high dimensional data analysis, where the…
With the development of computing and internet technology, data sets with stupendously large numbers of observations are more and more common. One technique to handle the big data is to aggregate classical data to symbolic data, like lists, intervals, lists with probabilities and intervals with probabilities (histograms). Building clustering methods for symbolic data has been an active area over the past decade. In this dissertation, we first…
Many data sets in the sciences (broadly defined) deal with multiple sets of multivariate time series. The case of a single univariate time series is very well developed in the literature; and single multivariate series though less well studied have also been developed (under the rubric of vector time series). A class of matrix time series models is introduced for dealing with the situation where there are multiple sets of multivariate time…