Multivariate Statistical Analysis 

A covariance matrix is the simplest summary and the best way to explain the dependence relations of several variables. In different applications various structures for the covariance matrix could be of interest. The estimator for the covariance matrix is extra important since inference on the mean parameters strongly depends on the estimated covariance matrix. Hence, when testing the parameters for a multivariate model, in moderate and high dimensions, the estimator of the covariance matrix is very important. 

For 200 years ago, Laplace and Gauss were some of the first ones to introduce the bivariate normal density, i.e., considering two dependent normal variables. In the end of 19th century Galton (half-cousin of Charles Darwin) exploited bivariate problems in genetics and introduced the ideas of correlation and regression in the study of paired measurements. Pearson developed the ideas further for studying problems in genetics, biology and other applications. In the beginning of 20th century Fisher gave the distribution of the sample Pearson correlation coefficient, where he also extended the chi-squared distribution to the bivariate case. This was later generalized by Wishart to the multivariate case and the Wishart matrix distribution which play an essential role in multivariate statistical analyses. 

Many multivariate statistical analyses discussed in the literature are based on the assumption that the observation vectors are independent and normally distributed. The main reason for this is that sets of multivariate observations are often at least approximately normally distributed through the central limit effect. One advantage of normal distribution is that it is mathematically tractable and useful explicit results can be obtained. Furthermore, normally distributed data can also be modeled entirely in terms of their means and variances/covariances.

Estimating the mean and the covariance matrix is therefore a problem of great interest in statistics and it is also of course of great significance to consider the correct statistical model. Originally, many estimators of the covariance matrix were obtained from non-iterative least squares methods. When computer sources became stronger and covariance matrices with structures were considered iterative methods were introduced such as the maximum likelihood method and the restricted maximum likelihood method, among others. 

Nowadays, more and more interests are in the area of high dimensional statistical analyses, i.e., when the dimension of the multivariate normal distribution is larger than the number of observations and even grows to infinity, and non-iterative methods have again become of interest. In multivariate analyses, especially in high dimensions, the behavior of eigenvalues and eigenvectors of random symmetric matrices are of much interest. These ideas goes back to the work of Pearson (early 20th century) who considered dimension reduction of multivariate data through principal component analysis. Furthermore, high dimensional statistical analyses and random matrices are very important tools in mathematics, financial mathematics, statistics, big data analyses, engineering, physics as well as other areas. 

My research focuses on the problem of estimating and testing the parameters in multivariate normal distribution, in moderate and high dimensions, where the mean and covariance matrix may have various structures. For the covariance matrix it can be the Kronecker structure, which leads to a matrix normal distribution or more general the multilinear normal distribution, i.e., a tensor normal distribution. The Kronecker structured model can for example be used in the purpuse to model dependent multilevel observations. The mean can also follows a parametric model, such as linear, bilinear (also known as the Growth Curve model) or even higher dimensions as trilinear regression models. In these models the covariance matrix has a Kronecker product structure, whereas some factor matrices may be linearly structured, e.g., banded, intraclass, Toeplitz, circular Toeplitz, special structure with zeros or some mix.

 

Professional activities

  • Head of Division of Mathematical Statistics at the Department of Mathematics, Linköping University
  • Assistant Director for the Research School in Interdisciplinary Mathematics at Linköping University
  • Deputy Team Leader for the sub-programme Applied Mathematics and Statistics part of the University of Rwanda - Sweden Research, Higher Education and Institutional Advancement Cooperation Programme
  • Member of the board of the Department of Mathematics, Linköping University

Upcoming conferences

Invited speaker at:

  • 2017 Annual Southern Africa Mathematical Sciences Association (SAMSA) conference, November 20-23, 2017, Arusha, Tanzania — http://www.maths.udsm.ac.tz/samsa2017/
  • International Conference on Linear Algebra and its Applications (ICLAA2017), December 11-15, 2017, Manipal University, Manipal, India — http://iclaa2017.com

Related documents

Research

PhD students

Former PhD students

Publications

2017

Innocent Ngaruye, Dietrich von Rosen, Martin Singull

Small area estimation under a multivariate linear model for incomplete repeated measures data

Report

Innocent Ngaruye, Dietrich von Rosen, Martin Singull

Small Area Estimation under a multivariate linear model for repeated measures Data

In Communications in Statistics - Theory and Methods

Article in journal

Jolanta Pielaszkiewicz, Dietrich von Rosen, Martin Singull

Testing Independence via Spectral Moments

In Springer Proceedings in Mathematics & Statistics

Article in journal

Innocent Ngaruye, Dietrich von Rosen, Martin Singull

Mean-squared errors of small area estimators under a multivariate linear model for repeated measures data

Report

Jolanta Pielaszkiewicz, Dietrich von Rosen, Martin Singull

On E [Pi(k)(i=0) Tr{W-mi}], where W similar to Wp (l, n)

In Communications in Statistics - Theory and Methods

Article in journal

Emanuel Evarest Sinkwembe, Fredrik Berntsson, Martin Singull, Xiangfeng Yang

Weather derivatives pricing using regim switching models

Report

Fredrik Berntsson, Martin Ohlson

More on Estimation of Banded and Banded Toeplitz Covariance Matrices

Report

Muni S. Srivastava, Martin Singull

Testing Sphericity and Intraclass Covariance Structures under a Growth Curve Model in High Dimension

In Communications in statistics. Simulation and computation

Article, review/survey

Emanuel Evarest, Fredrik Berntsson, Martin Singull, Wilson M. Charles

Regime Switching models on Temperature Dynamics

In International Journal of Applied Mathematics and Statistics

Article in journal

Muni S. Srivastava, Martin Singull

Test for the mean matrix in a Growth Curve model for high dimensions

In Communications in Statistics - Theory and Methods

Article in journal

Cassien Habyarimana, Martin Singull, Joseph Nzabanita

Estimation of Parameters in the Growth Curve Model with a Linearly Structured Covariance Matrix: A Simulation Study

In International Journal of Scientific Engineering and Technology

Article in journal

Innocent Ngaruye, Dietrich von Rosen, Martin Singull

Small area estimation with missing data using a multivariate linear random effects model

Report

2016

Emanuel Evarest, Fredrik Berntsson, Martin Singull, Wilson Charles

Regime Switching models on Temperature Dynamics

Report

Innocent Ngaruye, Dietrich von Rosen, Martin Singull

Crop yield estimation at district level for agricultural seasons 2014 in Rwanda

In African Journal of Applied Statistics

Article in journal

Innocent Ngaruye, Joseph Nzabanita, Dietrich von Rosen, Martin Singull

Small Area Estimation under a Multivariate Linear Model for Repeated measures Data

In Communications in Statistics - Theory and Methods

Article in journal