Loyola College M.Sc. Statistics Nov 2003 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI –600 034

M.Sc., DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – NOVEMBER 2003

ST-3801/S916 – MULTIVARIATE ANALYSIS

05.11.2003                                                                                                           Max:100 marks

1.00 – 4.00

SECTION-A

Answer ALL the questions.                                                                              (10×2=20 marks)

 

  1. Let X ~ N . Obtain the conditional distribution of X1 given X2 = x2.
  2. Write the characteristic function of bivariate normal distribution.
  3. Explain how the collinearity problem can be solved in the multiple regression

Y = Xb + Î.

  1. If X and Y are two independent standard normal variables, obtain the distribution of two times of the mean of these two variables.
  2. Let X be trinormal with

m =  and   compute  .

  1. Define Fisher’s Z – transformation.
  2. Explain classification problem into two classes.
  3. Write down any four similarity measures used in cluster analysis.
  4. Distinguish between principal component and factor analysis.
  5. What is meant by residual plot?

 

SECTION-B

Answer any FIVE  questions.                                                                          (5×8=40 marks)

 

  1. Define multiple correlation coefficient between X1 and X2, …., Xp. Show that the multiple correlation coefficient between X1 and X2, …., Xp has the expression

.

  1. Let Y ~ Np ( 0, å) . Show that Y1 has – distribution.
  2. Test at level 0.05 whether m = in a bivariate normal population with s11 = s22 = 5 and s12 = -2 ,   by using the sample mean vector  based on a sample of size 10.
  3. How will you test the equality of covariance matrices of two multivariate normal distributions on the basis of independent samples drawn from two populations?
  4. Derive the characteristic function of Wishart distribution.
  5. In Principal component analysis derive the first principal component.
  6. Obtain the rule to assign an observation of unknown origin to one of two p-variate normal populations having the same dispersion matrix.
  7. Outline single linkage and complete linkage clustering procedures with an example.

 

SECTION-C

Answer any TWO  questions.                                                                          (2×20=40 marks)

 

  1. a) Derive the MLE of å when the sample is from Np (m, å).
  2. b) Define Hottelling’s T2 – statistic.
  3. c) Using the likelihood ratio test procedure, show that the rejection region for testing m =

mo against m ¹ mo is given by

T2 = n( S-1 (³  T.                                                             (10+3+7)

  1. a) Prove that under some assumptions (to be stated), Variance- covariance matrix can be

written as å = LL1 +  in the factor analysis model.  Also discuss the effect of an

orthogonal transformation.

  1. b) Let X1,X2,…., Xp have covariance matrix å with eigen value vector pairs (l1, e1),…,

(lp, ep),  l1 l2 ….. ≥ lp 0,   then prove that

s11 + s22 + …..+ spp = ,

Where ,  Yi  represents the i – th principal component.

  1. c) Explain the principal component (principal factor) method of estimating L in the factor

analysis model.                                                                                                  (10+5+5)

  1. a) Explain the method of extracting canonical correlations and canonical variables. Also

explain how the theory of canonical correlation is helpful in the analysis of multivariate

data.

  1. b) State an establish the additive property of Wishart distribution. (10+10)
  2. Write shot notes on:-
  3. Roy’s Union – Intersection principle
  4. Step – Wise regression
  5. Mohalanobis Squared distance. (5+10+5)

 

Go To Main page

 

 

Loyola College M.Sc. Statistics April 2007 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

AC 43

M.Sc. DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – APRIL 2007

ST 3801/ 3805 / 3808 – MULTIVARIATE ANALYSIS

 

 

 

Date & Time: 24/04/2007 / 9:00 – 12:00      Dept. No.                                       Max. : 100 Marks

 

 

 

SECTION A

 

10 X 2 =20

 

ANSWER ALL QUESTIONS. EACH CARRIES TWO MARKS

 

 

  1. Write the likelihood function corresponding to a sample of size drawn from .
  2. Mention any two applications of F-distribution in Multivariate Statistical Analysis.
  3. Find the distribution of if.
  4. Mention any two properties of variance-covariance matrix.
  5. How will you obtain the MLE of population correlation coefficient between two variables having bivariate normal distribution?
  6. What do mean by agglomerative algorithms?
  7. Define : Communalities
  8. Define Non-central Chi-square statistic.
  9. What is meant by “Expected Cost of Misclassification”?
  10. Give an unbiased estimator of variance-covariance matrix in

 

 

SECTION B

 

5 X 8 = 40

 

ANSWER ANY FIVE. EACH CARRIES EIGHT MARKS

 

  1. Derive the distribution of where and is a matrix of rank

 

  1. State and prove a characterization of multivariate normal distribution

 

  1. Obtain an expression of under usual notations.

 

  1. Obtain the rule which minimizes Expected Cost of Misclassification(ECM) in the case of two multivariate normal populations assuming priori probabilities are known to be equal and the populations have equal variance-covariance matrices.
  2. Show that where B,C,D and E are matrices of suitable order and mention any two places the above identity is used in multivariate analysis.

 

 

  1. State and prove any two properties of Wishart’s Distribution.

 

  1. Define Generalized variance. Show it is equal to the product of eigen roots of

 

 

  1. Obtain expressions for the first r-principal components in a p-variate normal distribution.

 

SECTION C

 

2 X 20 = 40

 

ANSWER ANY TWO. EACH CARRIES TWENTY MARKS

 

  1. Derive the conditional distribution of a sub vector of
  2. Derive the sampling distribution sample correlation coefficient assuming the population multiple correlation coefficient is zero.

 

  1. Derive the likelihood ratio test for assigned mean based on a sample of size N drawn from assuming is unknown.

 

  1. Write short notes of the following :
  • Similarity and Distance Measures.
  • Discriminant Analysis
  • Cluster Analysis
  • Orthogonal Factor Model

 

Go To Main Page

Loyola College M.Sc. Statistics April 2008 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

NO 42

M.Sc. DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – APRIL 2008

    ST 3808 / 3801 – MULTIVARIATE ANALYSIS

 

 

 

Date : 26/04/2008            Dept. No.                                        Max. : 100 Marks

Time : 9:00 – 12:00

 

PART – A

Answer all the questions.                                                                         (10 X 2 = 20)

 

  1. If X and Y are two independent standard normal variables then obtain the distribution of two times of the mean of these two variables.
  • If X = ( X1, X2 )’~N2 then write the c.f. of the marginal distribution

of X2.

  1. Mention any two properties of multivariate normal distribution.

4 . What is meant by residual plot?.

  1. Explain the use of partial and multiple correlation coefficients.
  2. Define Hotelling’s T2 – statistics.
  3. Define Fisher’s Z-transformation
  4. Write a short on discriminant analysis.
  5. Explain how canonical correlation is used in multivariate data analysis
  6. Explain classification problem into two classes.

 

 

PART B

Answer any FIVE questions.                                                                     (5 X 8 = 40)

 

11.Find the multiple correlation coefficient between X1 and X 2 , X3, …     , X p.

Prove that the conditional variance of X1 given the rest of the variables can not be

greater than unconditional variance of X1.

  1. Derive the c.f. of multivariate normal distribution.
  2. Let Y ~ Np ( 0 , S ). Show that Y’S -1 Y   has     distribution.
  3. Obtain a linear function to allocate an object of unknown origin to one of the two

normal populations.

  1. Giving a suitable example describe how objects are grouped by complete linkage

method.

  1. Let X ~ Np (m, S). If X(1) and X(2) are two subvectors of X, obtain the conditional

distribution of X(1) given X(2).

  1. Prove that the extraction of principal components from a dispersion matrix is the

study of characteristic roots and vectors of the same matrix  .

  1. Explain step-wise regression.

 

 

 

 

 

PART C

Answer any two questions.                                                                 (2 X 20 = 40)

 

 

               19 .  Derive the MLE of  m and S when the sample is from N ( m ,S ).

 

  1. a) Derive the procedure to test the equality of mean vectors of two p-variate

normal populations when the dispersion matrices are equal.

 

  1. b) Test at level 0.05 ,whether µ = ( 0 0  )’ in a bivariate normal population with

σ11 = σ22= 10 and  σ12= -4 , by using the sample mean vector= (7  -3)

based on a sample size 10.

(15 + 5)

 

  1. a) Define i) Common factor ii) Communality  iii) Total variation
  2. b) Explain the principal component ( principal factor )method of estimating L

in the factor analysis method.

  1. c) Discuss the effect of an orthogonal transformation in factor analysis method.

( 6 + 6 + 8 )

  1. What are canonical correlations and canonical variables? .Describe the extraction

of canonical correlations and their variables from the dispersion matrix. Also

show that there will be p canonical correlations if the dispersion matrix is of size p.

( 2+10+8 )

 

 

 Go To Main page

 

Loyola College M.Sc. Statistics April 2009 Multivariate Analysis Question Paper PDF Download

     LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

M.Sc. DEGREE EXAMINATION – STATISTICS

YB 41

THIRD SEMESTER – April 2009

ST 3808 – MULTIVARIATE ANALYSIS

 

 

 

Date & Time: 16/04/2009 / 1:00 – 4:00         Dept. No.                                                       Max. : 100 Marks

 

 

PART – A

Answer all the questions.                                                               (10 X 2 = 20)

 

  • Give an example in the bivariate situation that the marginal distributions are normal but the bivariate distribution is not.
  1. Let X, Y and Z have trivariate normal distribution with null mean vector and covariance matrix

2     3      4

3     2     -1

4    -1      1   ,

 

find the distribution of  Y+X.

  1. Mention any two properties of multivariate normal distribution.
  2. Explain the use of partial and multiple correlation coefficients.
  3. Define Hotelling’s T2 – statistics. How is it related to Mahlanobis’ D2?
  4. Outline the use of discriminant analysis.
  5. What are canonical correlation coefficients and canonical variables?
  6. Write down any four similarity measures used in cluster analysis.
  7. Write the c.f. of X where

X~N2 { ,   }.

10.Write  a short  note on data mining.

 

PART B

Answer any FIVE questions.                                                                    (5 X 8 = 40)

 

  1. Obtain the maximum likelihood estimator S of p-variate normal distribution with

mean vector known.

  1. Let X1, X2,…, X n be independent N( 0 , 1 ) random variables. Show that X’ A X

is chi-square if A is idempotent, where  X= ( X1,X2,…,X n )’.    

  1. 13. How will you test the equality of covariance matrices of two multivariate normal

distributions on the basis of independent samples drawn from two populations?.

  1. Let (Xi, Yi)’ , i = 1, 2, 3 be independently distributed each according to bivariate

normal with mean vector and covariance matrix as given below. Find the joint

distribution of six variables. Also find the joint distribution of  and .

Mean vector: (m, t)’, covariance matrix:

  1. Outline single linkage and complete linkage clustering procedures with an

example.

  1. Giving suitable examples explain how factor scores are used in data analysis.
  2. Consider a multivariate normal distribution of X with

m =      ,       S =                    .

 

Find i )  the conditional distribution of ( X1, X3 ) / ( X2, X4 )

  1. ii) s42  
  2. a) Define i ) Common factor  ii) Communality  iii) Total variation

b)Explain classification problem into two classes and testing problem.

PART C

      Answer any two questions.                                                                       (2 X 20 = 40)

 

  1. a) Derive the distribution function of the generalized T2 – statistic.
  2. b) Test at level 0.05 ,whether µ = ( 0 0 )’ in a bivariate normal population with

σ11 = σ22= 10 and  σ12= -4 , by using the sample mean vector= (7  -3)   based

on a sample size 20.                                                                                 (15 + 5)

  1. a) What are principal components?. Outline the procedure to extract principal

components   from a given covariance matrix.

  1. b) Define partial correlation between Xi and Xj .Also prove that

______   ______

r12.3=  ( r12-r13r23)/ {Ö(1-r223) Ö(1-r213)}.                           (  12+8)

21.a) Consider the two data sets

X1=     and   X2 =

for which         .

1) Calculate the linear discriminant function.

2) Classify the observation x0‘= ( 2  7 ) as population π1 or  population π2 using

the decision rule with equal priors and equal costs.

  1. b) Explain how the collinearity problem can be solved in the multiple regression.

( 14+6)

22.a)  Explain the method of extracting canonical correlations and their variables

from a dispersion matrix.

  1. b) Prove that under some assumptions (to be stated), variance and covariance can

be written as S = LL’ + y in the factor analysis model. Also discuss the effect

of an orthogonal transformation.                                                              (8 + 12)

 

 

Go To Main page

Loyola College M.Sc. Statistics Nov 2010 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

M.Sc. DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – NOVEMBER 2010

    ST 3811  – MULTIVARIATE ANALYSIS

 

 

 

Date : 29-10-10                 Dept. No.                                        Max. : 100 Marks

Time : 9:00 – 12:00

 

SECTION – A

Answer ALL the following questions                                                                                  (10 x 2 = 20 marks)

 

  1. State and briefly explain any one broad objective for which multivariate analysis techniques are used.
  2. Define Variance-Covariance matrix of a random vector.
  3. Explain ‘Bubble Plot’ for three-dimensional data display.
  4. Define ‘Partial Correlation Coefficient’.
  5. Explain the dimensionality reduction role of Principal components.
  6. Distinguish between ‘Clustering’ and ‘Classification’.
  7. State any two similarity measures for pairs of items when variables are binary.
  8. State any one of the commonly employed scalings on the coefficient vector of the Fisher’s linear discriminant function and the interpretative use of the scaling.
  9. State the Hotelling’s T2 statistic for testing of hypothesis about the mean vector of a multivariate normal population.
  10. State the test for significance of the correlation coefficient in a bivariate normal population.

 

SECTION – B

Answer any FIVE questions                                                                                                 (5 x 8 = 40 marks)

 

  1. If is the sample mean vector and Sn is the sample var-cov matrix based on a random sample of size ‘n’ from a p-variate distribution with mean vector μ and Var-cov matrix Σ, show that Var-Cov() = Σ /n and Sn is a biased estimator of Σ.
  2. Explain the construction of ‘probability plots’ in general and the investigation of multivariate normality assumption in particular.
  3. If X ~ Np(μ, Σ ) and C is a non-singular matrix of order p x p, show that                CX ~ Np ( Cμ, CΣCT ). Hence, deduce the distribution of DX where D is a q x p matrix with rank q (≤ p).
  4. Under usual notations of partitions of a random vector and its var-cov matrix, with  = , show that ρ( Xi ,X(2) ) ≥ ρ( Xi , X(2) ) for every vector.
  5. Develop the MANOVA for comparing mean vectors of a number of normal populations and explain the test procedure for the same.
  6. Establish the relationship of principal components to the eigen values and eigen vectors of the var-cov matrix of the underlying random vector.
  7. State the three linkage methods for hierarchical clustering. Present a figurative display of the measure of between-cluster distances in each method.
  8. Derive the ‘Minimum ECM Rule’ for classification involving two multivariate normal populations with a common var-cov matrix.

 

 

(P.T.O)

 

SECTION – C

Answer any TWO questions                                                                                              (2 x 20 = 40 marks)

 

  1. (a) Describe the enhancement of scatter plots with ‘lowess’ curves.

(b) Develop the multivariate normal density function.                                                              (8 +12)

 

  1. (a) Derive the MLE’s of the parameters μ, Σ of multivariate normal distribution.

(b) Show that these MLEs are independently distributed.                                                         (12 + 8)

 

  1. Present the motivation, definition and derivation of Fisher’s (multiple) discriminant functions.

 

  1. (a) Present the ‘Orthogonal Factor Model’ in detail and develop the notions of ‘Communality’ and ‘Specific Variance’.

(b) Bring out the ambiguity in a factor model, the rationale for ‘factor rotation’ and explain the varimax criterion for factor rotation.                                                                                          (12 + 8)

 

Go To Main Page

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Loyola College M.Sc. Statistics April 2012 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

M.Sc. DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – APRIL 2012

ST 3811/3808 – MULTIVARIATE ANALYSIS

 

 

Date : 21-04-2012             Dept. No.                                        Max. : 100 Marks

Time : 1:00 – 4:00

 

SECTION – A

Answer ALL the following questions:                                                                            (10 x 2 = 20 marks)

  1. Define var-cov matrix, correlation matrix and standard deviation matrix. State the relationship among them.
  2. State the general expressions for the mean vector and var-cov matrix of linear transformations of a random vector.
  3. State the general form of ‘statistical distance’.
  4. Explain ‘Bubble Plots’.
  5. Define ‘Partial Correlation Coefficient’.
  6. State the one-way MANOVA model.
  7. State the test for significance of correlation coefficient in a bivariate normal population.
  8. Give one reason why in K-means algorithm, the number of clusters ‘K’ is kept an open question.
  9. State the postulates on the ‘common factors’ and the ‘specific factors’ in the orthogonal factor model.
  10. Explain a situation where the ‘challenge’ of ‘Classification’ arises.

 

SECTION – B

Answer any FIVE questions:                                                                                             (5 x 8 = 40 marks)

  1. Briefly explain the terms ‘Data Reduction / Structural Simplification’ and ‘Sorting / Grouping’. Give real-life examples of these two objectives which are addressed by multivariate methods.

 

  1. Explain probability plots in general and how it is used for investigation of multivariate normality assumption.

 

  1. Derive the moment generating function of multivariate normal distribution.
  2. If X =  ~ Np (μ,Σ) and μ and Σ are accordingly partitioned as  and where  ≠ 0, derive the conditional distribution of X(1) given X(2).

 

  1. Derive Fisher’s linear discriminant function for discriminating two populations.

 

  1. Mention the three linkage methods for hierarchical clustering and present a figurative display of the measure of between-cluster distances in each method.

 

  1. Develop the Hotelling’s T2 test through the likelihood ratio criterion.         (P.T.O)
  2. Give the motivation and the formal definition of Principal Components. State the ‘Maximization Lemma’ (without proof) and hence, obtain the PCs for a random vector

 

SECTION – C

Answer any TWO questions:                                                                                          (2 x 20 = 40 marks)

  1. (a) If and S are the sample mean vector and var-cov matrix from a sample of size ‘n’ from a multivariate population with mean vector μ and var-cov matrix Σ , show that  is an unbiased estimator of μ but Sn is a biased estimator of Σ.

(b) Derive the MLEs of the parameters of multivariate normal distribution.

(10+10)

 

  1. (a) Consider the partitions in Q.No. (14). Let =  where= ith row of  . Show that, for every vector α,

(i)  Var[ Xi – X(2) ] ≤ Var [ Xi – α′ X(2) ]

(ii) Corr ( Xi, X(2) ) ≥ Corr ( X , α′ X(2) ).

Hence, obtain an expression for the multiple correlation coefficient between Xi and X(2).

(b) Find the mean vector and the var-cov matrix for the bivariate normal distribution whose p.d.f. is

f(x,y) = exp                 (12 + 8)

  1. (a) Exhibit the ‘ambiguity’ in the factor model. Bring out the need for factor  rotation and explain the ‘Varimax’ criterion for rotation.

(b) Explain the ‘Ordinary Least Squares Method’ of estimating the Factor Scores.

(12 + 8)

 

  1. (a) Derive an expression for ‘Expected Cost of Misclassification (ECM)’ for classification involving two populations and obtain the optimum allocation regions for the ‘Minimum ECM Rule’.

(b) Consider the following table on three binary variables measured on five subjects with a view to carry out clustering of the five subjects:

    variable

 

Individual

 

X1              X2               X5

1

2

3

4

5

    0                0                   1

1                1                   1

0                0                   1

0                1                   1

1                1                   0

 

 

 

 

 

 

 

 

Obtain the matrices of matches and mismatches, compute the similarity measure

sij =  (under usual notations) and carry out the clustering.         (10 + 10)

 

 

Go To Main Page

 

Loyola College M.Sc. Statistics Nov 2012 Multivariate Analysis Question Paper PDF Download

LOYOLA COLLEGE (AUTONOMOUS), CHENNAI – 600 034

M.Sc. DEGREE EXAMINATION – STATISTICS

THIRD SEMESTER – NOVEMBER 2012

ST 3811 – MULTIVARIATE ANALYSIS

 

 

Date : 01/11/2012            Dept. No.                                        Max. : 100 Marks

Time : 9:00 – 12:00

SECTION – A

Answer ALL the following questions:                                                                             (10 x 2 = 20 marks)

 

  1. Distinguish between ‘Data Exploration’ and ‘Confirmatory Analysis’.
  2. Define Variance-Covariance matrix and Generalized Variance of a random vector.
  3. Give the motivation for ‘Statistical Distance’ and express its form.
  4. Define ‘partial correlation coefficient’ and give the expression for it in a trivariate normal distribution.
  5. Explain the objectives of Principal Component Analysis.
  6. Explain the ‘Varimax’ criterion for factor rotation.
  7. Distinguish between ‘Agglomerative’ and ‘Divisive’ methods in clustering.
  8. Present the decomposition of the total sum of squares and cross products in one way MANOVA.
  9. State the test for significance of correlation coefficient in a bivariate normal population.
  10. Define Wishart distribution.

 

SECTION – B

Answer any FIVE Questions                                                                                              (5 x 8 = 40 marks)

 

  1. Find the mean vector and var-cov matrix of X = (X, Y) whose p.d.f. is

f (x , y) =

  1. Describe p-p plot and q-q plot and state how the multivariate normality assumption is verified.
  2. Let X = ~ Np (μ,Σ) and μ and Σ be correspondingly partitioned as  and ,  Xi be the ith component of X(1) and let β = σ(i)  where σ(i) is the ith row of . Derive an expression for multiple correlation coefficient between Xi and X(2).
  3. Derive the moment generating function of multivariate normal distribution.
  4. Derive the T2 test for hypothesis concerning the mean vector of a multivariate normal population using the likelihood ratio criterion.
  5. Define Principal Components and extract the same for a given random vector, stating the lemma on maximization of quadratic forms (without proof).
  6. Present Fisher’s method of discriminating two populations and derive the linear discriminant function. Explain the classification rule based on it.
  7. Describe Hierarchical clustering of objects and its algorithm giving figurative depiction of three linkage methods.

 

SECTION – C

Answer any TWO Questions:                                                                                           (2 x 20 = 40 marks)

 

  1. (a) Explain the ‘lowess’ curve enhancement of a scatter plot.

(b) Consider the partition in Q. No. (13). If X(1) and X(2) are uncorrelated, then show that their distributions are multivariate normal of appropriate dimensions. Also, establish the same result even when X(1) and X(2) are correlated.                                                                      (7+13)

 

  1. (a) Present the ‘Orthogonal Factor Model’ and develop the ideas of ‘communality’ and ‘specific variance’.

(b) Explain the ‘Principal Factor’ Method of Factor Analysis. Bring out the approach to ‘Reduction of factors’ and ‘Decision on number of factors’.                                   (10+10)

 

  1. (a) Carry out the ‘single linkage’ process for clustering six items whose distance matrix is given below (Dendrogram not required):

 

(b) Derive the  expression for ‘Expected Cost of Misclassification’ in  the case  of  two

populations and obtain the Minimum ECM Rule. Discuss the special cases of equal prior

probabilities and equal misclassification costs.                                                        (10+10)

 

  1. (a) Derive the MLEs of the parameters of multivariate normal distribution.

(b) Establish the independence of the sample mean vector and sample var-cov matrix of a sample from multivariate normal distribution.                                                         (10+10)

 

 

Go To Main Page

 

 

© Copyright Entrance India - Engineering and Medical Entrance Exams in India | Website Maintained by Firewall Firm - IT Monteur