Title page for ETD etd-23098-6054


Type of Document Dissertation
Author Wang, Shin Cheng
Author's Email Address dswang@vt.edu
URN etd-23098-6054
Title Analysis of Zero-Heavy Data Using a Mixture Model Approach
Degree PhD
Department Statistics
Advisory Committee
Advisor Name Title
Smith, Eric P. Committee Chair
Arnold, Jesse C. Committee Member
Coakley, Clint W. Committee Member
Hinkelmann, Klaus H. Committee Member
Ye, Keying Committee Member
Keywords
  • Principal Component Analysis
  • Longitudinal Data
  • Inhibition Concentration
  • Generalized Estimating Equations
  • Chronic toxicity testing
  • Ceriodaphnia Dubia
  • Zero-inflated Poisson
Date of Defense 1998-03-18
Availability unrestricted
Abstract
The problem of high proportion of zeroes has long been an interest in data analysis and modeling, however, there are no unique solutions to this problem. The solution to the individual problem really depends on its particular situation and the design of the experiment. For example, different biological, chemical, or physical processes may follow different distributions and behave differently. Different mechanisms may generate the zeroes and require different modeling approaches. So it would be quite impossible and inflexible to come up with a unique or a general solution.

In this dissertation, I focus on cases where zeroes are produced by mechanisms that create distinct sub-populations of zeroes. The dissertation is motivated from problems of chronic toxicity testing which has a data set that contains a high proportion of zeroes. The analysis of chronic test data is complicated because there are two different sources of zeroes: mortality and non-reproduction in the data. So researchers have to separate zeroes from mortality and fecundity. The use of mixture model approach which combines the two mechanisms to model the data here is appropriate because it can incorporate the mortality kind of extra zeroes.

A zero inflated Poisson (ZIP) model is used for modeling the fecundity in Ceriodaphnia dubia toxicity test. A generalized estimating equation (GEE) based ZIP model is developed to handle longitudinal data with zeroes due to mortality. A joint estimate of inhibition concentration (ICx) is also developed as potency estimation based on the mixture model approach. It is found that the ZIP model would perform better than the regular Poisson model if the mortality is high. This kind of toxicity testing also involves longitudinal data where the same subject is measured for a period of seven days. The GEE model allows the flexiblity to incorporate the extra zeroes and a correlation structure among the repeated measures.

The problem of zero-heavy data also exists in environmental studies in which the growth or reproduction rates of multi-species are measured. This gives rise to multivariate data. Since the inter-relationships between different species are imbedded in the correlation structure, the study of the information in the correlation of the variables, which is often accessed through principal component analysis, is one of the major interests in multi-variate data. In the case where mortality influences the variables of interests, but mortality is not the subject of interests, the use of the mixture approach can be applied to recover the information of the correlation structure. In order to investigate the effect of zeroes on multi-variate data, simulation studies on principal component analysis are performed. A method that recovers the information of the correlation structure is also presented.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  abs.pdf 31.43 Kb 00:00:08 00:00:04 00:00:03 00:00:01 < 00:00:01
  appendix.pdf 39.00 Kb 00:00:10 00:00:05 00:00:04 00:00:02 < 00:00:01
  Bibliography.pdf 13.39 Kb 00:00:03 00:00:01 00:00:01 < 00:00:01 < 00:00:01
  ch1.pdf 11.09 Kb 00:00:03 00:00:01 00:00:01 < 00:00:01 < 00:00:01
  ch2.pdf 71.97 Kb 00:00:19 00:00:10 00:00:08 00:00:04 < 00:00:01
  ch3.pdf 383.21 Kb 00:01:46 00:00:54 00:00:47 00:00:23 00:00:02
  ch4.pdf 88.31 Kb 00:00:24 00:00:12 00:00:11 00:00:05 < 00:00:01
  ch5.pdf 107.83 Kb 00:00:29 00:00:15 00:00:13 00:00:06 < 00:00:01
  ch6.pdf 9.38 Kb 00:00:02 00:00:01 00:00:01 < 00:00:01 < 00:00:01
  Fig3-1.pdf 3.79 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-10.pdf 3.17 Kb < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-2.pdf 3.75 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-3.pdf 4.34 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-4.pdf 4.06 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-5.pdf 3.75 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-6.pdf 3.88 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-7.pdf 3.87 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-8.pdf 2.97 Kb < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig3-9.pdf 3.16 Kb < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  Fig4-1.pdf 3.89 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01
  vita.pdf 3.89 Kb 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01 < 00:00:01

Browse All Available ETDs by ( Author | Department )

dla home
etds imagebase journals news ereserve special collections
virgnia tech home contact dla university libraries

If you have questions or technical problems, please Contact DLA.