Statistical inference

Khan, Shahjahan (2010) Statistical inference. In: Scientific data mining and knowledge discovery: principles and foundations. Springer-Verlag, Berlin, Germany, pp. 53-76. ISBN 978-3-642-02787-1


Often scientific information on various data generating processes are presented in the from of numerical and categorical data. Except for some very rare occasions, generally such data represent a small part of the population, or selected outcomes of any data generating process. Although, valuable and useful information is lurking in the array of scientific data, generally, they are unavailable to the users. Appropriate statistical methods are essential to reveal the hidden jewels in the mess of the row data. Exploratory data analysis methods are used to uncover such valuable characteristics of the observed data. Statistical inference provides techniques to make valid conclusions about the unknown characteristics or parameters of the population from which scientifically drawn sample data are selected. Usually, statistical inference includes estimation of population parameters as well as performing test of hypotheses on the parameters. However, prediction of future responses and determining the prediction distributions are also part of statistical inference. Both Classical or Frequentists and Bayesian approaches are used in statistical inference. The commonly used Classical approach is based on the sample data alone. In contrast, increasingly popular Bayesian approach uses prior distribution on the parameters along with the sample data to make inferences. The non-parametric and robust methods are also being used in situations where commonly used model assumptions are unsupported. In this chapter, we cover the philosophical and methodological aspects of both the Classical and Bayesian approaches. Moreover, some aspects of predictive inference are also included. In the absence of any evidence to support assumptions regarding the distribution of the underlying population, or if the variable is measured only in ordinal scale, non-parametric methods are used. Robust methods are employed to avoid any significant changes in the results due to deviations from the model assumptions. The aim is to provide an overview of the scientific domain of statistical inference without going in to minute details of any particular statistical method. This is done by considering several commonly used multivariate models, following both normal and non-normal, including elliptically contoured, distributions for the responses.

Statistics for USQ ePrint 6962
Statistics for this ePrint Item
Item Type: Book Chapter (Commonwealth Reporting Category B)
Refereed: Yes
Item Status: Live Archive
Additional Information: © Springer-Verlag, Berlin/Heidelberg 2010. Permanent restricted access to published version due to publisher copyright restrictions. Fulltext Reference: Khan, S. (2010). Statistical Inference. In Scientific Data Mining and Knowledge Discovery: Principles and Foundations, p.53-76, ed. by M. M. Gaber, Springer-Verlag, Berlin / Heidelberg.
Depositing User: Professor Shahjahan Khan
Faculty / Department / School: Historic - Faculty of Sciences - Department of Maths and Computing
Date Deposited: 22 Mar 2011 05:40
Last Modified: 30 Aug 2016 03:13
Uncontrolled Keywords: statistical inference; sampling
Fields of Research : 01 Mathematical Sciences > 0104 Statistics > 010401 Applied Statistics
08 Information and Computing Sciences > 0801 Artificial Intelligence and Image Processing > 080109 Pattern Recognition and Data Mining
01 Mathematical Sciences > 0104 Statistics > 010405 Statistical Theory
Socio-Economic Objective: E Expanding Knowledge > 97 Expanding Knowledge > 970101 Expanding Knowledge in the Mathematical Sciences
Identification Number or DOI: 10.1007/978-3-642-02788-8_3

Actions (login required)

View Item Archive Repository Staff Only