## The Application of Exploratory Data Analysis in Marketing: An Introduction to Selected Methods

In statistics, exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model may or may not be used, but EDA is primarily for seeing what the data can tell us beyond the formal modeling or hypothesis testing task. Exploratory data analysis was promoted by John Tukey to encourage statisticians to explore the data and possibly formulate hypotheses that could lead to new data collection and experiments. EDA differs from initial data analysis (IDA) [1], which focuses more narrowly on checking the assumptions required for model fitting and hypothesis testing, handling missing values, and transforming variables as needed. Tukey defined data analysis as: "Procedures for analyzing data, techniques for interpreting the results of such procedures, ways of planning the gathering of data to make its analysis easier, more precise or more accurate, and all the machinery and results of mathematical statistics which apply to analyzing data."
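To make "summarizing the main characteristics" of a data set concrete, the sketch below computes a five-number summary and flags points outside the usual 1.5 × IQR fences, two devices commonly associated with Tukey's exploratory approach. It is only an illustration: the function names `five_number_summary` and `tukey_outliers` and the `weekly_sales` figures are hypothetical and do not come from the text, and the quartiles use the "inclusive" convention of Python's standard library, which is one of several conventions in use.

```python
import statistics


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) for a sequence of at least two numbers."""
    data = sorted(values)
    # With n=4, statistics.quantiles returns the three quartile cut points.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return data[0], q1, median, q3, data[-1]


def tukey_outliers(values, k=1.5):
    """Return the values lying outside the fences Q1 - k*IQR and Q3 + k*IQR."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical weekly sales figures, invented purely for illustration.
weekly_sales = [12, 15, 14, 10, 18, 20, 16, 13, 95]

summary = five_number_summary(weekly_sales)
outliers = tukey_outliers(weekly_sales)

print("five-number summary:", summary)
print("points outside the fences:", outliers)
```

A summary like this is a starting point for exploration rather than a test: a flagged value is a prompt to look at the data more closely, not evidence that the value is wrong.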

This paper introduces the family of techniques called exploratory data analysis (EDA). Unlike classical confirmatory statistics, which relies upon strict distributional assumptions, parameter estimation, and hypothesis testing, EDA adopts an informal method of data examination designed to explore the structure of the data. Three representative EDA techniques are introduced, and applications to marketing data sets are presented.

As a discipline, statistics has mostly developed in the past century. Probability theory—the mathematical foundation for statistics—was developed in the 17th to 19th centuries based on work by Thomas Bayes, Pierre-Simon Laplace, and Carl Gauss. In contrast to the purely theoretical nature of probability, statistics is an applied science concerned with analysis and modeling of data.

Statistics Problems with Statistics, discusses probability and statistics from the viewpoint of resampling. Statistics is the process of converting data into information that people can use. For example, in Milwaukee County there is a 5-year vesting period.

