MEL761: Statistics For Decision Making: About The Course
MEL761: Statistics For Decision Making: About The Course
MEL761: Statistics For Decision Making: About The Course
About the course Introduction Need Descriptive and Inferential Statistics Examples Various Problem Areas
Dr S G Deshmukh
Mechanical Department Indian Institute of Technology
1
Course coverage
Introduction to statistics: definitions and terminology; data classification; data collection techniques, various scales for measurement and their relevance Descriptive statistics: frequency distributions; measures of central tendency, Variation, Probability: basic concepts; multiplication and addition rules, Bayes rule, Discrete probability distributions: basic concepts; Binomial , Poisson and other discrete distributions Continuous probability distributions :Exponential and other distributions: Normal probability distributions: introductory concepts; the standard normal Distribution; central limit theorem, applications of normal distributions, approximations to discrete probability distributions Correlation and Regression analysis: overview of correlation; linear regression Type I and Type II errors, Confidence intervals: confidence intervals for the mean (large samples and small samples) and for population proportions Analysis of Variance and Design of Experiments, Non-parametric tests Case studies and applications to managerial decision making
3
Evaluation scheme
Surprise Quizzes (n numbers) Minors(2) Major Lab work /assignments Mini-Project Statistics application review 5% 30 % 35% 15 % 10 % 5%
Learning Objectives
Define statistics Become aware of a wide range of applications of statistics in business for decision making Differentiate between descriptive and inferential statistics Formulate and test various sets of hypotheses Understand implications of design of experiments
Statistics..
Plays an important role in many facets of human endeavour Occurs remarkably frequently in our everyday lives Is often incorrectly thought of as just a collection of data, graphs and diagrams
Statistics in Business
Accounting auditing and cost estimation Economics regional, national, and international economic performance Finance investments and portfolio management Management human resources, compensation, and quality management Management Information Systems (ERP): performance of systems which gather, summarize, and disseminate information to various managerial levels Marketing market analysis and consumer research International Business market and demographic analysis
7
What is Statistics?
Science of gathering, analyzing, interpreting, and presenting data Branch of mathematics One page in Courses of study? Facts and figures Measurement taken on a sample Type of distribution being used to analyze data Statistics is the scientific method that enables us to make decisions as responsibly as possible. 8
Statistics
The science of data to answer research questions
Formulate a research question(s) (hypothesis) Collect data Analyze and summarize data Draw conclusions to answer research questions
Statistical Inference
10
3. Making the actual decisions as sensibly as possible on the basis of the available evidence.
4. Perceiving the risks entailed in the particular decision made, and evaluating the corresponding risks of alternative actions.
11
Example
Polio Vaccine Results of the Experiment
57 142
12
Census gathering data from the entire population Sample a portion of the whole
a subset of the population a part of the population from which we actually collect information, used to draw conclusions about the whole (statistical inference
15
2. Statistical inference
16
Descriptive statistics..
Encompasses the following:
Graphical or pictorial display Condensation of large masses of data into a form such as tables Preparation of summary measures to give a concise description of complex information (e.g. an average figure) Exhibition of patterns that may be found in sets of information
18
Inferential Statistics..
Especially relates to:
Determining whether characteristics of a situation are unusual or if they have happened by chance Estimating values of numerical quantities and determining the reliability of those estimates Using past occurrences to attempt to predict the future
19
20
22
23
to estimate
Sample x (statistic)
(parameter )
25
Example: Ethnicity
1 for African-American 2 for Anglo-American 3 for Hispanic-American
26
27
1 6 2 4 3 5
f i n i s h
28
Ordinal Data
Faculty should receive preferential treatment for parking space in new Bharati Telecom building.
Strongly Agree
Agree
Neutral
Disagree
Strongly Disagree
29
Examples: Height, Weight, and Volume Example: Monetary Variables, such as Profit and Loss, Revenues, and Expenses Example: Financial ratios, such as P/E Ratio, Inventory Turnover
31
32
Statistical Methods
Nonparametric Nonparametric Parametric
Ratio
Parametric
33
34
Before presentation always check: the source of the data that the data has been accurately transcribed the figures are relevant to the problem
35
36
37
38
100 80 60 40 20 0 1st Qtr 2nd Qtr 3rd Qtr 4th Qtr East
West
North
40
Frequency distributions
Frequency tables
Observation Table Frequency Cumulative Frequency 13 13 18 31 25 56 15 71 9 80
41
Frequency diagrams
Frequency 30 25 20 15 10 5 0 < 20 <40 <60 <80 <100 Frequency
Cumulative Frequency
<40
<60
<80
<100
42
Grouped data
have been organized into a frequency distribution
43
50
52 30 55
40
28 36 30
32
23 32 58
31
35 26 64
40
25 50 52
49
61 74
33
31 37
43
30 29
46
40 43
32
60 54
44
Data Range
42 30 53 50 52 30 55 49 61 74 26 58 40 40 28 36 30 33 31 37 32 37 30 32 23 32 58 43 30 29 34 50 47 31 35 26 64 46 40 43 57 30 49 40 25 50 52 32 60 54
Smallest
Largest
46
47
Class Midpoint
beginning class endpoint + ending class endpoint Class Midpoint = 2 30 + 40 = 2 = 35
1 Class Midpoint = class beginning point + class width 2 1 = 30 + 10 2 = 35
48
Relative Frequency
Class Interval 20-under 30 30-under 40 40-under 50 50-under 60 60-under 70 70-under 80 Total Relative Frequency Frequency 6 .12 6 18 .36 50 11 .22 18 50 11 .22 3 .06 1 .02 50 1.00
49
Cumulative Frequency
Cumulative Class Interval 20-under 30 30-under 40 40-under 50 50-under 60 60-under 70 70-under 80 Total
Frequency 6 18 11 11 3 1 50
Frequency 6 18 + 6 24 11 + 24 35 46 49 50
50
53
Histogram
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
20 Frequency 0 10
10 20 30 40 50 60 70 80 Years
54
Histogram Construction
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
Frequency
10
20
10 20 30 40 50 60 70 80 Years
55
Frequency Polygon
Class Interval Frequency 20-under 30 6 30-under 40 18 40-under 50 11 50-under 60 11 60-under 70 3 70-under 80 1
20 Frequency 0 10
10 20 30 40 50 60 70 80 Years
56
Ogive
Cumulative Class Interval Frequency 20-under 30 6 30-under 40 24 40-under 50 35 50-under 60 46 60-under 70 49 70-under 80 50
Frequency
0
0
20
40
60
10
20
30
40 Years
50
60
70
80
57
1.00 0.90 0.80 0.70 0.60 0.50 0.40 0.30 0.20 0.10 0.00 0 10 20 30 40 Years
58
50
60
70
80
Complaints by Passengers
COMPLAINT Stations, etc. Train Performance Equipment Personnel Schedules, etc. Total NUMBER 28,000 14,700 10,500 9,800 7,000 PROPORTION .40 .21 .15 .14 .10 DEGREES 144.0 75.6 50.4 50.6 36.0
70,000
1.00
360.0
59
Complaints by Passengers
Personnel 14% Equipment 15% Schedules, Etc. 10%
60
Company A B
D
E Totals
34,099
12,747 920,190
61
39% 39%
62
B
C D E
354,936
160,997 34,099 12,747 920,190
.386
.175
139
63 13 5 360
63
Totals
Pareto Chart
100 90 100% 90%
80
70
80%
70% 60%
Frequency
60
50
40 30 20 10 0 Poor Wiring Short in Coil Defective Plug Other
50%
40% 30% 20% 10% 0%
64
Scatter Plot
Registered Vehicles (1000's) Gasoline Sales (1000's of Gallons)
Gasoline Sales
200
5 15 9 15 7
60 120 90
100
140 60
10 15 Registered Vehicles
20
65