The words you are searching are inside this book. To get more targeted content, please make full-text search by clicking here.

This e-book describes an Introduction to Statistics to helps students who taking Engineering Mathematics subjects especially in Malaysian Polytechnic. The concept of this e­book is interactive in the terms of infographic elements which included theory notes, clear examples with the guided answer and intensive practice. Hopefully, with this e-book students will gain a better understanding of theories, concepts and methods used in solving problems for statistic topics.

Discover the best professional documents and content resources in AnyFlip Document Base.
Search
Published by Penerbit PSIS, 2022-11-03 05:06:35

EBOOK STATISTICS

This e-book describes an Introduction to Statistics to helps students who taking Engineering Mathematics subjects especially in Malaysian Polytechnic. The concept of this e­book is interactive in the terms of infographic elements which included theory notes, clear examples with the guided answer and intensive practice. Hopefully, with this e-book students will gain a better understanding of theories, concepts and methods used in solving problems for statistic topics.

Keywords: EBOOK STATISTICS

POLYTECHNIC SERIES

INT RODUCT ION TO

FIRST EDITION

NORSHAFIZA BINTI MOHAMAD KHAIRUDDIN



POLYTECHNIC SERIES

INT RODUCT ION TO

FIRST EDITION

NORSHAFIZA BINTI MOHAMAD KHAIRUDDIN

POLITEKNIK SULTAN IDRIS SHAH
KEMENTERIAN PENGAJIAN TINGGI

Published by:
Politeknik Sultan Idris Shah
Sg.Lang, 45100 Sg. Ayer Tawar

Selangor
No. Tel : 03 3280 6200
No. Fax : 03 3280 6400
Laman web : http://psis.mypolycc.edu.my





ABSTRACT iiii
PREFACE iv
TABLE OF CONTENTS v
SUBTOPIC 1.1 PAGE 1

SUBTOPIC 1.2 PAGE 8
PAGE 21
SUBTOPIC 1.3 PAGE 55

SUBTOPIC 1.4
REFERENCE

SUBTOPIC 1.1

WHAT IS
STATISTICS?

Statistics is a branch of applied mathematics
that involves the collecting, analyzing,

interpreting and presenting empirical data

The two major areas of statistics are known as
descriptive statistics, which describes the

properties of sample and population data, and
inferential statistics, which uses those
properties to test hypotheses and draw
conclusions.

Sample survey methods are used to collect data
from observational studies, and experimental
design method are used to collect data from
experimental studies.

2

WHAT IS DATA?

Data is a collection of information
assembled by observation, research
measurements, or analysis. Data may be
classified as either quantitative or

qualitative.

Qualitative data represent some characteristics
or attribute. These descriptions that may be
observed but cannot be computed.

Quantitative data represent numerical value.
These can be numerically computed and
calculations can be performed.

3

TYPES OF DATA

PRIMARY DATA

• Data collected for the first time

SECONDARY DATA

• Data that is sourced by someone
other than the user

DISCRETE DATA

• Data that can take only specific
value

CONTINUOUS DATA

• Data that can take values from a
given range

4

WHAT IS
UNGROUPED

DATA?

Ungrouped data is the data that
obtained from direct observation that
gather from an experiment or study. The

data is called as raw data.

These data is not sorted into categories,
classified or otherwise grouped.

An ungrouped set of data is basically a
list of numbers.

5

WHAT IS
GROUPED

DATA?

Grouped data is the type of data
which is classified into groups after

the collection. These data is
categorized into various groups and a

table is created.

The purpose of the table is to show
the data points occurring in each
group.

6

CONCEPT MAP

STATISTICS

Grouped Data Ungrouped Data

Measures of central Measures of
tendency dispersion

Mean Mode Mean Standard
deviation deviation

Median Variance
7

SUBTOPIC 1.2

A table which the values for a sample are grouped into
the classes and the frequency of the observation into
each class are recorded.

Step 1: Calculate the number of classes by using
Sturge's Formula, k = 1+3.3 log n

Step 2: Find the size of class

Class size ≥ highest value - smallest value

number of classes

Step 3: Create a table with three column [class interval,
tally marks, frequency] and
number of row = number of classes.

Step 4: Make a tick mark in the appropriate row for each
data. Count the tally marks to determine the
frequency.

9

EXAMPLE 1

The weekly wages, in RM, of a group of
28 workers in a factory are as follow.
Construct a frequency distribution table
for this data.

10

SOLUTION

STEP 1: Calculate the number of classes by using
Sturge's Formula:

k = 1 + 3.3 log (28)
= 5.775 ~ 6

Round to the nearest integer

STEP 2: Find size of class:

• Highest value = 187
• Smallest value = 153

class size 187 153
6

= 5.667 ~ 6

Round up to the next integer

11

Create a table with 3 column and number of
STEP 3:

rows = number of classes

153 - 158
159 - 164
165 - 170
171 - 176
177 - 182
183 - 188

12

STEP 4: Make a tick mark in appropriate row for
each data and count the tally marks to
determine the frequency

153 - 158 / 1
159 - 164 //// 4
165 - 170 //// //// 9
171 - 176 //// /// 8
177 - 182 //// 4
183 - 188 // 2

13

EXAMPLE 2

The data below shows the time
taken (in seconds) for 40
students to complete a Quiz.
Construct a frequency distribution
table for this data.

40 49 43 45 42 43 46 36
42 36 37 44 39 41 31 45
38 48 44 51 38 53 35 32
30 43 41 52 46 43 50 40
39 41 48 47 32 52 47 42

14

SOLUTION

STEP 1: Calculate the number of classes by using
Sturge's Formula:

k = 1 + 3.3 log (40)
= 6.287 ~ 6

Round to the nearest integer

STEP 2: Find size of class:

• Highest value = 53
• Smallest value = 30

class size 53 30
6

= 3.833 ~ 4

Round up to the next integer

15

STEP 3: Create a table with 3 column and number of
rows = number of classes

30 - 33
34 - 37
38 - 41
42 - 45
46 - 49
50 - 53

16

STEP 4: Make a tick mark in appropriate row for
each data and count the tally marks to
determine the frequency

30 - 33 //// 4
34 - 37 //// 4
38 - 41 //// /// 9
42 - 45 //// //// / 11
46 - 49 //// // 7
50 - 53 //// 5

17

LET'S TRY

QUESTION 1

The data below shows the marks obtained by
50 students in a Mathematics Test. Construct a
frequency distribution table for this data with
number of classes is 7.

72 85 51 63 75 90 35 39 53 63
71 82 48 60 68 78 31 25 44 57
69 80 46 59 72 83 28 34 49 61
74 88 53 63 68 78 39 22 44 56
75 92 55 65 76 96 40 42 55 66

18

LET'S TRY

QUESTION 2

Construct a frequency distribution table
for the following data, starting with the
first class of 502 - 536.

592 534 663 631 541
622 708 541 528 546
631 599 637 502 592
618 546 641 603 650
641 622 547 586 567
644 578 483 578 619
695 645 578 644 531
536 644 689 557 612

19

LET'S TRY

QUESTION 3

The electricity bills for 50 houses are recorded
as in the table below. Construct a frequency
distribution table for the data with a class
interval of 10.

35 86 71 99 88 43 54 48 68 77
93 70 84 78 68 63 47 56 66 57
80 52 67 59 60 79 62 55 52 90
64 87 65 64 50 71 72 64 71 67
40 56 74 69 97 67 81 77 77 57

20

SUBTOPIC 1.3

WHAT IS
MEASURES OF

CENTRAL
TENDENCY?

A measures of central tendency is a summary
measure that attempts to describe a whole set of

data with a single value that represents the
middle or center of its distribution.

There are three main measures of central
tendency which is the mode, the median and the

mean. Each of these measures describes a
different indication of the typical or central value

in the distribution.

22

MEASURES OF CENTRAL TENDENCY

Is the most commonly
occurring value in a sequence

Is the middle value in a
sequence after arranged in

ascending order

Is the sum of the value in a
sequence divided by the
number of data

23

EXAMPLE 3

Find the mean, median
and mode for the
following data:

1, 3, 4, 9, 7, 2, 9

24

SOLUTION

1+3+4+9+7+2+9
ҧ = 7

=5

25

EXAMPLE 4

Find the mean, median
and mode for the
following data:

2, 5, 2, 3, 5, 8

26

SOLUTION

4+5+2+3+5+8
ҧ = 6

= 4.5

27

EXAMPLE 5

Given that the mean for series of
number m-2, m+4, 2m+5 and m+2
is 16. Find the value of m.

28

SOLUTION

σ
, ҧ =

16 = − 2 + + 4 + 2 + 5 + ( + 2)

4

64 = + + 2 + − 2 + 4 + 5 + 2

64 = 5 + 9

5 = 55

=

29

LET'S TRY

QUESTION 1

Find the mean, median and mode for the
following data:

a) 1, 4, 3, 2, 7, 8, 4, 8, 8
b) 2, 5, 8, 6, 6, 4, 5, 6

30

LET'S TRY

QUESTION 2

Mean for the series of number a-2, 2a+5
and a+1 is 8. Find the value of a.

QUESTION 3

The mean of a set positive integers; 3, 11,
9, 6, 15, 7, 2k and k+3 is 9. Find the
value of k.

31

MEASURES OF CENTRAL TENDENCY

σ
ഥ = σ

෥ = + ൗ −


ෝ = +
+

32

σ f = frequency
ഥ = σ x = midpoint

How to calculate midpoint?
Lower limit + Upper limit

Midpoint =
2

33

EXAMPLE 6

The table below show the waiting time
for 40 customers in a restaurant. Find
the mean of the data.

Time (minutes) Number of customers

1–4 3

5–8 8
9 – 12 13

13 – 16 9
17 – 20 4

21 – 24 2

25 – 28 1

34

SOLUTION

Step 2: Find fx

Step 1: Find the midpoint, x

Time Frequency Midpoint,
(minutes)
3 2.5 7.5
1–4 8 6.5 52
5–8 13 10.5 136.5
9 – 12 9 14.5 130.5
13 – 16 4 18.5 74
17 – 20 2 22.5 45
21 – 24 1 26.5 26.5
25 – 28

σ = 40 σ = 472

Step 3: σ 472
Substitute into ҧ = σ = 40

the formula = .

35

EXAMPLE 7

Calculate the mean from the
following data below.

Score Number of students

11 – 20 4
21 – 30 12
31 – 40 18
41 – 50 6

36

SOLUTION

Step 2: Find fx

Step 1: Find the midpoint, x

Score
4 62
11 – 20 12 15.5 306
21 – 30 18 25.5 639
31 – 40 6 35.5 273
41 – 50 45.5
σ = 40 σ = 1280

Step 3: Substitute into the formula
σ 1280

ҧ = σ = 40
=

37

LET'S TRY

QUESTION 1

The table shows the ages of the participants
of a badminton tournament. Calculate the
mean of the age of the participants.

Age (years) Number of participants

45 - 48 16
49 - 52 20
53 - 56 9
57 - 60 5
61 - 64 2

38

LET'S TRY

QUESTION 2

The table shows the lengths of fish in a pond.
Find the mean of the length of the fish.

Length (cm) Number of fish
1-5 5
6 - 10 16
11 - 15 12
7
16 - 20

39

෥ = + ൗ −


L = lower boundary median class
n = number of data
F = cumulative frequency before

median class
fm = frequency median class
c = size of class

40

How to calculate lower boundary?
The Lower boundary of each class is
calculated by subtracting half of the gap
value = 0.5 from the class lower limit.

How to calculate cumulative frequency?
The cumulative frequency is calculated by
adding each frequency from a frequency
distribution table

How to find median class?
The median class is the class interval
in which the median lies.

41

EXAMPLE 8

The table below shows the mass
of a group of students in a class.
Calculate the median of the data.

Mass (kg) Number of students
30 – 39 6
40 – 49 12
50 – 59 17
60 – 69 5
70 – 79 2

42

SOLUTION

Mass (kg) Number of students Cumulative
frequency
30 – 39 6
40 – 49 12 6
50 – 59 17 18
60 – 69 5 35
70 – 79 2 40
42

Therefore, ෥ = + ൗ −


෤ = 49.5 + 42ൗ2 − 18 (10)
17

= .

43


Click to View FlipBook Version