Statistical inferences

1) (9 points) Below are the heights of children and their parents.

Gender   Mother’s Height   Father’s Height   Offspring’s Height
M        66                70                62,5
M        58                69                64,6
M        68                71                73,9
M        64                68                67,1
M        62                66                64,4
M        66                74                71,1
M        63                73                71
M        65                69                69,3
M        64                67                64,9
M        64                68                68,1
M        62                72                66,5
M        62                72                66,5
M        67                68                70,3
M        63                71                67,5
M        66                67                68,5
M        65                71                71,9
M        71                75                67,8
F        63                64                58,6
F        64                67                65,3
F        60                72                61
F        65                72                65,4
F        67                72                67,4
F        59                67                60,9
F        60                71                63,1
F        58                66                60
F        72                75                71,1
F        63                69                62,2
F        67                70                67,2
F        62                69                63,4
F        63                66                62,2
F        64                76                64,7
F        63                69                59,6
F        64                68                61
F        60                66                64
F        65                68                65,4

a) Calculate the mean and median of the mother’s height.

mean 63,85714   median 64

b) Calculate the mean and median of the father’s height.

mean 69,65714   median 69

c) Calculate the mean and median of the offspring’s height.

mean 65,66857   median 65,4
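
As a cross-check on parts a) to c), the three columns can be passed to Python's standard `statistics` module. This is only a sketch: the variable names are illustrative, and the values are typed with decimal points where the worksheet uses decimal commas.

```python
from statistics import mean, median

# Heights copied from the Problem 1 table, one list per column.
mother = [66, 58, 68, 64, 62, 66, 63, 65, 64, 64, 62, 62, 67, 63, 66, 65, 71, 63,
          64, 60, 65, 67, 59, 60, 58, 72, 63, 67, 62, 63, 64, 63, 64, 60, 65]
father = [70, 69, 71, 68, 66, 74, 73, 69, 67, 68, 72, 72, 68, 71, 67, 71, 75, 64,
          67, 72, 72, 72, 67, 71, 66, 75, 69, 70, 69, 66, 76, 69, 68, 66, 68]
offspring = [62.5, 64.6, 73.9, 67.1, 64.4, 71.1, 71, 69.3, 64.9, 68.1, 66.5, 66.5,
             70.3, 67.5, 68.5, 71.9, 67.8, 58.6, 65.3, 61, 65.4, 67.4, 60.9, 63.1,
             60, 71.1, 62.2, 67.2, 63.4, 62.2, 64.7, 59.6, 61, 64, 65.4]

# Print the mean (sum / count) and the median (middle value after sorting).
for name, values in [("mother", mother), ("father", father), ("offspring", offspring)]:
    print(f"{name:9s} mean = {mean(values):.5f}  median = {median(values)}")
```

With 35 values in each column, the median is the 18th value of the sorted list.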

2) (9 points) Below are the heights of children and their parents.

Gender   Mother’s Height   Father’s Height   Offspring’s Height
M        66                64                69,1
M        64                62                67,4
M        69                66                67,5
F        67                65                64,7
F        69                62                68,4

a) Calculate the mean and median of the mother’s height.

mean 67   median 67

b) Calculate the mean and median of the father’s height.

mean 63,8   median 64

c) Calculate the mean and median of the offspring’s height.

mean 67,42   median 67,5

3) (4 points) Based on your calculations what can you conclude about the data sets?

4) (15 points) Below are the high temperatures of January 2016.

47 48 54 38 34 40 43 45 50 62
41 47 32 53 50 53 38 24 28 31
34 29 29 35 42 52 48 40 43 42
57

a) Construct a grouped frequency distribution for the data set using 7 classes.

Calculate the relative frequencies of each interval.

min                                     24
max                                     62
range                                   38
no. of classes                          7
class width (range / no. of classes)    5,42857143
class width (rounded up)                6

lower limit   upper limit   lower boundary   upper boundary   class mark   frequency
24            29            23,5             29,5             26,5         4
30            35            29,5             35,5             32,5         5
36            41            35,5             41,5             38,5         5
42            47            41,5             47,5             44,5         7
48            53            47,5             53,5             50,5         7
54            59            53,5             59,5             56,5         2
60            65            59,5             65,5             62,5         1
total                                                                      31
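
The construction in part a) can be sketched in Python as a check on the hand-built table. The function name `grouped_frequency` and its row layout are illustrative choices, and whole-number data is assumed. The sketch also computes the relative frequencies that part a) asks for.

```python
jan_2016 = [47, 48, 54, 38, 34, 40, 43, 45, 50, 62,
            41, 47, 32, 53, 50, 53, 38, 24, 28, 31,
            34, 29, 29, 35, 42, 52, 48, 40, 43, 42, 57]

def grouped_frequency(data, n_classes):
    """Return rows of (lower limit, upper limit, class mark, frequency, relative frequency).

    Hypothetical helper: assumes whole-number data and starts the first class at the minimum.
    """
    low, high = min(data), max(data)
    # Smallest whole-number width that lets n_classes classes cover min..max.
    # It equals range / n_classes rounded up unless that quotient is already whole.
    width = (high - low) // n_classes + 1
    rows = []
    for k in range(n_classes):
        lower = low + k * width
        upper = lower + width - 1
        freq = sum(lower <= x <= upper for x in data)
        rows.append((lower, upper, (lower + upper) / 2, freq, freq / len(data)))
    return rows

for lower, upper, mark, freq, rel in grouped_frequency(jan_2016, 7):
    print(f"{lower}-{upper}  class mark {mark:4.1f}  frequency {freq}  relative {rel:.3f}")
```

The class boundaries are not computed here; for whole-number data they are the lower limit minus 0,5 and the upper limit plus 0,5.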

b) Find the mean of the data set.

42,22581

c) Find the standard deviation of the data set.

9,429421

d) Find the coefficient of variation of the data set.

0,223309
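
The answers to parts b) to d) can be reproduced with the standard `statistics` module. A minimal sketch, assuming the sample standard deviation (dividing by n - 1) is intended, since that is the version that matches the 9,429421 given above; the variable names are illustrative.

```python
from statistics import mean, stdev

jan_2016 = [47, 48, 54, 38, 34, 40, 43, 45, 50, 62,
            41, 47, 32, 53, 50, 53, 38, 24, 28, 31,
            34, 29, 29, 35, 42, 52, 48, 40, 43, 42, 57]

x_bar = mean(jan_2016)   # b) mean
s = stdev(jan_2016)      # c) sample standard deviation (divides by n - 1)
cv = s / x_bar           # d) coefficient of variation, as a fraction of the mean

print(f"mean = {x_bar:.5f}")
print(f"standard deviation = {s:.6f}")
print(f"coefficient of variation = {cv:.6f}")
```

Multiplying the coefficient of variation by 100 expresses it as a percentage.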

e) Find the z-score of the highest temperature. Find the z-score of the lowest temperature.

Are any of these temps within usual parameters?
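
Part e) can be approached with the formula z = (x - mean) / standard deviation. The sketch below assumes the sample standard deviation from part c) and the common textbook convention that values with z between -2 and 2 count as usual; the worksheet does not state its own cutoff, so the threshold is an assumption, as is the helper name `z_score`.

```python
from statistics import mean, stdev

jan_2016 = [47, 48, 54, 38, 34, 40, 43, 45, 50, 62,
            41, 47, 32, 53, 50, 53, 38, 24, 28, 31,
            34, 29, 29, 35, 42, 52, 48, 40, 43, 42, 57]

def z_score(x, data):
    """Number of sample standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean(data)) / stdev(data)

USUAL_LIMIT = 2  # assumed cutoff: |z| <= 2 is treated as usual

for label, value in [("highest", max(jan_2016)), ("lowest", min(jan_2016))]:
    z = z_score(value, jan_2016)
    verdict = "usual" if abs(z) <= USUAL_LIMIT else "unusual"
    print(f"{label} temperature {value}: z = {z:.2f} ({verdict})")
```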

5) (15 points) Below are the high temperatures of January 2017.

56 25 49 52 47 46 29 46 50 39
46 41 53 49 56 57 57 60 61 37
72 53 61 35 65 49 45 27 43 50
49

a) Construct a grouped frequency distribution for the data set using 7 classes.

Calculate the relative frequencies of each interval.

min                                     25
max                                     72
range                                   47
no. of classes                          7
class width (range / no. of classes)    6,71428571
class width (rounded up)                7

lower limit   upper limit   lower boundary   upper boundary   class mark   frequency
25            31            24,5             31,5             28           3
32            38            31,5             38,5             35           2
39            45            38,5             45,5             42           4
46            52            45,5             52,5             49           11
53            59            52,5             59,5             56           6
60            66            59,5             66,5             63           4
67            73            66,5             73,5             70           1
total                                                                      31

b) Find the mean of the data set.

48,54839

c) Find the standard deviation of the data set.

10,80691

d) Find the coefficient of variation of the data set.

0,222601

e) Find the z-score of the highest temperature. Find the z-score of the lowest temperature.

Are any of these temps within usual parameters?

6) (4 points) Compare your analysis of each data set. Which in your estimation

has been the warmer January? Which data set is more variable?

What factors do you believe have caused this warming trend?
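
For the first two questions in problem 6, the summary statistics of the two Januaries can be set side by side. This is a sketch, not a full answer: `summarize` is a hypothetical helper, the sample standard deviation is assumed as in problems 4 and 5, and the code says nothing about what caused any difference.

```python
from statistics import mean, stdev

jan_2016 = [47, 48, 54, 38, 34, 40, 43, 45, 50, 62,
            41, 47, 32, 53, 50, 53, 38, 24, 28, 31,
            34, 29, 29, 35, 42, 52, 48, 40, 43, 42, 57]
jan_2017 = [56, 25, 49, 52, 47, 46, 29, 46, 50, 39,
            46, 41, 53, 49, 56, 57, 57, 60, 61, 37,
            72, 53, 61, 35, 65, 49, 45, 27, 43, 50, 49]

def summarize(data):
    """Return (mean, sample standard deviation, coefficient of variation)."""
    x_bar = mean(data)
    s = stdev(data)
    return x_bar, s, s / x_bar

for year, data in [(2016, jan_2016), (2017, jan_2017)]:
    x_bar, s, cv = summarize(data)
    print(f"January {year}: mean = {x_bar:.2f}  std dev = {s:.2f}  CV = {cv:.4f}")
```

The mean answers the "warmer" question. For variability, the standard deviation measures spread in degrees, while the coefficient of variation measures spread relative to the mean, so the two can rank the months differently.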

7) (15 points) Below are the high temperatures of February 2016.

61 53 57 58 44 44 46 46 40 39
30 31 28 26 31 53 49 40 42 65
57 53 45 65 60 40 47 65 65

a) Construct a grouped frequency distribution for the data set using 7 classes.

Calculate the relative frequencies of each interval.

min                                     26
max                                     65
range                                   39
no. of classes                          7
class width (range / no. of classes)    5,57142857
class width (rounded up)                6

lower limit   upper limit   lower boundary   upper boundary   class mark   frequency
26            31            25,5             31,5             28,5         5
32            37            31,5             37,5             34,5         0
38            43            37,5             43,5             40,5         5
44            49            43,5             49,5             46,5         7
50            55            49,5             55,5             52,5         3
56            61            55,5             61,5             58,5         5
62            67            61,5             67,5             64,5         4
total                                                                      29

b) Find the mean of the data set.

47,58621

c) Find the standard deviation of the data set.

11,87289

d) Find the coefficient of variation of the data set.

0,249503

e) Find the z-score of the highest temperature. Find the z-score of the lowest temperature.

Are any of these temps within usual parameters?

8) (15 points) Below are the high temperatures of February 2017.

51 50 40 41 54 61 73 74 53 39
53 55 47 46 55 42 51 68 71 68
58 63 75 82 71 52 59 67

a) Construct a grouped frequency distribution for the data set using 7 classes.

Calculate the relative frequencies of each interval.

min                                     39
max                                     82
range                                   43
no. of classes                          7
class width (range / no. of classes)    6,14285714
class width (rounded up)                7

lower limit   upper limit   lower boundary   upper boundary   class mark   frequency
39            45            38,5             45,5             42           4
46            52            45,5             52,5             49           6
53            59            52,5             59,5             56           7
60            66            59,5             66,5             63           2
67            73            66,5             73,5             70           6
74            80            73,5             80,5             77           2
81            87            80,5             87,5             84           1
total                                                                      28

b) Find the mean of the data set.

57,82143

c) Find the standard deviation of the data set.

11,82607

d) Find the coefficient of variation of the data set.

0,204527

e) Find the z-score of the highest temperature. Find the z-score of the lowest temperature.

Are any of these temps within usual parameters?

9) (4 points) Compare your analysis of each data set. Which in your estimation

has been the warmer February? Which data set is more variable?

What factors do you believe have caused this warming trend?

10) (10 points) Median Household Income By State

State                  Income
Alabama                43.623
Alaska                 72.515
Arizona                50.255
Arkansas               41.371
California             61.818
Colorado               60.629
Connecticut            70.331
Delaware               60.509
District of Columbia   70.848
Florida                47.507
Georgia                49.620
Hawaii                 69.515
Idaho                  47.583
Illinois               57.574
Indiana                49.255
Iowa                   53.183
Kansas                 52.205
Kentucky               43.740
Louisiana              45.047
Maine                  49.331
Maryland               74.551
Massachusetts          68.563
Michigan               49.576
Minnesota              61.492
Mississippi            39.665
Missouri               48.173
Montana                47.169
Nebraska               52.997
Nevada                 51.847
New Hampshire          66.779
New Jersey             72.093
New Mexico             44.963
New York               59.269
North Carolina         46.868
North Dakota           57.181
Ohio                   49.429
Oklahoma               46.879
Oregon                 51.243
Pennsylvania           53.599
Puerto Rico            19.350
Rhode Island           56.852
South Carolina         45.483
South Dakota           50.957
Tennessee              45.219

a) Construct a grouped frequency distribution using eight classes.

min                                     19.350
max                                     74.551
range                                   55.201
no. of classes                          8
class width (range / no. of classes)    6.900,125
class width (rounded up)                6.901

lower limit   upper limit   lower boundary   upper boundary   class mark
19.350        26.250        19.349,5         26.250,5         22.800
26.251        33.151        26.250,5         33.151,5         29.701
33.152        40.052        33.151,5         40.052,5         36.602
40.053        46.953        40.052,5         46.953,5         43.503
46.954        53.854        46.953,5         53.854,5         50.404
53.855        60.755        53.854,5         60.755,5         57.305
60.756        67.656        60.755,5         67.656,5         64.206
67.657        74.557        67.656,5         74.557,5         71.107

State (continued)      Income
Texas                  53.207
Utah                   60.727
Vermont                55.176
Virginia               65.015
Washington             61.062
West Virginia          41.751
Wisconsin              53.357
Wyoming                58.840

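b) Construct a histogram. Does the data resemble a normal distribution?

No. The histogram is not bell-shaped: the relative frequencies of the eight classes (2%, 0%, 2%, 19%, 37%, 17%, 10%, 13%) are not symmetric about the peak class, and they rise again in the last class instead of tapering off.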

Household Income By State

class mark   frequency   relative frequency
22.800       1           2%
29.701       0           0%
36.602       1           2%
43.503       10          19%
50.404       19          37%
57.305       9           17%
64.206       5           10%
71.107       7           13%
total        52          100%

[Histogram: Household Income By State. Horizontal axis: class marks from 22.800 to 71.107. Vertical axis: relative frequency, 0% to 40%.]
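
The shape asked about in part b) can be previewed without a plotting tool. The sketch below takes the class marks and frequencies listed for the eight income classes, recomputes the relative frequencies, and prints one row per class as a sideways text histogram. The variable names are illustrative, and the class marks are typed as plain integers without the thousands separator.

```python
# Class marks and frequencies as listed for the eight income classes.
class_marks = [22800, 29701, 36602, 43503, 50404, 57305, 64206, 71107]
frequencies = [1, 0, 1, 10, 19, 9, 5, 7]

total = sum(frequencies)
relative = [f / total for f in frequencies]

# One '#' per state or territory, so bar length mirrors the histogram bar height.
for mark, f, rel in zip(class_marks, frequencies, relative):
    print(f"{mark:>6}  {'#' * f:<19}  {rel:4.0%}")
```

A bell-shaped data set would show bars that grow toward one central class and shrink at a similar rate on both sides of it.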

…

