Cluster or One way Anova analysis of maruti suzuki car

Group members- DEEPAK, SHUBHAM, MAHAK AGARWALL

Batch- M2

From- ITM Business school, Kharghar, Navi Mumbai

 

Cluster Membership or Anova analysis-

 

Introduction-

Our topic is Maruti Suzuki car in this study, we talk about Maruti Suzuki car features and aim to analyze a dataset that categorizes various cases into distinct clusters based on their attributes and distances to cluster centroids.

Objective-

The primary objective is to investigate whether there are significant differences in various features among these clusters using Analysis of Variance (ANOVA). By understanding the relationships between cluster membership and the specified attributes, we can derive insights into the characteristics that differentiate the clusters.

Data Collection-

The dataset we collect through conduct the google from and taken the survey from collage, hostel and outside area.

  1. Cluster Membership Data: This data includes 100 cases, each assigned to one of four clusters based on their distances from the cluster centroids. The distance metric indicates how closely related each case is to its assigned cluster.
  2. ANOVA Results: The ANOVA analysis was conducted on several features, including fuel efficiency, features, air bags, easily modifiable, affordable price, bright lighting, high technology, comfort, and extra boot space. The analysis aimed to determine whether there are statistically significant differences in these features across the different clusters.

 

Data Analysis-

The data analysis form SPSS (IBM) software.

  1. Cluster Membership Analysis-

The cluster membership data indicates the distribution of cases across the four clusters, with distances varying significantly among them. Each case’s distance from the cluster centroid provides insight into how well it fits within its assigned cluster.

Cluster Membership

Case Number

Cluster

Distance

1

3

7.958

2

2

7.020

3

3

8.453

4

4

5.888

5

4

8.569

6

3

7.131

7

1

8.423

8

1

8.209

9

2

8.921

10

1

8.533

11

1

7.603

12

3

6.367

13

3

8.922

14

2

8.209

15

3

6.257

16

3

8.604

17

2

8.935

18

4

8.785

19

3

8.354

20

3

4.960

21

2

8.702

22

4

5.874

23

3

9.249

24

4

8.282

25

4

8.530

26

1

6.935

27

3

7.610

28

2

8.236

29

2

8.666

30

3

9.896

31

4

9.605

32

2

7.972

33

3

7.990

34

3

6.764

35

3

10.397

36

2

9.386

37

1

7.835

38

2

6.026

39

4

8.016

40

3

9.490

41

3

8.176

42

1

8.499

43

1

7.316

44

3

9.637

45

2

8.026

46

2

7.887

47

4

7.901

48

1

7.027

49

2

8.481

50

4

7.594

51

4

8.006

52

2

6.946

53

1

8.033

54

4

8.661

55

2

5.376

56

1

7.118

57

2

8.298

58

2

9.814

59

2

8.305

60

2

7.506

61

3

5.980

62

3

8.191

63

3

9.055

64

2

8.015

65

4

10.658

66

3

8.057

67

4

8.063

68

2

8.875

69

3

8.844

70

4

8.083

71

3

8.611

72

2

8.319

73

2

8.822

74

3

8.283

75

3

10.072

76

3

8.307

77

2

10.627

78

3

8.692

79

3

8.721

80

4

9.047

81

1

8.415

82

2

8.660

83

4

6.764

84

4

9.074

85

2

6.167

86

3

10.000

87

1

7.934

88

4

8.675

89

1

5.576

90

4

6.590

91

4

6.953

92

2

4.828

93

4

7.709

94

3

6.862

95

2

7.247

96

2

7.154

97

3

5.901

98

4

7.649

99

3

5.303

100

4

5.846

 

  1. ANOVA Analysis-

 

Feature

Between Groups SS

df

Mean Square

F

Sig.

Fuel efficiency

54.463

9

6.051

0.672

0.732

Features

23.222

9

2.580

0.324

0.965

Air bags

140.211

9

15.579

1.910

0.060

Easily modifiable

36.735

9

4.082

0.480

0.885

Affordable price

71.037

9

7.893

0.956

0.482

Bright lighting

44.360

9

4.929

0.571

0.817

High technology

39.638

9

4.404

0.515

0.860

Comfort

103.782

9

11.531

1.582

0.133

Extra boot space

58.379

9

6.487

0.645

0.755

 

ANOVA

 

Sum of Squares

df

Mean Square

F

Sig.

Fuel efficiency

Between Groups

54.463

9

6.051

.672

.732

Within Groups

810.447

90

9.005

 

 

Total

864.910

99

 

 

 

Feautures

Between Groups

23.222

9

2.580

.324

.965

Within Groups

715.738

90

7.953

 

 

Total

738.960

99

 

 

 

Air bags

Between Groups

140.211

9

15.579

1.910

.060

Within Groups

733.979

90

8.155

 

 

Total

874.190

99

 

 

 

Easily modifie

Between Groups

36.735

9

4.082

.480

.885

Within Groups

765.305

90

8.503

 

 

Total

802.040

99

 

 

 

Affordable price

Between Groups

71.037

9

7.893

.956

.482

Within Groups

743.153

90

8.257

 

 

Total

814.190

99

 

 

 

Bright lighting

Between Groups

44.360

9

4.929

.571

.817

Within Groups

776.550

90

8.628

 

 

Total

820.910

99

 

 

 

High technology

Between Groups

39.638

9

4.404

.515

.860

Within Groups

770.152

90

8.557

 

 

Total

809.790

99

 

 

 

Comfort

Between Groups

103.782

9

11.531

1.582

.133

Within Groups

656.218

90

7.291

 

 

Total

760.000

99

 

 

 

Extraboot space

Between Groups

58.379

9

6.487

.645

.755

Within Groups

904.581

90

10.051

 

 

Total

962.960

99

 

 

 

 

 

Interpretation-

The ANOVA results indicate that none of the features show statistically significant differences between the groups at the conventional alpha level of 0.05.

  • The Air bags feature comes closest to significance with a p-value of 0.060, suggesting a potential trend that may warrant further investigation.
  • All other features, including fuel efficiency, features, easily modifiable, affordable price, bright lighting, high technology, comfort, and extra boot space, have p-values well above the significance threshold, indicating no significant differences among the clusters for these attributes.

The high p-values suggest that the variation in these features is largely due to random chance rather than the effects of cluster membership.

Conclusion-

In conclusion, the analysis of cluster membership and ANOVA results indicates that there are no significant differences in the assessed features across the identified clusters. While the air bags feature approaches significance, it does not meet the conventional threshold, and further research may be needed to explore this trend. The findings imply that the clusters may not be distinctly characterized by the features analyzed, suggesting that other factors or additional features may be necessary to effectively differentiate between the clusters. Future studies could include a broader range of features or utilize different clustering techniques to gain deeper insights into the data.

 

 

Leave a comment