DATA EXPLORATION & PREPROCESSING SUMMARY
----------------------------------------
Dataset Shape:
- Rows: 9551
- Columns: 21
Missing Values:
- Cuisines: 9
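A per-column missing-value count like the one above can be produced along these lines. This is a minimal sketch, not the script that generated this summary: the small DataFrame is a hypothetical stand-in for the real data, and only the column names are taken from this report.

```python
import pandas as pd

# Hypothetical stand-in for the real dataset; values are made up for illustration.
df = pd.DataFrame({
    "Restaurant Name": ["A", "B", "C"],
    "Cuisines": ["Indian", None, "Thai"],
    "Votes": [10, 0, 25],
})

# Count nulls per column, then keep only the columns that have any.
missing = df.isnull().sum()
missing = missing[missing > 0]
print(missing)
```

Filtering to non-zero counts keeps the report short when most columns are complete, which matches the single-entry listing above.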
Data Types Before Conversion:
Restaurant ID int64
Restaurant Name object
Country Code int64
City object
Address object
Locality object
Locality Verbose object
Longitude float64
Latitude float64
Cuisines object
Average Cost for two int64
Currency object
Has Table booking object
Has Online delivery object
Is delivering now object
Switch to order menu object
Price range int64
Aggregate rating float64
Rating color object
Rating text object
Votes int64
Data Types After Conversion:
Restaurant ID int64
Restaurant Name category
Country Code int64
City category
Address category
Locality category
Locality Verbose category
Longitude float64
Latitude float64
Cuisines category
Average Cost for two int64
Currency category
Has Table booking category
Has Online delivery category
Is delivering now category
Switch to order menu category
Price range int64
Aggregate rating float64
Rating color category
Rating text category
Votes int64
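The before/after listings show every text column becoming `category` while numeric columns keep their dtype. One way to get that result is sketched below, assuming the rule was "convert all non-numeric columns". The toy DataFrame is hypothetical, and the original script may have selected columns differently.

```python
import pandas as pd

# Hypothetical sample using a few column names from the listing above.
df = pd.DataFrame({
    "City": ["New Delhi", "Gurgaon", "New Delhi"],
    "Has Table booking": ["Yes", "No", "No"],
    "Votes": [314, 12, 45],
    "Aggregate rating": [4.8, 0.0, 3.2],
})

# Assumption: every non-numeric column is treated as categorical.
text_cols = df.select_dtypes(exclude="number").columns

# astype with a per-column mapping leaves the numeric columns untouched.
df = df.astype({col: "category" for col in text_cols})
print(df.dtypes)
```

Category dtype tends to pay off for low-cardinality columns such as `Has Table booking`. For columns that are close to unique per row, such as `Address`, the memory saving is usually small.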
Aggregate Rating Distribution:
0.0 2148
1.8 1
1.9 2
2.0 7
2.1 15
2.2 27
2.3 47
2.4 87
2.5 110
2.6 191
2.7 250
2.8 315
2.9 381
3.0 468
3.1 519
3.2 522
3.3 483
3.4 498
3.5 480
3.6 458
3.7 427
3.8 400
3.9 335
4.0 266
4.1 274
4.2 221
4.3 174
4.4 144
4.5 95
4.6 78
4.7 42
4.8 25
4.9 61
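A distribution ordered by rating value, as listed above, is what `value_counts()` followed by `sort_index()` produces. The sketch below uses a made-up Series in place of the real `Aggregate rating` column, so the counts are illustrative only.

```python
import pandas as pd

# Hypothetical ratings; the real column has 9551 entries.
ratings = pd.Series([0.0, 3.2, 0.0, 4.1, 3.2, 0.0], name="Aggregate rating")

# value_counts() orders by frequency by default; sort_index() reorders by rating.
distribution = ratings.value_counts().sort_index()
print(distribution)
```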
Class Imbalance (Proportion):
0.0 0.224898
1.8 0.000105
1.9 0.000209
2.0 0.000733
2.1 0.001571
2.2 0.002827
2.3 0.004921
2.4 0.009109
2.5 0.011517
2.6 0.019998
2.7 0.026175
2.8 0.032981
2.9 0.039891
3.0 0.049000
3.1 0.054340
3.2 0.054654
3.3 0.050571
3.4 0.052141
3.5 0.050257
3.6 0.047953
3.7 0.044707
3.8 0.041880
3.9 0.035075
4.0 0.027850
4.1 0.028688
4.2 0.023139
4.3 0.018218
4.4 0.015077
4.5 0.009947
4.6 0.008167
4.7 0.004397
4.8 0.002618
4.9 0.006387
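The proportions above are each count divided by the total number of rows. The check below rebuilds them from the counts reported in the Aggregate Rating Distribution section. It is a sketch for verifying the arithmetic. On the original DataFrame, `value_counts(normalize=True)` would presumably give the same figures directly.

```python
import pandas as pd

# Counts copied from the Aggregate Rating Distribution section of this summary.
counts = pd.Series({
    0.0: 2148, 1.8: 1, 1.9: 2, 2.0: 7, 2.1: 15, 2.2: 27, 2.3: 47,
    2.4: 87, 2.5: 110, 2.6: 191, 2.7: 250, 2.8: 315, 2.9: 381,
    3.0: 468, 3.1: 519, 3.2: 522, 3.3: 483, 3.4: 498, 3.5: 480,
    3.6: 458, 3.7: 427, 3.8: 400, 3.9: 335, 4.0: 266, 4.1: 274,
    4.2: 221, 4.3: 174, 4.4: 144, 4.5: 95, 4.6: 78, 4.7: 42,
    4.8: 25, 4.9: 61,
})

total = int(counts.sum())        # should match the reported row count
proportions = counts / total     # share of rows at each rating value

print(total)
print(proportions.round(6))
```

The counts sum to 9551, matching the row count under Dataset Shape, and the 0.0 rating alone accounts for about 22.5% of rows.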