A tax is a compulsory financial charge or some other type of levy imposed on a taxpayer (an individual or legal entity) by a governmental organization in order to fund government spending and various public expenditures.
Can someone tel me what format to keep the csv file in. I have followed the instructions and the format of the file to be submitted but it keeps giving me an evaluation error
@sabeehalam have you changed the column name to prediction? - if you still face the issue after changing this, you can send your prediction file to support@dphi.tech.
I have completed the challenge, but my prediction model only got accuracy of 21%. If possible, can someone share the work script on what you suppose to do to make the prediction model. I am a beginner in this field and I really want to learn it. So I hope someone who got high accuracy prediction model can share their work script after the deadline.
If this Datathon ends, is DPhi going to put the top model with the prediction accuracy more than 70% at github? I want to learn how they can do it. I have tried hyperparameter tuning GridSearchCV, RandomizedSearchCV and GradientBoostingClassifier, but my model could only get accuracy 40%. So I have no idea what else to do to improve the accuracy.
Also in leaderboard, I saw people with 100% accuracy. Is it really possible?? it doesn’t seem make sense in real life situation to get 100% model accuracy. Is it not going to overfit?
Exactly my thought man. The features are barely even correlated to give that good a model. But I don’t exactly know too much about the field to be commenting on someone’s work.
I got the fix…actually the issue was prediction file has index column so wat i did was i deleted those and then tried to submit. It finally got submitted but accuracy being very low (around 36%)
You have to split and train then after model is created you follow the guidelines to submit i.e generate csv file by creating new_test_data and predicting your model output and saving it in target variable
They have clearly mentioned that target(output) is not present in the test data provided by them.
You need to evaluate accuracy using accuracy_score method but this is for us to find out accuracy of model we cant evaluate the accuracy of test data provided by them so we need to just send the predicted values in csv file this will be compared with real values (hidden to us) when you submit the csv file you will get the score of your model.
Hi @ismail
Don’t worry! We will be sharing the top performer notebooks at the end of the bootcamp so that all other learners can go through it and learn.