Task01
Print the size of the data: the number of samples (rows) and the number of dimensions (columns).
Train data shape: (75414, 31)
TestA data shape: (50000, 30)
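The shapes above are what a pandas DataFrame reports through its `shape` attribute, as (rows, columns). A minimal sketch, assuming pandas; `demo_train` is a hypothetical stand-in, since these notes do not show how the real data is loaded:

```python
import pandas as pd

# Hypothetical toy frame standing in for the real training data.
demo_train = pd.DataFrame(
    {"a": [1, 2, 3], "b": [0.5, 1.5, 2.5], "label": [0, 1, 0]}
)

# shape is a (rows, columns) tuple: sample count first, dimension count second.
n_samples, n_columns = demo_train.shape
print("Train data shape:", demo_train.shape)
```

The training set having one more column than the test set (31 vs. 30) is consistent with the training data carrying a label column, though the notes do not say so explicitly.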
4. sklearn.metrics provides the metrics used to judge how good a model is (for classification); accuracy_score computes the accuracy.
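As a small illustration of accuracy_score, the sketch below compares made-up labels with made-up predictions. The lists `y_true` and `y_pred` are hypothetical and are not taken from the baseline:

```python
from sklearn.metrics import accuracy_score

# Hypothetical true labels and predictions, for illustration only.
y_true = [0, 1, 1, 0]
y_pred = [0, 1, 0, 0]

# Accuracy is the fraction of predictions that match the true labels.
acc = accuracy_score(y_true, y_pred)
print(acc)
```

Three of the four predictions match here, so the score is 3/4.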
5. The baseline performs feature selection, keeping only the numeric columns (to make it easier to understand).
numerical_cols = Train_data.select_dtypes(exclude='object').columns
print(numerical_cols)
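To see what this selection does, here is a minimal sketch on a toy frame. The frame `demo` and its column names are hypothetical; the `city` column is given the object dtype explicitly so the behaviour does not depend on the pandas version:

```python
import pandas as pd

# Hypothetical toy frame: two numeric columns and one object (string) column.
demo = pd.DataFrame(
    {
        "price": [1.0, 2.0],
        "count": [3, 4],
        "city": pd.Series(["a", "b"], dtype=object),
    }
)

# Drop every object-dtype column and keep the remaining column names.
numeric_cols = demo.select_dtypes(exclude="object").columns
print(list(numeric_cols))
```

Note that `exclude='object'` keeps everything that is not object-typed, which may include boolean or datetime columns as well; `select_dtypes(include='number')` is a stricter alternative if only numeric columns are wanted.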
Task02
Task03
Task04
Task05
Original: https://blog.csdn.net/weixin_59882919/article/details/121678554
Author: weixin_59882919
Title: Data Mining Training Camp Notes (数据挖掘训练营-笔记)