My question is about selecting and extracting some features from a dataframe column.
(我的问题是关于从数据框列中选择和提取一些功能。)
Does that effect in the effectiveness of ML algorithms. (这会影响ML算法的有效性吗?)
For example i'm doing analysis on "Chicago crime Dataset". (例如,我正在对“芝加哥犯罪数据集”进行分析。)
It has a "Date Of Occurrence" column and it has data in this form: "2018-11-23 05:10:00". (它具有“发生日期”列,并且具有以下形式的数据:“ 2018-11-23 05:10:00”。)
What i want to do is, i want to add some extra columns out of this for "year", "month", "weekday", "hour". (我想做的是,我想为此添加一些额外的列,用于“年”,“月”,“工作日”,“小时”。)
Will that effect in efficiency of KNN Classifier algorithm.
(这会影响KNN分类器算法的效率吗?)
This is the link to dataset if you want to check which dataset I'm Talking about.
(如果要检查我正在谈论的数据集,这是数据集的链接。)
" https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present/ijzp-q8t2 "
(“ https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-present/ijzp-q8t2 ”)
ask by Habib Ur Rehman translate from so 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…