I have a collection of text documents. I've been asked to show each document in tf-idf vector space and in ntc form and then, train a svm model based on documents' vectors in python. What does ntc exactly mean here?
I Found that it's the same as tf-idf weights with one step of normalization which is called "cosine normalization". But i can't find information about such thing. I found "cosine similarity" which is in my idea different from "cosine normalization". Are they the same? And how can i create this vector in python?
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…