python - How can i reduce memory usage of Scikit-Learn Vectorizers?

Question

Welcome To Ask or Share your Answers For Others

python - How can i reduce memory usage of Scikit-Learn Vectorizers?

1 Answer

深蓝 · Answer 1 · 2021-10-23T19:32:09+0000

I would strongly recommend you to use the HashingVectorizer when fitting models on large dataset.

The HashingVectorizer is data independent, only the parameters from vectorizer.get_params() are important. Hence (un)pickling `HashingVectorizer instance should be very fast.

The vocabulary based vectorizers are better suited for exploratory analysis on small datasets.

Categories

python - How can i reduce memory usage of Scikit-Learn Vectorizers?

python - How can i reduce memory usage of Scikit-Learn Vectorizers?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags