python - How to reversibly store and load a Pandas dataframe to/from disk

1 Answer

深蓝 · Answer 1 · 2021-10-16T22:11:51+0000

The easiest way is to pickle it using to_pickle:

df.to_pickle(file_name)  # where to save it, usually as a .pkl

Then you can load it back using:

df = pd.read_pickle(file_name)

Note: before 0.11.1 save and load were the only way to do this (they are now deprecated in favor of to_pickle and read_pickle respectively).
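To check that the pickle round trip really is reversible, here is a minimal sketch. The sample frame, the temporary directory, and the file name frame.pkl are made up for illustration; only to_pickle and read_pickle come from the answer above.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data with a few dtypes that plain-text formats tend to lose.
df = pd.DataFrame(
    {
        "id": [1, 2, 3],
        "when": pd.to_datetime(["2021-01-01", "2021-06-15", "2021-10-16"]),
        "label": pd.Categorical(["a", "b", "a"]),
    }
)

with tempfile.TemporaryDirectory() as tmp_dir:
    file_name = os.path.join(tmp_dir, "frame.pkl")  # illustrative path
    df.to_pickle(file_name)             # save
    restored = pd.read_pickle(file_name)  # load

# Raises AssertionError if values, index, or dtypes differ.
pd.testing.assert_frame_equal(restored, df)
```

Keep in mind that unpickling can execute arbitrary code, so read_pickle should only be used on files you trust.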

Another popular choice is HDF5 (via the optional PyTables package), which offers very fast access times for large datasets:

import pandas as pd
store = pd.HDFStore('store.h5')

store['df'] = df  # save it
df = store['df']  # load it
store.close()  # release the file handle when done
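The same HDFStore pattern can be written with a context manager so the file is closed even if an error occurs. This is a sketch under assumptions: the sample frame and temporary path are invented, and the HAVE_PYTABLES guard is only there because PyTables is an optional dependency that may not be installed.

```python
import os
import tempfile

import pandas as pd

try:
    import tables  # noqa: F401  (PyTables; HDFStore needs it)
    HAVE_PYTABLES = True
except ImportError:
    HAVE_PYTABLES = False

# Hypothetical numeric sample data.
df = pd.DataFrame({"x": [1.0, 2.0, 3.0], "n": [10, 20, 30]})
restored = None

if HAVE_PYTABLES:
    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, "store.h5")  # illustrative path

        # Write: the store is closed automatically on leaving the block.
        with pd.HDFStore(path) as store:
            store["df"] = df

        # Read back in read-only mode.
        with pd.HDFStore(path, mode="r") as store:
            restored = store["df"]

    pd.testing.assert_frame_equal(restored, df)
```

One store file can hold several frames under different keys, which is convenient when related tables belong together.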

More advanced strategies are discussed in the cookbook.

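Since 0.13 there's also msgpack, which may be better for interoperability, as a faster alternative to JSON, or if you have Python object/text-heavy data (see this question). Note that to_msgpack and read_msgpack were deprecated in pandas 0.25 and removed in 1.0, so this option only applies to older versions.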
