UPDATE: 2019-09-05
The chunksize parameter has been deprecated because pd.read_excel() never actually used it: owing to the nature of the XLSX file format, the whole file is read into memory during parsing.
There are more details about that in this great SO answer...
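Because the workbook is loaded whole anyway, one possible workaround is to read it once and then iterate over row slices of the resulting DataFrame. This is a minimal sketch, assuming the file fits in memory; iter_chunks is a hypothetical helper name, not a pandas function:

```python
import pandas as pd


def iter_chunks(df, chunksize):
    """Yield consecutive row slices of `df`, each at most `chunksize` rows long.

    Hypothetical helper for illustration; it is not part of pandas.
    """
    if chunksize <= 0:
        raise ValueError("chunksize must be a positive integer")
    for start in range(0, len(df), chunksize):
        # iloc slicing is positional, so this works for any index type
        yield df.iloc[start:start + chunksize]
```

With a real file this could be used as `for chunk in iter_chunks(pd.read_excel(filename), 10**5): ...`. Note that it only bounds the size of each piece being processed; it does not lower peak memory use, since the full sheet is already loaded.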
OLD answer:
You can use the read_excel() method:
chunksize = 10**5
for chunk in pd.read_excel(filename, chunksize=chunksize):
    # process `chunk` DF
If your Excel file has multiple sheets, take a look at bpachev's solution.