json - Is there a way to get the column names of a DataFrame in PySpark without reading the whole dataset?

Question

json - Is there a way to get columns names of dataframe in pyspark without reading the whole dataset?

1 Answer

深蓝 · Answer 1 · 2021-02-19T03:46:02+0000

From the official documentation:

If the schema parameter is not specified, this function goes through the input once to determine the input schema.

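So when no schema is supplied, Spark scans the whole input to infer one, and you cannot get the column names from the first line alone through that call.
As a workaround, add a step beforehand: extract a single line from the file, create a DataFrame from just that line, and read the column names from it.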
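
The extra step could look roughly like the sketch below. It assumes the data is a JSON Lines file (one object per line) on the local filesystem and that a SparkSession already exists; the helper names `first_json_lines` and `sample_column_names` are made up for illustration and are not part of PySpark.

```python
from itertools import islice


def first_json_lines(path, n=1):
    """Return the first n non-blank lines of a local JSON Lines file."""
    with open(path, encoding="utf-8") as f:
        non_blank = (line.strip() for line in f if line.strip())
        return list(islice(non_blank, n))


def sample_column_names(spark, path, n=1):
    """Infer column names from only the first n records.

    `spark` is an existing SparkSession (assumed, not created here).
    Only the sampled lines are handed to Spark, so schema inference
    never touches the rest of the file.
    """
    sample = first_json_lines(path, n)
    # spark.read.json also accepts an RDD of JSON strings
    rdd = spark.sparkContext.parallelize(sample)
    return spark.read.json(rdd).columns
```

Calling `sample_column_names(spark, "data.json")` would then return the column list for the sampled record. Keep in mind that any field missing from the sampled lines will not show up, so a larger `n` may be needed if the records are not uniform.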
