hadoop - How does Hive decide when to use map reduce and when not to?

Question


asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)


As a simple example,

select * from tablename;

DOES NOT kick in map reduce, while

select count(*) from tablename;

DOES. What general principle does Hive use to decide when to run MapReduce?


1 Answer

深蓝 · Answer 1 · 2021-10-23T17:52:31+0000

In general, any sort of aggregation, such as min/max/count, requires a MapReduce job, because every row has to be read and combined into a single result. That rule of thumb probably won't explain every case, though.
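One setting that accounts for the other half of the rule is hive.fetch.task.conversion, which controls whether simple select/filter/limit queries are served by a direct fetch from storage instead of a job. The sketch below is only an illustration: it assumes a Hive session where this property may be changed, and it reuses the tablename placeholder from the question.

```sql
-- Assumption: the session allows overriding this property.
-- With conversion disabled, even a plain SELECT * is submitted as a job.
SET hive.fetch.task.conversion=none;
SELECT * FROM tablename LIMIT 10;

-- With "more", simple select/filter/limit queries are read directly
-- through a fetch task, so no MapReduce job is launched.
SET hive.fetch.task.conversion=more;
SELECT * FROM tablename LIMIT 10;
```

The default value differs between Hive versions, so it is worth checking it in your own session by running SET hive.fetch.task.conversion; with no value.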

Like many RDBMSs, Hive has an EXPLAIN keyword that outlines how a query is translated into MapReduce jobs. Run EXPLAIN on both of your example queries to see what Hive is doing behind the scenes.
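As a minimal sketch of that suggestion, using the tablename placeholder from the question, the two plans can be compared like this. The exact plan text depends on your Hive version and execution engine, so treat the comments as what to look for rather than guaranteed output.

```sql
-- On typical settings this plan should contain only a fetch stage
-- (look for "Fetch Operator" under STAGE PLANS).
EXPLAIN SELECT * FROM tablename;

-- The aggregation should appear as an additional stage that runs
-- as a job (a "Map Reduce" stage when using the MapReduce engine).
EXPLAIN SELECT count(*) FROM tablename;
```

The STAGE DEPENDENCIES section at the top of each plan lists the stages, which makes it quick to see whether a job stage exists at all.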
