Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Recent questions tagged hadoop

0 votes
600 views
1 answer
    My map is currently inefficient when parsing one particular set of files (a total of 2 TB). I'd like to ... as copying from local to dfs.)
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
373 views
1 answer
    Google's Dremel is described here. What's the difference between Dremel and MapReduce?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
478 views
1 answer
    I have a sample application working to read from csv files into a dataframe. The dataframe can be stored to a ... any help is much appreciated.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
721 views
1 answer
    The documentation describing how to connect to a Kerberos-secured endpoint shows the following: curl -i --negotiate ... command be looking for?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
523 views
1 answer
    I tried to install Hive on a Raspberry Pi 2. I installed Hive by uncompressing the zipped Hive package and ... with Derby would be appreciated!
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
879 views
1 answer
    I have an Oozie workflow. I am submitting all the Hive actions with <name>mapred.job.queue.name</name> < ... behind this behavior and solution.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
754 views
1 answer
    I have a folder structure in HDFS like below. However, no partitions were actually created on the table using the ... csv /file0001.csv ...etc
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
813 views
1 answer
    Sqoop is able to import data from multiple tables using the --query clause, but it is not clear whether it is able to import ... the priority? Thanks....
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
734 views
1 answer
    We have two Cloudera 5.7.1 clusters, one secured using Kerberos and one unsecured. Is it possible to run Spark ... we can't achieve the above.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
646 views
1 answer
    I want to create a Hive table with a multi-character string as a delimiter, such as CREATE EXTERNAL TABLE tableex(id ... multi string like "~*".
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
822 views
1 answer
    I'm trying to create a table in Hive with Java. I found java.sql.SQLException: org.apache.thrift.transport.TTransportException ... can I fix it?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
779 views
1 answer
    I understand HDFS will split files into something like 64 MB chunks. We have data coming in streaming and we can ... having, say, 1 GB files?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
589 views
1 answer
    I'm new to Spark. Now I can run Spark 0.9.1 on YARN (2.0.0-cdh4.2.1). But there is no log ... 2> $LOG_DIR/stderr Where can LOG_DIR be set?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
581 views
1 answer
    I have a data file which is in .txt format. I am using the file to load data into Hive tables. When I load ... to store data into an ORC table?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
391 views
1 answer
    I have the following scenario. Pig version used: 0.70. Sample HDFS directory structure: /user/training/test/ ... comma separated list? cheers
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
610 views
1 answer
    I have a directory of directories on HDFS, and I want to iterate over the directories. Is there any ... Spark using the SparkContext object?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
574 views
1 answer
    Right now I implement row count over ResultScanner like this: for (Result rs = scanner.next(); rs != null; ... to quickly count number of rows.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
404 views
1 answer
    I understand that you can send individual files as dependencies with Python Spark programs. But what about full ... number of distributed nodes?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
564 views
1 answer
    I am new to Apache Spark, and I just learned that Spark supports three types of cluster: Standalone - meaning ... cluster type should I choose?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
554 views
1 answer
    As far as I understand: sort by only sorts within the reducer; order by orders things globally but shoves ... doesn't always do that.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
436 views
1 answer
    I'm trying to understand the relationship of the number of cores and the number of executors when running a ... How to explain this result?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
416 views
1 answer
    My Hive version is 0.13. I have two tables, table_1 and table_2. table_1 contains: customer_id | items | price | ... using join, but it fails.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
483 views
1 answer
    I have the following Hive query: select count(distinct id) as total from mytable; which automatically spawns: 1408 ... in more reducers? Thanks!
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
881 views
1 answer
    As per title. I'm aware of textFile but, as the name suggests, it works only on text files. I would need to ... a local path. I'm using pyspark.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
468 views
1 answer
    I'm new to Hadoop and now I'm testing a simple workflow with just a single Sqoop action. It works if I use ... /value> </property> </configuration>
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
636 views
1 answer
    I have an Apache Spark application running on a YARN cluster (Spark has 3 nodes on this cluster) in cluster mode. ... is 12g in yarn-site.xml
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
580 views
1 answer
    I am trying to understand how Spark runs on YARN cluster/client. I have the following question in my mind. Is ... is sending the job to cluster?
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
491 views
1 answer
    I was using Hadoop in a pseudo-distributed mode and everything was working fine. But then I had to restart my ... sh and bin/start-dfs.sh.
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
