Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

Recent questions tagged Hadoop

0 votes
565 views
1 answer
    How to copy file from HDFS to the local file system . There is no physical location of a file under the file ... .i am tried through winscp . See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
364 views
1 answer
    I have created the hive external table using below command: use hive2; create external table depTable (depId int ... it doesn't get deleted. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
694 views
1 answer
    I'm trying to create an internal (managed) table in hive that can store my incremental log data. The ... delimiters and load data successfully. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
520 views
1 answer
    I'm trying to Sqoop the data from Teradata to hive. I thought of following the below steps: 1) Create ... in the corresponding Hive table? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
690 views
1 answer
    I am using CDH 5.3.3 and using hive JDBC driver to connect to hive in the secured cluster. I ... jdbc.HiveDriver.connect(HiveDriver.java:104) See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
711 views
1 answer
    I have a folder which contains many small .gz files (compressed csv text files). I need to read them in my ... name using sc.textFile(...) See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
448 views
1 answer
    I have two tables,tab1 & tab2. tab1(T1) tab2(T2) a1 b1 b1 c1 c1 f1 d1 g1 I am looking for the values from ... join tab2 on (tab1.T1!=tab2.T2); See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
461 views
1 answer
    I am learning Scala and i need to write a custom file to HDFS. I have my own HDFS running on a Cloudera image using ... ) } println("Done!") } } See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
609 views
1 answer
    I have a map reduce job that connects to HBASE and I can't figure out where I am running into this error ... advance for any help or direction. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
540 views
1 answer
    Does a block in Hadoop Distributed File System store multiple small files, or a block stores only 1 file? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
565 views
1 answer
    I am running Apache Pig .11.1 with Hadoop 2.0.5. Most simple jobs that I run in Pig work perfectly fine. ... how to get rid of these messages? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
505 views
1 answer
    Any one know how many bytes occupy per file in namenode of Hdfs? I want to estimate how many files can store in single namenode of 32G memory. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
460 views
1 answer
    I am running hadoop with default configuration with one-node cluster, and would like to find where HDFS stores files locally. Any ideas? Thanks. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
585 views
1 answer
    Is there a way to delete files older than 10 days on HDFS? In Linux I would use: find /path/to/directory/ ... done based on file creation date) See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
542 views
1 answer
    We can persist an RDD into memory and/or disk when we want to use it more than once. However, do we ... myself, I get slower performance. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
666 views
1 answer
    I'm having trouble to append data to an existing file in HDFS. I want that if the file exists then append a ... I'm missing or doing wrong? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
456 views
1 answer
    We all know that the block size in HDFS is pretty large (64M or 128M) as compared to the block size in ... please throw some light on this. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
542 views
1 answer
    This command works with HiveQL: insert overwrite directory '/data/home.csv' select * from testtable; But with ... CSV feature in Spark SQL. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
490 views
1 answer
    When load data from HDFS to Hive, using LOAD DATA INPATH 'hdfs_file' INTO TABLE tablename; command, it looks ... be used by another process. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
584 views
1 answer
    I see exit codes and exit statuses all the time when running spark on yarn: Here are a few: CoarseGrainedExecutorBackend ... on a *lost* node See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
694 views
1 answer
    The join documentation for Hive encourages the use of implicit joins, i.e. SELECT * FROM table1 t1, ... the above return additional records? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
393 views
1 answer
    I execute a join using a javaHiveContext in Spark. The big table is 1,76Gb and has 100 millions record. The ... are stored as Parquet file. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
395 views
1 answer
    I have a math problem that I solve by trial and error (I think this is called brute force), and the program ... is more of a general question. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
662 views
1 answer
    I have some web server logs that I'd like to query with Hive. The directory structure, in HDFS, looks ... running those 4 commands every time? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
597 views
1 answer
    I want to list all folders within a hdfs directory using Scala/Spark. In Hadoop I can do this by using the ... file system with schema file//. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
553 views
1 answer
    The parameter "mapred.min.split.size" changes the size of the block in which the file was written earlier? ... occupy blocks in HDFS 128M; See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
403 views
1 answer
    I have installed cloudera CDH 5 by using cloudera manager. I can easily do hadoop fs -ls /input/war-and- ... same file by using hadoop commands? See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
0 votes
794 views
1 answer
    I am new to spark and I have a question. I have a two step process in which the first step write a ... . Hoping to find a better alternative. See Question&Answers more detail:os...
asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)
Ask a question:
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...