Group Hive-user
Hi folks,
Through samples here and there, I've seen table definitions using RCFile
STORED AS RCFILE
behavior?
Thank you
Mathieu
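A minimal sketch of the two declaration styles the question appears to contrast, assuming it is about whether a SerDe has to be named next to STORED AS RCFILE. The table names rc_short and rc_explicit are made up for illustration. The class names are the ones Hive ships for RCFile, but which SerDe the shorthand form resolves to depends on the Hive version and configuration, so checking with DESCRIBE FORMATTED is safer than assuming.

```sql
-- Shorthand form: no SerDe is named, so Hive fills in its default
-- columnar SerDe together with the RCFile input/output formats.
CREATE TABLE rc_short (id INT, name STRING)
STORED AS RCFILE;

-- Explicit form: the SerDe and both file formats are spelled out.
CREATE TABLE rc_explicit (id INT, name STRING)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.columnar.ColumnarSerDe'
STORED AS
  INPUTFORMAT 'org.apache.hadoop.hive.ql.io.RCFileInputFormat'
  OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.RCFileOutputFormat';

-- Shows the "SerDe Library", "InputFormat" and "OutputFormat" that the
-- shorthand table actually received on this installation.
DESCRIBE FORMATTED rc_short;
```

Comparing the DESCRIBE FORMATTED output of both tables shows whether the two forms are equivalent on a given cluster.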
Related discussions
How do I set the Row Group Size of RCFile in Hive
CREATE TABLE OrderFactPartClustRcFile( order_id INT, emp_id INT, order_amt FLOAT, order_cost FLOAT, qty_sold FLOAT,
freight FLOAT, gross_dollar_sales FLOAT, ship_date STRING, rush_order STRING, customer_id INT, pymt_type INT,
shipper_id INT ) PARTITIONED BY (order_date STRING) CLUSTERED BY (order_id) SORTED BY (order_id)
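A hedged sketch of one way to influence the row group size, assuming the session-level property hive.io.rcfile.record.buffer.size is honored by the Hive version in use. The value is given in bytes and takes effect when the RCFile is written, so it has to be set before the INSERT that produces the files. The tables orders_text and orders_rc are hypothetical stand-ins, not the table from the question.

```sql
-- Value is in bytes:
--    4 MB =   4 * 1024 * 1024 =   4194304
--   32 MB =  32 * 1024 * 1024 =  33554432
--  128 MB = 128 * 1024 * 1024 = 134217728
SET hive.io.rcfile.record.buffer.size=33554432;

-- Hypothetical target table stored as RCFile.
CREATE TABLE orders_rc (order_id INT, order_amt FLOAT)
STORED AS RCFILE;

-- The setting applies to files written by this statement; RCFiles that
-- already exist keep the row group size they were written with.
INSERT OVERWRITE TABLE orders_rc
SELECT order_id, order_amt FROM orders_text;
```

Changing the property alone does not rewrite existing data, so a table has to be reloaded before a different row group size can show up in scan statistics.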
Hi, can someone show me how to use RCFile in a plain MapReduce job (as input and output format)? Please.
RCFile Performance
http://qnalist.com/questions/548271/specification-of-serde-in-rcfile
1/5
11/16/2014
Hi Experts, I have a large file with 300+ columns. In order to query only a few rows efficiently, I am using the RCFile format in
Hive. I have tried setting the RCFile row group size from the default size up to 32 MB, e.g.: set
hive.io.rcfile.record.buffer.size = 134217728; However, I do not see major changes in the amount of HDFS data scanned.
Moreover, the amount of data scanned with RCFile is not significantly
Writing To Rcfile
Could someone please point me to some way to store data in RCFile format with Snappy compression? I need to use this
output in Hive.
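One possible approach, sketched under the assumption that the data is written from Hive itself and that the Snappy codec is installed on the cluster. The property names are the classic MapReduce-era ones and may differ on newer Hadoop releases. The tables events_text and events_rc are hypothetical.

```sql
-- Ask Hive to compress the files produced by the final job stage.
SET hive.exec.compress.output=true;
-- Pick Snappy as the codec (classic MapReduce property name).
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;

-- Hypothetical RCFile table that receives the compressed output.
CREATE TABLE events_rc (event_id INT, payload STRING)
STORED AS RCFILE;

INSERT OVERWRITE TABLE events_rc
SELECT event_id, payload FROM events_text;
```

Because RCFile records the codec inside each file, readers such as Hive can decompress the data without extra table properties, provided the codec is available to them.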
RCfile
Hi, I want to use RCFile to address the IO problem, but I cannot find any document about how to install it or how to use it from
Pig. If you have an installation or configuration file, could you share it with me? Thank you. Best Regards Malone 2012-05-24
Impalad Crashed When Query Data Stored In The RCFILE Format Stored In Hive Table
Hi, I have installed Impala 0.6 and CDH 4.2, and I have set up my cluster with three data nodes and a namenode. First, I created a
table stored in TEXTFILE format in Hive and loaded about 150 million rows into the table. I could query the data in Hive
and in impalad-shell without any errors, but the query speed was too slow (described on
https://groups.google.com/a/cloudera.org/forum/#!topic
Strange Display When Upload RCFile From Local With Different Column.
Hello, I tried to import data from SQL into a Hive RCFile table. I use RCFile.Writer to generate the RCFiles and upload them to the
Hive table directory. The RCFiles have different numbers of columns:
c0_1.rc has 2 columns
c1_1.rc has 3 columns
c2_1.rc has 4 columns
The output was strange:
# All rc files are loaded:
hive> select * from simple;
OK
1 foo NULL null
2 bar NULL null
3 foobar NULL null
3 haliluya NULL
Parquet VS RCfile
Hi all, I'd like to share my simple performance test for comparing 3 different file types (Text, Parquet, RCFile). Environment *
My cluster consists of 8 DNs, and each node is equipped with 24-core CPU, 64 GB memory and 6 disks.
Total file size of each file type
                 TEXT (no compression)   PARQUET (snappy)   RCFILE (snappy)
Total size       58.5Gb                  19.2Gb             16.5Gb
Num. of files    8                       88                 236
Num. of rows     400M                    400M
Problem With Load Data From Local File System Into RCFILE Table
Hi, I have a problem with loading data into an RCFILE table from the local file system. I am using Hive 0.7.1 from Cloudera's
distribution.
1. Create the table: create table test(c1 int,c2 string) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' Stored as RCFILE;
2. Load the text file into the table: LOAD DATA LOCAL INPATH 'test.txt' INTO TABLE text;
The Hive command line throws errors like the following:
Loading data to table
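A common workaround for this kind of problem, sketched here under the assumption that the cause is LOAD DATA itself: it only moves files into the table directory and does not convert plain text into RCFile. The staging table test_text and the target table test_rc are hypothetical names, and the column layout is copied from the post.

```sql
-- 1. Staging table that matches the delimited text file as it is on disk.
CREATE TABLE test_text (c1 INT, c2 STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

-- 2. LOAD DATA copies the file unchanged, which is fine for a text table.
LOAD DATA LOCAL INPATH 'test.txt' INTO TABLE test_text;

-- 3. Target table in RCFile format.
CREATE TABLE test_rc (c1 INT, c2 STRING)
STORED AS RCFILE;

-- 4. Let a Hive job read the text rows and rewrite them as RCFile.
INSERT OVERWRITE TABLE test_rc
SELECT c1, c2 FROM test_text;
```

The staging table can be dropped once the INSERT has finished and the row counts of both tables match.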
queries over certain time frames get me RCFile/Compression issues. The data goes in fine. Is this a FS level corruption issue? Is
this something tunable? How would I even go about troubleshooting something like this? Hive Runtime Error while processing
writable SEQ -org.
Problem With Memory Limit When Query To Uncompress Rcfile Table Using Impala
Hi, I have trouble with RCFile when querying it using Impala:
1. I used the HiBench tool to create an 18Gb uncompressed sequence file and inserted it into uservisits.
2. I used Hive to create a table with this format:
hive> set mapred.output.compress=false;
hive> set hive.exec.compress.output=false;
hive> CREATE TABLE uservisits_rcfile (sourceIP STRING,destURL STRING,visitDate STRING,adRevenue DOUBLE,
How To Update On A Huge RCFILE FORMAT PARTITIONED HIVE Table, How To Apply Deltas(incremental Data).
If you have a Hive table that is stored in RCFILE format and is partitioned, and you want to apply updates to it from the deltas that
are coming in, how can that be done? Please share ideas that give the best performance, since the whole table is pretty huge,
about 10 TB in size. Building a temp table and swapping it in may not be a good option because of the huge volume, and I don't
want to use HBase either