HADOOP SPARK

XML FILE PROCESSING TYPE-3
CLICK HERE to download the XML file
Create a Hive table with XML SERDEPROPERTIES
PLEASE REFER TO THIS WEBSITE FOR MORE DETAIL
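The CREATE TABLE statement in this section names the SerDe class com.ibm.spss.hive.serde2.xml.XmlSerDe, so Hive needs that class on its classpath before the statement will run. A minimal sketch of registering the jar for the current session follows; the jar path is a placeholder (an assumption, not taken from this page) and must be replaced with wherever the XML SerDe jar lives on your machine.

```sql
-- Placeholder path (hypothetical): point this at your copy of the XML SerDe jar.
ADD JAR /path/to/hivexmlserde.jar;

-- Confirm that the jar is registered for this session.
LIST JARS;
```

ADD JAR lasts only for the current session, so it has to be repeated in every new Hive session unless the jar is configured permanently.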
CREATE TABLE group_xml(
  GROUPNAME string,
  GROUPDESCRIPTION string,
  USERNAME array<string>)
ROW FORMAT SERDE 'com.ibm.spss.hive.serde2.xml.XmlSerDe'
WITH SERDEPROPERTIES (
  "column.xpath.GROUPNAME"="/RIGHTSGROUP/@GROUPNAME",
  "column.xpath.GROUPDESCRIPTION"="/RIGHTSGROUP/@GROUPDESCRIPTION",
  "column.xpath.USERNAME"="/RIGHTSGROUP/USERREF/@USERNAME")
STORED AS
INPUTFORMAT 'com.ibm.spss.hive.serde2.xml.XmlInputFormat'
OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat'
TBLPROPERTIES (
  "xmlinput.start"="<RIGHTSGROUP",
  "xmlinput.end"="</RIGHTSGROUP>");

Load an XML file into the Hive table

LOAD DATA LOCAL INPATH '/home/datablockz/Desktop/KISHORE_TRAINING/Group.xml'
OVERWRITE INTO TABLE group_xml;
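After the load, a quick query can confirm that the SerDe parsed the file. This is a sketch, assuming the table and columns defined earlier on this page; user_count is an illustrative alias, not a name from the original.

```sql
-- One row is expected per <RIGHTSGROUP> element in the file.
-- size() returns the number of USERNAME values collected for each group.
SELECT GROUPNAME,
       GROUPDESCRIPTION,
       size(USERNAME) AS user_count
FROM group_xml
LIMIT 10;
```

If the columns come back NULL, the XPath expressions or the xmlinput.start and xmlinput.end markers most likely do not match the element names in the file.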

XML FILE:

OUTPUT
Convert the array data to structured data
Create a table to store the structured data, with one row per user:
create table final_xml(
  GROUPNAME string,
  GROUPDESCRIPTION string,
  USERNAME string);
insert overwrite table final_xml
select GROUPNAME, GROUPDESCRIPTION, aUSERNAME
from group_xml
lateral view explode(USERNAME) a as aUSERNAME;
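One thing to watch with lateral view explode: a row whose array is empty or NULL produces no output rows, so a group without any users would vanish from the result. If such groups should be kept, Hive's LATERAL VIEW OUTER can be used instead. The query below is a sketch of that variant against the same group_xml table; whether the sample file actually contains groups without users is not known from this page.

```sql
-- OUTER keeps groups whose USERNAME array is empty or NULL;
-- aUSERNAME is NULL for those rows instead of the row being dropped.
SELECT GROUPNAME,
       GROUPDESCRIPTION,
       aUSERNAME
FROM group_xml
LATERAL VIEW OUTER explode(USERNAME) a AS aUSERNAME;
```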
Now check the output:
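A minimal sketch of the check, assuming the final_xml table created earlier on this page (user_rows is an illustrative alias):

```sql
-- Inspect a few of the flattened rows: one row per group and user.
SELECT * FROM final_xml LIMIT 10;

-- Count the rows per group; each count should equal the number of
-- USERNAME entries that group had in group_xml.
SELECT GROUPNAME, COUNT(*) AS user_rows
FROM final_xml
GROUP BY GROUPNAME;
```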