Professional Documents
Culture Documents
HADOOP Spa
HADOOP Spa
HOME
AWS
IAM
UNIX COMMANDS
ADVANCE SHELL SCRIPTING
HDFS
HDFS COMMANDS
MAPREDUCE
HIVE
HIVE PROGRAMS
COMPLEX DATA TYPES
HIVE TRICKS
XML FILE PROCESSING TYPE-1
XML FILE PROCESSING TYPE-2
XML FILE PROCESSING TYPE-3
JSON FILE PROCESSING
SQOOP
MYSQL TO HDFS
HDFS TO MYSQL
MYSQL TO HIVE
MYSQL
POINTS TO REMEMBER
DATASETS
XML DATA SET
SCALA PROGRAMMING
SCALA COLLECTIONS
DISCUSSION
XML FILE PROCESSING TYPE-3
CLICK HERE to download XML FILE
Create a hive table with XML SERDEPROPERTIES
PLEASE REFER THIS WEBSITE FOR MORE DETAIL
CREATE TABLE group_xml(GROUPNAME
string,GROUPDESCRIPTION string,USERNAME
array<string>)
ROW FORMAT SERDE
‘com.ibm.spss.hive.serde2.xml.XmlSerDe’
WITH SERDEPROPERTIES (
“column.xpath.GROUPNAME”=”/RIGHTSGROUP/@GROUPNA
ME”,
“column.xpath.GROUPDESCRIPTION”=”/RIGHTSGROUP/@G
ROUPDESCRIPTION”,
“column.xpath.USERNAME”=”/RIGHTSGROUP/USERREF/@U
SERNAME”)
STORED AS
INPUTFORMAT
‘com.ibm.spss.hive.serde2.xml.XmlInputFormat’
OUTPUTFORMAT
‘org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat’
TBLPROPERTIES
(“xmlinput.start”=”<RIGHTSGROUP”,”xmlinput.end”=”</RI
GHTSGROUP>”);
XML FILE:
OUTPUT
Convert a array data to structure data
Create a table to store structure data
create table final_xml(GROUPNAME
string,GROUPDESCRIPTION string,USERNAME string);
insert overwrite table final_xml select
GROUPNAME,GROUPDESCRIPTION,aUSERNAME from
group_xml lateral view explode(USERNAME)a as
aUSERNAME;
Now check the output:
SHARE THIS:
Share
SEARCH
Search for:
Close and accept
Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree
to their use.
To find out more, including how to control cookies, see here: Cookie Policy