
What Is Apache Tez?

• An application framework
• Built on top of Apache Hadoop YARN
• Uses directed acyclic graphs (DAGs)
• Open source, Apache 2.0 license
• Scalable
• Performant

Hadoop Ecosystem

Tez DAG

• Tez directed acyclic graph (DAG)
• Distributed data processing
• Vertices represent data transformations
• Edges represent data movement
• For data processing applications
• Tez is an execution engine
• Built on top of YARN
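
The vertex and edge model above can be illustrated with a small sketch. This is not the Tez API: it is a plain Python toy, and every name in it (`vertices`, `edges`, `run_dag`, the sample province codes) is a hypothetical chosen for illustration. It only shows the idea that vertices transform data, edges say whose output feeds whom, and a vertex runs once all of its upstream vertices have finished.

```python
from collections import Counter
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Vertices: name -> transformation. Each function receives a list holding
# the outputs of its upstream vertices, in the order listed in `edges`.
vertices = {
    "scan": lambda inputs: ["ON", "QC", "ON", "BC", "ON"],  # made-up sample rows
    "count_by_province": lambda inputs: Counter(inputs[0]),
    "distinct": lambda inputs: sorted(set(inputs[0])),
    "join": lambda inputs: [(p, inputs[0][p]) for p in inputs[1]],
}

# Edges: vertex -> upstream vertices whose output it consumes (data movement).
edges = {
    "scan": [],
    "count_by_province": ["scan"],
    "distinct": ["scan"],
    "join": ["count_by_province", "distinct"],
}

def run_dag(vertices, edges):
    """Run every vertex after all of its upstream vertices have run."""
    results = {}
    for name in TopologicalSorter(edges).static_order():
        upstream = [results[dep] for dep in edges[name]]
        results[name] = vertices[name](upstream)
    return results

results = run_dag(vertices, edges)
print(results["join"])  # [('BC', 1), ('ON', 3), ('QC', 1)]
```

In this toy, everything runs in one process. In a real engine the vertices would run as distributed tasks and the edges would move data between them.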
Tez

https://journalofbigdata.springeropen.com/articles/10.1186/s40537-016-0051-6
SELECT *
FROM (SELECT PROVINCE, COUNT(*) AS CNT FROM customer_data GROUP BY PROVINCE) AS A
LEFT JOIN (SELECT PROVINCE, COUNT(*) AS CNT FROM customer_data GROUP BY PROVINCE) AS B
  ON A.PROVINCE = B.PROVINCE;

SELECT phone_number, COUNT(*) AS count_rec
FROM xademo.customer_details
GROUP BY phone_number
ORDER BY phone_number
LIMIT 10;

SELECT a.name, COUNT(*)
FROM testdb.mytable a, testdb.mytable b, testdb.mytable c
WHERE a.name = b.name
  AND a.name = c.name
GROUP BY a.name
UNION
SELECT a.name, COUNT(*)
FROM testdb.mytable a, testdb.mytable b, testdb.mytable c
WHERE a.name = b.name
