Details
7037 NE Ridge Dr
Hillsboro, OR-97124
2347389933
harishaddanki@gmail.com

Links
LinkedIn

Compute: EC2, EMR
Databases: Oracle, MySQL, Teradata

Profile
Lead Engineer with extensive experience designing and developing
complex end-to-end real-time and batch ingestion use cases. 9+ years
of experience with Retail, Banking, and Health domain data; extensive
domain expertise in supply chain (inventory, supply, and material
planning), consumer behavior, and privacy and security; and 7+ years
of experience in the big data ecosystem. With a Master's in Computer
Science and the ability to learn quickly, I can apply myself to any
use case.
• Migrated all Hive ETL code to Spark ETL using Spark RDDs and
PySpark.
• Developed UDFs for Hive and Pig to support additional functionality
provided by Teradata.
• Generated dynamic CASE statements in Python from Excel sheets
provided by the business.
• Worked extensively with PySpark, Hive, Pig, and Sqoop for data
sourcing and transformations.
• Worked on performance optimizations that reduced ETL run time
by 50%.
• Worked with Avro and Parquet file formats with Snappy
compression.
• Used Autosys to schedule Oozie workflows.
Education
Master's in Computer Science, University of Akron, Akron, OH
September 2012 — July 2014
GPA: 3.8/4.0