You are on page 1of 1

SONALI SINGH

Boston MA| 8578002953| singhsonali2611@gmail.com | linkedin.com/in/sonalisingh26| github.com/singhsonali978

EDUCATION
Northeastern University, Boston, MA, USA Expected Dec 2021
Master of Science in Information Systems
Courses: Data Science & Engineering Methods and Tools, Data Management & Database Design, Algorithmic Digital Marketing,
Data Warehousing & Business Intelligence, Advance Data Science & Architecture, Big Data Engineering Tools (Hadoop)

Mumbai University, Mumbai, MH, India Aug 2016


Bachelor of Engineering in Electronics and Telecommunication

PROFESSIONAL EXPERIENCE
American Tire Distributors, NC, USA Jan 2021 – Aug 2021
Data Engineer Intern
• Introduced and set up Talend use across the entire DMO team along with identifying the best practices for Talend ETL
components and jobs that brought about decommission of 50+ SQL processes
• Established efficient enterprise data integrations between heterogeneous sources (MDM, EBS, GCP, CRM) and targets utilizing
Talend/ SQLs / Unix Scripts and automated the workflows through enterprise schedulers for Pricing team models
• Scheduled daily Talend jobs to compare key attributes and track data sync between EBS and MDM servers to produce business
consumable reports through SSRS
• Programmed Python scripts and Used SnowSQL to perform Data Migration from On-Prem Database to Snowflake database
• Prototyped and realized an end-to-end Python utility for vendor address matching with 88% accuracy, when ingesting data from
various sources by comparing results from different APIs ( StrikeIron API, FedEx-API, Google Geocode API )

Accenture Solutions Private Limited (Avanade), Mumbai, India Aug 2016 – Jan 2019
Application Development Analyst
• Devised and optimized SQL queries. Improved execution time of SQL views for downstream users by 30% via derived tables
• Implemented a knowledge-based recommender system that mines and analyzes online activity tracked by Google Analytics to
set up a personalized view for users resulting in 20% in customer retention using python
• Delivered an interactive dashboard in Tableau for marketing team to make decisions to maximize customer conversions
• Designed PowerShell scripts for automatic selective content refresh throughout website during product rebranding that
effectively saved ~160 man-hours

TECHNICAL SKILLS
Languages : Python, SQL, C#
Technologies : Talend,Tableau, PowerBI, SSRS, Jira, Confluence, Github
Databases : MySQL, Microsoft SQL Server, MongoDB, Postgres SQL
Core Competencies : Linear Regression, Naive Bayes, K-Means, Regression Analysis, Time Series Forecasting
IDEs : VS Code, Visual Studio, Jupyter, Google Colab
Cloud and Big Data Tools : AWS-EC2, S3, Lambda, Google Cloud Platform (GCP), MapReduce, Athena, Pig, MongoDB, Hadoop, Hive

SELECTED ACADEMIC PROJECTS


Hadoop Big Data Analysis of BTS Airline Data| Pig, Map Reduce, Hive Nov 2020 – Oct 2020
• Applied Map Reduce chaining algorithm to search for 10 most punctual airline carriers since 2003 with Hadoop
• Computed the best months to travel for vacation by calculating total weather-related delays in Hive
• Generated a list of most efficient airports in PIG by aggregating delays that could have been avoided

AdventureWorks2017 Database |SSIS, Talend, Tableau, Alteryx, PowerBI Sep 2020 – Oct 2020
• Constructed Data Warehouse by instituting a dimensional Schema consisting of Facts and Dimensions and developed it by
reverse engineering, pipelining/migrating data from various sources using ERStudio, Alteryx, Talend and SSIS
• Created sub models of ER diagrams and extracted DDL scripts for other Databases viz. MySQL, MSSQL, PostgreSQL
• Performed data visualization to pull insights employing interactive dashboards in Tableau and PowerBI

Cross Channel Marketing Spend Optimization |Marketing Mix Model, AWS tools Feb 2020 – Mar 2020
• Analyzed real world sales data to provide budget-ROI optimization by comparing different attribution models such as FTA, LTA,
Time-Decay Attribution, Position based attribution and Linear attribution models
• Fashioned an ETL pipeline with AWS EC2, S3, Glue crawler and Athena and visualized on Apache-Superset

You might also like