You are on page 1of 4

Emily Guo

Data Analyst
San jose, CA,95051
SUMMARY
-------------------------------------------------------------------------------------- --------------------------------------------
-------
● 5+ years of experience in Data Analysis, Business Analyst with Finance Industry domain with proven ability
to articulate business values.
● Expertise in writing SQL (select/joins) to track progress metrics
● Expertise in using Python to do data cleaning and analysis with libraries such as Numpy, Pandas,
Matplotlib, Seaborn
● Hands on experience with statistical analysis using ANOVA, ANCOVA, and two-sample test
● Expert in creating data visualization reports using Tableau
● Hands on experience with machine learning modeling techniques including linear regression, Random
Forest, K-means
● Hands on experience with Database management including drawing the ERD diagram, normalization,
denormalization and query optimization.
● Create Functional Specification Documents and Data Mapping Analysis for System Integrations
● Seek to eliminate duplicate entries and record relevant content using strong attention to detail
● Ability to develop understanding of new systems and their methods of communication and interaction
● Experienced in performing project analysis which entails Requirements Analysis, SWOT Analysis, Data
Analysis, Gap Analysis, Process analysis and documentation of the same.
● Experienced in managing end-to-end project delivery, system & business analysis, operations & support for
various projects.
● Strong understanding of business process flows, Business Process Reengineering (BPR).
● Experienced in data warehouses and data marts for business intelligence reporting and data mining along with
developing and documenting process flows for business processes.
● Good knowledge and experience in Customer lifetime value (CLV)
● Strong communication skills and willingness to learn new technologies at work

TECHNICAL SKILLS
-----------------------------------------------------------------------------------------------------------------------------------------
Data Science: Machine learning, Neural Network (CNN, RNN, LSTM), NLP, A/B testing
Programming: Python (Pandas, NumPy, SciPy, Scikit-Learn, Matplotilb), SQL, Scala, R
Database: MySQL, PostgreSQL, MongoDB
Distributed Systems: Spark (MLlib, SparkSQL), Hadoop, Hive
Cloud: AWS (S3, EMR), GCP(BigQuery)
visualization tools: Tableau, Google Optimize, Github

WORKING EXPERIENCE
-----------------------------------------------------------------------------------------------------------------------------------------
Company: CitiBank Jan 2019-present
Role: Business Analyst/Data Analyst
Getzville, NY

Job description:
The daily goals are being responsible for managing and monitoring risks for fixed income, treasury bonds and
equity derivatives to provide security to customers and employees and to minimize losses, including applying
data-mining techniques, such as association, clustering, and classification for forecasting the prodolio value for
risk volatility.

Responsibilities:
● Work closely with key stakeholders to ensure alignment with the Bank’s risk appetite.
● Evaluate performance of existing products and forecasts and make strategy recommendations to manage the
risk including formal fixed income portfolio assessment reports.
● Gather operational risk-related data and conduct statistical analyses to quantify risk.
● Establish, implement and review status reports to ensure that proper management controls are maintained in the
various fixed income areas.
● Support management of citi fixed income policy and guidelines
● Collect and report on Key Risk Indicators (KRI’s) and Key Performance Indicators (KPIs); assist with
development and analysis of each.
● Communicate feedback to process owners and senior management regarding process improvement and risk
mitigation opportunities.
● Perform quarterly control validation activities to ensure all key controls are identified, challenged and tested.
● Provide assistance in addressing internal and external audit findings that relate to credit activity and help
prepare responses as requested.
● Identify and analyze situations that create risk for Citi, using various resources such as multiple risk systems
and knowledge of the elements of Risk
● providing analytical solution to MRM to compliance with model risk management governance policy and
procedures
● Develop analytical solutions to measure progress and effectiveness of MRM’s key initiatives such as policy
implementation, new model identification and attestation
● Synthesize trends and observations that emerge from the data to inform, or recommend strategic and tactical
solutions to senior management
● Define, prototype and document reporting requirements for technology implementation in partnership with IT
● Use SQL as data manipulation tools to cleanse or map the data to make it organised and easier to read
● Applied sorting, grouping, filtering conditions in Tableau to understand data in multiple dimensions and give
reports
● Classify each item in a set of data into one of predefined sets of classes or groups by using mathematical
techniques such as decision trees, linear programming, neural network and statistics.
● Performed out of time train/test split and in time train/test split using PCA method
Technologies Involved: SQL, Python, VBA, Tableau, linear regression, logistic regression, Principal
Component Analysis (PCA), decision trees, neural network, statistics

Company: SHUTTERFLY May 2018-Jan 2019


Role: Data Analyst Redwood City, CA

Job description:
The goals of the project are predicting the customer lifetime value(CLV) to effectively help to create and sustain
beneficial relationships with selected customers, therefore generating higher profitability and business growth by
applying the machine learning models.

Responsibilities:
● checking standardization with product management team and web application team to discuss the database
structures
● Analyze massive user/customers response to Shutterfly promotional information
● Used Python to clean historical and current data set, with tasks dealing with inconsistent format, missing
values, outliers and invaluable features
● Applied sorting, grouping, filtering conditions in Tableau to understand data in multiple dimensions (age,
salary range, occupations, etc)
● Created parameter in Tableau to plot dynamic graphs to quickly modify calculations and compare different
scenarios
● Unlock revenue opportunities by isolating and processing only the most relevant client information.
● Customer segmentation based on their specific characteristics (e.g. region, age, income for demographic
segmentation) using clustering, decision trees, logistic regression.
● Developed the Customer lifetime value (CLV) model by analyzing the client’s acquisition and attrition, use of
diverse banking products and services.
● Learned the CLV of every customer segment and discovered high-value and low-value segments by using
Generalized linear models (GLM) and Classification
● Used data mining techniques, such as classification and to extract useful features to predict the probability of a
customer’s response to a promotion or an offer.
● Helped build recommendation engines by creating algorithms such as Collaborative filtering (CF) involving
either user-based, or item-based to analyze other users’ preferences, then make recommendations.
Technologies Involved: SQL, Python, Tableau,linear models, logistic regression, Support vector machines
(SVMs)

Company: BESHTON SOFTWARE INC Sep 2017-May2018


Role: Business Analyst Santa Clara, CA

Job description:
The project’s goal is to help our customer(Advancorp) manage and track all the products sold, warranty, repair
records and usage patterns in order to improve customers efficiency and customer satisfaction.

Responsibilities:
● worked with a web application team to create feature descriptions to provide guidance and approve the
product-related Business Requirements Documents.
● Check the project progress daily and keeping good communication between technical teams and customer for
most frequent product/component failures
● Managed SQL Server database administration
● Create a daily performance report displayed as bar chart, pie chart, success/fail/rework status etc.
● Use Tableau to generate dashboards for component failure reports
● Explore the faulty products and calculate failure rate based on model, batch, and components and give detailed
failure reasons.
Technologies Involved: SQL, Tableau, MS Excel, MS PPT

EDUCATION
-----------------------------------------------------------------------------------------------------------------------------------------
University of California Santa Cruz Santa Cruz, CA
Bachelor of Arts in Economics
Boston University Boston, MA
Master of data analytics

You might also like