You are on page 1of 2

FULL NAME

Tel: 123-456-7890 | Email: name@gmail.com | Linkedin | City, State, Zipcode

EXPERIENCE SUMMARY

Senior Data Scientist with 5+ years of industry experience working on high-impact projects on fraud detection, payments,
and risk management.

PROFESSIONAL EXPERIENCE

Position, Company | City, State Month Year - Present


● Improve customer acquisition by 12% YoY and productionize a new pandemic credit model by developing
multivariate regression models (predict portfolio losses over macroeconomic forecasts) and quantifying the impact of
monetary and fiscal policies to financial credit markets by industries.
● Lead model development with SQL database creation, data query automation, imputation, normalization,
preprocessing (using PCA and PCR) and optimization to support usage for over 100+ global financial institutions
and multiple products’ releases.
● Drive implementation of scalable machine learning models, such as neural networks and clustering, with large time
series datasets, composed of thousands of macroeconomic indicators, to deal with non-linear relationships that was
impractical by the regression-based time series models.
● Lead and supervise an intern (who later joined the team with return offer) to research on evaluation metrics, variable
selections, and cross-validation visualization dashboard. Research findings have been adopted by multiple teams
across the company.
Position, Company | City, State Month Year - Month
Year
● Designed and developed a Python program that collects user input through JSON, constructs various types of loans
with embedded options and floating rate, and calculates portfolio level risk as the output.
● Refactored 2000+ lines of code, implemented unit tests, and deployed programs using Python to serve daily usage for
the model verification team.
Position, Company | City, State Month Year - Month
Year
● In charge of an A/B test (experiment design and result analysis) leading to $8M incremental annual revenue (also
convinced leadership on $3M data purchase)
● Collected and processed loan quality data from XX and built regression models to measure quality of debt and
estimate incoming revenue, which eventually led the company’s decision to purchase millions of discharged loans.

TECHNICAL SKILLS
● Languages/Tools: Python, R, SQL, C++, Git, LaTex

● Libraries/Frameworks: TensorFlow, Scikit-Learn, Pandas, NumPy, BeautifulSoup, Matplotlib

● Machine Learning: Covariance matrix optimization, Classification (Random Forest, KNN, SVM), Regression
Modeling (linear, sparse, logistic, regularized), Principal Component Analysis (PCA, PCR, sparse PCA), clustering
(K-means)
● Stats & Experimentation: Time-Series Analysis (OLS, GMM, ARIMA, MLE), hypothesis testing, Monte-carlo
simulations, Financial forecasting, Covariance and correlation modeling

SELECTED PROJECTS
Movie Night Month Year - Month Year
(Note by Emma: You can select the icons and use Command + K to replace the links.)
● 50+ raw features are first collected with web-scraping, then cleaned by removing noises and normalizing the data,
later engineered to have consistent frequencies and range.
● Based on the selected evaluation metrics including F1 score and AUC, the voting ensemble model, composed of
Decision Trees, NeuralNets, and XGBOOST with RFE feature reduction, was proven to have the best out of sample
model performance.

EDUCATION
University of California, Berkeley Month Year - Month
Year
Master of Science
University of California, Los Angeles Month Year - Month
Year

Bachelor of Science

You might also like