You are on page 1of 1

JANNET KIM VU, PhD

Data Scientist • Machine Learning Engineer • Data Analyst


jannetkimvu@gmail.com • linkedin.com/in/jkvu • github.com/jkvu08 • jannetkimvu.weebly.com

SKILLS
Software: Python • R • SQL • Git • TensorFlow • Tableau • Jupyter • Google Colab • ArcGIS • QGIS • LaTeX
Analytical tools: A/B Testing • data mining • machine learning • deep learning • NLP • AI • hyperparameter optimization •
multivariate regression • classification • clustering • decision trees • neural nets • support vector machines • anomaly
detection • hierarchical linear modeling • remote sensing • geospatial analysis

EXPERIENCE
Quantitative Researcher, Stony Brook University – Stony Brook, NY 08/2016 – 12/2023
• Led a 5-member international team through the full lifecycle (i.e., definition of project scope, experimental design,
data collection, data engineering, statistical modeling, stakeholder reporting) of 4 resource management projects
• Generated data-driven insights on the effect of climate change on resources to inform decisions regarding the
management of ~70,000 acres of land using A/B testing, machine learning and deep learning in Python and R
• Acquired and cleaned over 500,000 field collected structured and unstructured data records in preparation for
data analysis using automated custom aggregation, data manipulation and data imputation in R
• Secured $296K in funds to support dissertation research through competitive grant writing
• Communicated scientific findings to technical and non-technical audience through storytelling via publications,
presentations and multimedia content (e.g., videos, podcasts, infographics)
Science Policy Analyst Intern, United Nations - Washington, DC 02/2019 – 07/2019
• Cultivated a strategic vision and roadmap for the circular economy in North America by building relationships
between 120 government, business and tech leaders at the inaugural Great Lakes Circular Economy Forum
• Organized the 3-day Great Lakes Circular Economy Forum by researching emerging technologies, coordinating
logistics, managing budgets and maintaining stakeholder communication through cross-functional collaboration
• Distilled news and scientific research into policy briefs, memos and data visualizations to keep executive directors
and senior program officers knowledgeable on current events and scientific findings
Junior Data Scientist, UCLA – Los Angeles, CA 01/2014 – 08/2016
• Supported business decisions regarding solar farm development by optimizing for energy production and
environment health among 36 projects using spatial simulation and nonparametric statistics in ArcGIS and Python
• Uncovered that land development, topography and precipitation accounted for 44% of the variation in butterfly
diversity across 153,075 acres of the world’s largest urban national park using Random Forest in R

PROJECTS
Flowering Cycles of Tropical Plants, Dissertation Research 09/2021 – 12/2022
• Achieved a performance of up to 87% (AUC) in forecasting the flowering cycles of 6 tropical plants by analyzing a
decade of data using correlation analysis, time series analysis and Bayesian binary classification in Python
Weather and Fruit Productivity, Dissertation Research 06/2021 – 12/2021
• Discovered that precipitation contributed to explaining 56% of the variation in fruit production and projected that
food insecurity will heighten during the dry season in Madagascar because of climate change using causal
inference, multivariate regression and A/B testing in R
Predicting the Multistate Behavior of Wild Lemurs, Dissertation Research 08/2019 – 05/2021
• Attained an 80% improvement (F1) in predicting the multistate behavior of wild primates by designing, training
and testing RNNs (i.e., LSTM, GRU, encoder-decoders) via hyperparameter optimization in Python (TensorFlow)
University of California (UC) Reserve Design, Undergraduate Research 06/2013 – 12/2013
• Revealed that the UC reserve system is habitable by > 70% of at-risk species and recommended new reserve
sites that aligned with the UC’s education and sustainable development vision by simulating the distribution of
546 species using machine learning (MaxENT) and counterfactual models in ArcGIS, R and Python

EDUCATION
PhD, Ecology & Evolution, Stony Brook University – Stony Brook, NY (GPA: 3.96) 12/2023
BS, Environmental Science (Minors: GIS & Conservation Biology), UCLA – Los Angeles, CA

Interests: nature • dog training • cooking • crafting • TED talks • yoga • kickboxing • dance • community service • scuba

You might also like