You are on page 1of 3

Brahyan Jimenez Aricapa

Medellín, Colombia
Phone number: +57 3233804664 E-mail: brahyanmaiko@gmail.com
Linkedin: [ linkedin.com/in/brahyan-jimenez-aricapa-ba0529189 ]
GitHub: [ https://github.com/Brahyanmaiko ]

Data Scientist / Machine Learning Engineer


Python programmer 2+ years of experience and Data Scientist with 1+ years of experience. I have worked on
different Machine Learning projects using the following technologies: Python, Numpy, Pandas, Scikit-learn,
Keras, PyTorch, Docker, Linux, and AWS, I have used other tools like: MATLAB, Ubidots, Esp32 and Sensors.
I’m looking for my next challenge in the Data Science and Machine Learning industry where I can find solutions
to business problems and be impactful while contributing to the development of my career.

Work Experience

Data Scientist at Finaktiva Feb 2023 to date

● ML Risk back-testing models Analyze, monitor, retrain current models for risk management team,
models deployed on azure databricks.

● Perform financial data analysis to generate reports that help the company make informed decisions
regarding credit risk.

● Develop financial and risk models to aid in the company's decision-making process.

● Implement data analysis tools and techniques to identify trends, patterns, and improvement
opportunities in credit risk management.

● Collaborate with other departments to ensure accuracy and integrity of financial data used in reports.

● Communicate findings and recommendations to leadership team members to help them make
informed decisions about credit risk.

● Contribute to the development of credit risk management policies and procedures to ensure
compliance with industry regulations and requirements.

● Maintain confidentiality of financial and customer information at all times

Main Technologies: Python, SQL, Pyspark, Pandas, Scikit-learn, Azure Databricks, Azure Data Factory, Power BI,
Excel.

Machine Learning Engineer at Anyone AI Sep 2022 to date


● Movie reviews classification Analyze sentiment in reviews for a movie streaming service.
Manipulated data that is not in a traditional format (Web scrapping), pre-processed it, and vectorized
text data using pre-trained vectorizer from NLTK library and TF-IDF. Trained a word embedding using
the data provided and used it as a vectorizer to transform all the reviews to number vectors. Trained a
sentiment analysis model to detect positive and negative opinions for movie reviews achieving +0.94
ROC AUC.
● Vehicle Image Classification for E-Commerce Predict vehicle make and model from unstructured e-
commerce images. Trained on a pre-built dataset of 196 classes. Visualized and cleaned the dataset,
pre-processed and augmented data using DETECTRON 2 from Google, and trained a fine-grained
classification model using ResNet-50 as convolutional neural network achieving 80% accuracy in the
prediction of make and model combined. Deployed in AWS instances using Docker, using an API-based
web-service application.
● Home Credit Risk Analysis Predicted whether a person applying for a home credit will be able to
repay his debt or not. Manipulated and visualized data, and performed data pre-processing for a large
dataset of +350,000 transactions. Trained many supervised models achieving +0.75 ROC AUC. Models
used were LogisticRegression, RandomForestClassifier, and LightGBM.
● Salary Prediction Model Collected and analyzed data via an API using Python and Pandas. The goal
was to predict salary levels based on historical data for sports players. Cleaned up data, generated
additional fields, stored and created a base dataset. Manipulated and visualized data. Performed
feature engineering and standardization. Selected evaluation metrics and baseline models. The initial
baseline model had an error (MAE) of USD 6M in the prediction, after experimenting with different
machine learning models and hyperparameters, getting as the final model a DecisionTreeRegressor I
decreased the mean absolute error rate by 50%.
Main Technologies: Python, Numpy, Pandas, Scikit-learn, Keras, PyTorch, Docker, AWS

Data Scientist Freelance Oct 2021 to date


Data preprocessing: cleansing, filtering, Transformation
Data analysis: Statistical methods, Plots, Reports
Data modeling and Machine Learning models

Projects

Cyber Attack Prediction Oct 2021 to Dec 2021

● Predicted whether a request or connection to a server is a cyber-attack. Collected the data from different
sources via SQL queries, initial dataset size was +90.000 samples and 152 features, pre-processed it,
created new features compressed using Autoencoders approach, increasing the number of features to
172, analyzed correlations between the all the features regarding the target with statistical methods,
leaving the top 20 features with more correlation score. Trained different Machine Learning models and
looking for the best results with the lower number of features to reduce the complexity and time
inference of the model, getting as final model a GradientBoostingClassifier using only 2 features
achieving 99.94% accuracy.
Main Technologies: SQL, python, pandas, numpy, Scikit-learn, Keras

London University Desertion Feb 2022 to Apr 2022

● Predicted the desertion of a student based on several factors such as Ethnicity, Gender, Scores, UCAS
Points among others. Cleaned, Imputed and Applied Feature Engineered to Data, Analyzed the current
data looking for high correlations between features, eliminated those with high correlation with SULOV
method using a XGBoost Machine Learning Model, then Balanced the data regarding the target creating
artificial samples using SMOTE. Selected evaluation metrics and baseline models. Trained and tested
different models, best model found was a RandomForestClassifier achieving 0.97 accuracy, 0.93 AUC,
0.88 recall, 0.95 precision, 0.91 F1 score.
Main Technologies: Python, pandas, numpy, Scikit-learn, XGBoost

Skills

Tech Skills: Python, C++, SQL, MySQL, Numpy, Pandas, Keras, Scikit-learn, Pytorch, Docker, AWS.
Agile Methodologies: Scrum
Other Tools: IOT, ESP32, Sensors, MATLAB.
Languages: Fluent in English, Spanish Native

Education

Metropolitan Institute of Technology – Medellin, Colombia Graduation 2023


Biomedical Engineer

You might also like