
Rohit Raju Kamble

Data Science Intern


rohit98ckd@gmail.com +918971664737 RAM NAGAR, CHIKODI LinkedIn | Rohit K
GitHub | Rohit K Portfolio | Rohit K

PROFILE
I am a graduate with a Master's degree in Mathematics from Karnatak University Dharwad (2022) and currently an
intern in the field of data science. My interest in coding was sparked during my PUC, when I chose computer science
as an optional subject, and it gave me valuable skills and experience that later helped me with more advanced coding. I
have a strong foundation in both theoretical and practical skills. My background in mathematics has equipped me
with a strong analytical mind, while my internship has allowed me to gain hands-on experience working with
real-world data sets and applying my knowledge to practical situations. With this combination of education and
experience, I am well prepared to excel in a career as a data scientist and make meaningful contributions to the field.

SKILLS

IDEs : (Jupyter Notebook, Spyder, VS Code, PyCharm, MySQL, Google Colab.)

Technical Skills : (Python, MySQL, C, C++, Statistics, Machine Learning, Deep Learning, Exploratory Data
Analysis (EDA), Power BI, MS Excel, SPSS, Web Scraping, Document Scraping, NLP, ANN, CNN, RNN,
LSTM, BERT, T5, Transformers.)

Libraries : (NumPy, Pandas, scikit-learn, SciPy, TensorFlow, PyTorch, Keras, NLTK, spaCy, Beautiful Soup, Flask,
Streamlit, OpenCV, Git.)

EDUCATION
M.Sc Mathematics, Karnatak University Dharwad. 2020 – 2022 | Dharwad, India
Studied Mathematics along with the C programming language, securing 83.4%, with a strong grip on
subjects such as Linear Algebra, Differential Calculus, Operations Research, and Topology.
Project on "A STUDY ON WAVELETS, LEGENDRE WAVELET FOR THE SOLUTION OF THE VARIATION
PROBLEMS AND ITS APPLICATIONS IN NEUROSCIENCE".

EXPERIENCE
Data Science Intern, Ai Variant 2022 – present | Bangalore, India
Utilized algorithmic and programming tools to build helpful predictive models. Created reports and statistical
analysis as required.
Utilized exploratory data analysis techniques to identify patterns, relationships, and trends. Worked on AI/ML
projects.

COURSES
Post Graduation Program in Data Science, Excelr Solutions Bangalore 2022 – 2023 | Bangalore, India
Studied machine learning topics such as Regression, Clustering, Decision Trees, NLP, PCA, Regularization,
Support Vector Machines, Neural Networks, and Scraping, as well as deep learning and basics of AI such as ANN, CNN,
RNN, and LSTM with TensorFlow and Keras, and computer vision tools such as OpenCV and YOLO.

PROJECTS
FINE TUNING BERT MODEL
Fine-tuning a BERT model for generating Boolean questions from a given context, using Transformers such as T5 and
other models from the Hugging Face Hub.

By training these models on a dataset of context-question pairs, we can adapt them to generate relevant and
grammatical Boolean questions for any given context. T5 is based on the encoder-decoder architecture (BERT is
encoder-only), and both are pre-trained on large amounts of unlabeled data through self-supervised learning.
TIME SERIES FORECASTING USING ARIMA, SARIMA AND LSTM, Stock Price Forecasting
Objective: Predict the future stock prices of Apple shares by building ARIMA, SARIMA, and LSTM models.
Approach: Data sourcing, introduction, problem statement, data information, data pre-processing, missing
value treatment, visualizing the time series, and building the different models.
Tools Used: Python, Spyder.
Key Skills: Exploratory data analysis and feature engineering; visualizing the time series at different seasonal
granularities (monthly, quarterly, and yearly); checking for stationarity; differencing; plotting ACF and PACF
plots; and building ARIMA, SARIMA, and LSTM models.
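
The differencing and ACF steps named in this project can be illustrated with a minimal sketch. It uses only the Python standard library and a made-up linear price series; the function names `difference` and `autocorrelation` are hypothetical and are not taken from the project's code, which likely relied on a library for these steps.

```python
def difference(series, lag=1):
    """Return the lag-differenced series: y[t] - y[t - lag]."""
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


def autocorrelation(series, lag):
    """Sample autocorrelation at one lag (the value an ACF plot shows per lag)."""
    n = len(series)
    mean = sum(series) / n
    variance_sum = sum((x - mean) ** 2 for x in series)
    if variance_sum == 0:
        # A constant series has no variation to correlate.
        return 0.0
    covariance_sum = sum(
        (series[t] - mean) * (series[t - lag] - mean) for t in range(lag, n)
    )
    return covariance_sum / variance_sum


# Made-up, illustrative data: a pure upward trend (not real Apple prices).
prices = [100 + 2 * t for t in range(20)]

# A trending series is non-stationary and shows high lag-1 autocorrelation.
acf_before = autocorrelation(prices, 1)

# First-order differencing removes the linear trend.
diffed = difference(prices)
acf_after = autocorrelation(diffed, 1)

print(acf_before, acf_after)
```

On this toy series, differencing turns the trend into a constant series, which is the effect the "d" term in ARIMA is meant to achieve before the AR and MA orders are read off ACF and PACF plots.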
RESUME DOCUMENTS CLASSIFICATION AND PARSING, Extracting the data from resumes and classifying them (NLP project)
Objective: This project uses NLP techniques to classify resumes into different categories based on
their content. The goal is to help recruiters and hiring managers efficiently sort through a large
number of resumes and identify the most suitable candidates for a particular job.
Approach:
Data Sourcing: Collecting various resumes with different skill sets.
Text Extraction: Extracting text from the resume documents using document reader libraries and optical character
recognition (OCR) technology.
Preprocessing: Removing stop words, stemming, and lemmatizing the text.
Classification: Using machine learning algorithms such as Naive Bayes, Support Vector Machines, and Random
Forests to classify the resumes into different categories.
Evaluation: Evaluating the performance of the algorithms using metrics such as precision, recall, and F1-score.
The final output of the project is a classification model that can classify a new resume into the appropriate
category based on its content.
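
The evaluation metrics mentioned for this project can be sketched as follows. This is a minimal, standard-library illustration of how precision, recall, and F1-score are computed for one category; the function name `precision_recall_f1` and the labels are hypothetical, and the project itself may well have used a library routine instead.

```python
def precision_recall_f1(y_true, y_pred, positive):
    """Compute precision, recall, and F1 for one category treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up labels for five resumes (illustrative only).
y_true = ["data", "data", "web", "web", "data"]
y_pred = ["data", "web", "web", "data", "data"]

precision, recall, f1 = precision_recall_f1(y_true, y_pred, positive="data")
print(precision, recall, f1)
```

Here the "data" category has two true positives, one false positive, and one false negative, so all three metrics come out to 2/3.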


QUORA DUPLICATE QUESTION PAIRING USING RNN, NLP project
Objective: Build a machine learning model that can identify duplicate question pairs on Quora. Given two
questions, the model should predict whether they are duplicates or not. The project involves data
preprocessing, feature engineering, building and training a classification model, and evaluating its performance
on a test dataset.
Approach:
Data Collection: Collect a large dataset of question pairs from Quora.
Data Preprocessing: Perform data cleaning and pre-processing tasks such as removing stop words, stemming, and
lemmatization.
Feature Engineering: Extract relevant features from the pre-processed data such as word overlap, word order,
and semantic similarity.
Model Selection and Training: Select an appropriate classification model (e.g., Logistic Regression, XGBoost) and
train it on the preprocessed data.
Model Evaluation: Evaluate the performance of the model using various metrics such as accuracy, precision,
recall, and F1-score.
Key Skills: NLP techniques such as tokenization, stop word removal, stemming, and lemmatization; data
cleaning and pre-processing techniques; feature engineering techniques such as word embedding,
topic modeling, and text similarity measures.
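
As an illustration of the word-overlap feature mentioned in this project, the sketch below computes a Jaccard overlap between two questions using only the standard library. The helper names `tokenize` and `jaccard_overlap` and the example questions are hypothetical, and Jaccard is just one possible overlap measure; the source does not say which one was used.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard_overlap(question1, question2):
    """Share of distinct words common to both questions (0.0 to 1.0)."""
    words1, words2 = tokenize(question1), tokenize(question2)
    union = words1 | words2
    if not union:
        # Both questions are empty after tokenization.
        return 0.0
    return len(words1 & words2) / len(union)


# Made-up question pair (illustrative only).
q1 = "How do I learn Python?"
q2 = "How can I learn Python quickly?"

overlap = jaccard_overlap(q1, q2)
print(overlap)
```

A score like this would typically be one column among several features fed to the classifier, alongside word-order and semantic-similarity features.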

CERTIFICATES
Data Science Certificate, Excelr Solutions - 2023
Machine Learning with Python Certificate, IBM
Data Analytics Certificate, Excelr Solutions - 2023

LANGUAGES
English | Kannada | Hindi

INTERESTS
Traveling | Reading Tech blogs | Watching Tech videos | Drawing | Fitness
