
Anuj Singh

PERSONAL INFORMATION

LinkedIn - Anuj Singh | Mumbai, India | P: +91 8879485747


GitHub - singhanuj695 (ANUJ SINGH) (github.com) | E-mail - Dsanuj21@gmail.com

WORK EXPERIENCE

Data Glacier Internship – Remote, India
Data Science Intern March - May 2023
● Gained a deeper understanding of version control, Agile (Scrum/Kanban), Docker, Flask, APIs, Streamlit, Python,
feature engineering, model interpretation, model governance, and stakeholder communication.
● Participated in webinars conducted by industry experts and learned relevant topics related to data and analytics.
● Completed individual and group assignments, gaining experience in the end-to-end lifecycle of data and
analytics, including data engineering and web deployment.

Tata Data Visualisation: Empowering Business with Effective Insights – Remote, India
Data Analytics & Visualisation Intern Feb - March 2023
● Worked on a data visualization project for an online retail e-commerce store.
● Analyzed and presented key insights on customer behavior, sales trends, and inventory management using
various visualization tools and techniques.
● Gained a deeper understanding of data visualization and its importance in communicating complex data
clearly, and further developed skills in data analysis, interpretation, and remote collaboration.

KPMG Virtual Internship in Data Analytics and Consulting – Remote, India
Data & Analytics Intern Feb - March 2023
Data Quality Assessment
● Targeted 250+ high-value customers based on customer age distribution, number of bike purchases in 3
years (and percentage of purchases), job industry category, wealth segment, and number of cars owned in each state.
Data Insights and Presentation
● Presented the findings of the previous task, such as target customers, top 10 goods, customer demographics, and
spending habits, in dashboards built with Power BI and Tableau.

Suvidh Foundation – Mumbai, India
Machine Learning Internship Jan - Feb 2023
● The internship centered on a text summarization project using a seq2seq model.
● Text summarization using seq2seq involves training a neural network model to learn how to summarize long
documents into shorter versions while preserving the most important information.
● The trained model can then be used to generate summaries for new text documents, which can be useful for
various applications such as news article summarization, document summarization, and chatbot responses.
● The model is evaluated based on various metrics such as ROUGE, BLEU, and F1 score to assess its performance in
summarizing text.

Youth India E-School – Mumbai, India
Market Research Analyst (Data Analyst) Jan - May 2021
● Designed research studies to gather data on market trends, consumer behavior, and industry competition.
● Collected and analyzed data using methods such as surveys, focus groups, and interviews.
● Reported findings and made recommendations based on the analysis.
● Stayed up to date on industry trends and research methods to keep the work accurate and relevant.

PROJECTS
Text Summarization Using Sequence to Sequence Model
● Built as the project for the Machine Learning Internship at Suvidh Foundation, using a seq2seq model.
● Text summarization using seq2seq involves training a neural network model to learn how to summarize long
documents into shorter versions while preserving the most important information.
● The trained model can then be used to generate summaries for new text documents, which can be useful for
various applications such as news article summarization, document summarization, and chatbot responses.
● The model is evaluated based on various metrics such as ROUGE, BLEU, and F1 score to assess its performance in
summarizing text.

Skin Cancer Detection Using CNN Model
● Collected and preprocessed a diverse dataset of skin cancer images to provide high-quality input for model
training. Applied data augmentation, normalization, and resizing to improve dataset variability and balance,
helping the CNN detect various types and stages of skin lesions.
● Selected appropriate CNN architectures, such as VGGNet, ResNet, or InceptionNet, for skin cancer detection and
implemented them using deep learning frameworks like TensorFlow or PyTorch.
● Trained and optimized CNN models for skin cancer detection. Split the dataset into training, validation, and
testing sets, and applied transfer learning and fine-tuning to improve model performance.
● Evaluated model performance on the testing set using accuracy, precision, recall, and F1 score, and analyzed
additional measures such as confusion matrices or ROC curves to assess model strengths and weaknesses.

Movie Recommendation Model using KNN
● Prepared and structured movie data for recommendation with the k-Nearest Neighbors (k-NN) algorithm.
● Computed distances between movies based on selected features, such as genre, actors, directors, or user
ratings, using measures like Euclidean distance, Manhattan distance, or cosine similarity. Used k-NN to select
the nearest neighbors from the calculated distances and determine the most similar movies.
● Created user profiles and modeled user preferences using k-NN. Analyzed user behavior, ratings, and
interactions with movies to identify patterns and personalize recommendations.
● Generated movie recommendations based on the selected neighbors and user preferences, providing a list of
top-rated or most similar movies. Evaluated the recommendation system using precision, recall, and ranking
metrics, and iteratively improved its performance.

CERTIFICATIONS & TRAININGS


Data Science | Board Infinity Oct 2019 – Nov 2021
Artificial Intelligence | Mycaptain.com

SKILLS

Technical Skills
● Languages: C, Python 3.x, R, SQL (MySQL)
● Databases: MySQL, Cassandra, Oracle
● BI Tools: Tableau, Power BI
● Machine Learning: Understanding of various supervised, unsupervised, and reinforcement learning techniques,
including linear regression, decision trees, SVM, KNN, Naïve Bayes, and random forests.
● Big Data: Understanding of the Hadoop ecosystem, including PySpark, HDFS, MapReduce, and
Hive (HiveQL).
● Deep Learning: CNNs, RNNs, Long Short-Term Memory networks (LSTMs), Generative
Adversarial Networks (GANs), Multilayer Perceptrons (MLPs), and autoencoders.

Personal Skills
Communication, Networking, Negotiation, People Management

Languages
English, Hindi, Marathi (Conversational)

EDUCATION
S.I.E.S College of Arts, Science and Commerce, Mumbai University April 2023
Master of Science in Data Science; CGPA: 8.60/10

S.I.E.S College of Arts, Science and Commerce, Mumbai University April 2021
Bachelor of Science in Physics

Higher Secondary School (HSC), Central Board of Secondary Education
Science and Mathematics

Secondary School (SSC), Central Board of Secondary Education
Science

ACHIEVEMENTS
● Worked with Star TV and Channel V on a multi-state college festival known as VFest

● Runner-up at an All India Football Tournament by AIFF under my leadership

● Super Division football player, Mumbai District

● Self-employed (12 years) - Retail Wholesale Shop
