Professional Documents
Culture Documents
Data Science Roadmap
Data Science Roadmap
HISTORY
MONTH
BY THE RAVIT SHOW
february 2024
FOUNDATION IN
MATHEMATICS
BLACK
AND STATISTICS
HISTORY
LINEAR ALGEBRA: VECTORS,
MATRICES, MATRIX
MULTIPLICATION,
MONTH
EIGENVECTORS, AND
EIGENVALUES.
CALCULUS: LIMITS,
DERIVATIVES, AND INTEGRALS.
PROBABILITY AND
STATISTICS: PROBABILITY
Presented by Olivia Wilson
THEORY, STATISTICAL
INFERENCE, BAYESIAN
STATISTICS, AND STATISTICAL
MODELLING.
HYPOTHESIS TESTING:
HYPOTHESIS FORMULATION,
NULL HYPOTHESIS,
february 2024
ALTERNATIVE HYPOTHESIS,
STATISTICAL SIGNIFICANCE,
AND P-VALUES.
PROGRAMMING
BLACK
LANGUAGES
PYTHON: WIDELY USED IN DATA
HISTORY
SCIENCE FOR ITS SIMPLICITY,
VERSATILITY, AND LARGE
COMMUNITY OF USERS. PYTHON
MONTH
LIBRARIES FOR DATA SCIENCE
INCLUDE NUMPY, PANDAS,
MATPLOTLIB, SCIKIT-LEARN, AND
TENSORFLOW.
DATA PREPROCESSING:
SCALING,
NORMALIZING,
STANDARDIZING, AND
ENCODING DATA.
DATA
TRANSFORMATION:
FEATURE ENGINEERING,
AGGREGATION,
DISCRETIZATION, AND
BINNING.
DATA
BLACK
VISUALIZATION
HISTORY
TYPES OF VISUALIZATIONS:
BAR GRAPHS, HISTOGRAMS,
SCATTERPLOTS, HEATMAPS,
MONTH
BOXPLOTS, AND MORE.
GOOD PRACTICES
Presented by Olivia Wilson FOR
VISUALIZATION: CHOOSING
THE RIGHT CHART, USING
COLORS EFFECTIVELY,
REMOVING CHART JUNK,
ADDING CONTEXT, AND
TELLING A STORY.
february 2024
MACHINE LEARNING
BLACK
SUPERVISED LEARNING:
HISTORY
REGRESSION, CLASSIFICATION,
DECISION TREES, RANDOM
FORESTS, AND GRADIENT
BOOSTING MACHINES.
MONTH
UNSUPERVISED LEARNING:
CLUSTERING, DIMENSIONALITY
REDUCTION, PRINCIPAL
COMPONENT ANALYSIS, AND
SINGULAR VALUE
DECOMPOSITION.
Presented by Olivia Wilson
REINFORCEMENT LEARNING:
MARKOV DECISION PROCESSES,
Q-LEARNING, AND POLICY
GRADIENT METHODS.
february 2024
NEURAL NETWORKS:
FEEDFORWARD NETWORKS,
CONVOLUTIONAL NEURAL
NETWORKS, AND RECURRENT
NEURAL NETWORKS.
CONVOLUTIONAL NEURAL
NETWORKS: IMAGE
RECOGNITION, OBJECT
DETECTION, AND IMAGE
SEGMENTATION.
RECURRENT NEURAL
NETWORKS: NATURAL
LANGUAGE PROCESSING, TEXT
GENERATION, AND SPEECH
RECOGNITION.
NATURAL LANGUAGE
PROCESSING
BLACK
PREPROCESSING TEXT:
HISTORY
TOKENIZATION, STEMMING,
LEMMATIZATION, AND STOP
WORD REMOVAL.
MONTH
FEATURE EXTRACTION FROM
TEXT: BAG OF WORDS, TF-IDF,
WORD EMBEDDINGS, AND
SENTIMENT LEXICONS.
SENTIMENT ANALYSIS:
BINARY CLASSIFICATION,
Presented by Olivia Wilson
MULTI-CLASS
CLASSIFICATION, AND
ASPECT-BASED SENTIMENT
ANALYSIS.
RECOGNITION.
BIG DATA
BLACK
HISTORY
HADOOP: DISTRIBUTED STORAGE
AND PROCESSING OF LARGE
DATA SETS.
MONTH
SPARK: FAST AND GENERAL
ENGINE FOR LARGE-SCALE DATA
PROCESSING.
Presented by Olivia Wilson
NOSQL: NON-RELATIONAL
DATABASES FOR UNSTRUCTURED
AND SEMI-STRUCTURED DATA.
february 2024
INDUSTRY-SPECIFIC
APPLICATIONS: HEALTHCARE,
FINANCE, MARKETING, AND
MORE.