
https://www.tatvasoft.com/blog/etl-process-extract-transform-load/
Text mining is widely adopted in knowledge-driven organizations. It involves examining large collections of documents,
often for research purposes. Text mining identifies patterns, uncovers relationships, and supports assertions based on
the patterns it finds within large volumes of text.
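As a rough illustration of what "identifying patterns and relationships" in text can mean, the sketch below counts term frequencies and same-document term pairs using only the Python standard library. The three documents and the stopword list are invented for this example; a real project would work on a far larger corpus and would likely use a dedicated text-processing library.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical mini-corpus, invented purely for illustration.
documents = [
    "Customers praise fast delivery and friendly support",
    "Slow delivery caused several customer complaints",
    "Support resolved the delivery complaints quickly",
]

# Assumed stopword list; real lists are much longer.
STOPWORDS = {"and", "the", "several", "caused"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


# Pattern: which terms occur most often across the corpus?
term_counts = Counter(tok for doc in documents for tok in tokenize(doc))

# Relationship: which pairs of terms appear together in the same document?
pair_counts = Counter()
for doc in documents:
    unique_terms = sorted(set(tokenize(doc)))
    pair_counts.update(combinations(unique_terms, 2))

top_term, top_term_count = term_counts.most_common(1)[0]
print(top_term, top_term_count)
print(pair_counts[("complaints", "delivery")])
```

Simple counts like these are only a starting point, but they show the basic idea: frequent terms hint at themes, and frequent pairs hint at relationships worth examining.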
Statistical analysis is the process of collecting and analyzing large volumes of data in order to identify trends and develop
valuable insights. In the professional world, statistical analysts take raw data and find correlations between variables to
reveal patterns and trends to relevant stakeholders.

Statistical analysis can be based on a whole population or on a sample of it, and includes the collection, analysis,
interpretation, presentation, and modeling of data. It falls into two categories: descriptive analysis, which summarizes
the data at hand, and inferential analysis, which draws conclusions about a population from a sample.
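The difference between the two categories can be sketched with Python's standard `statistics` module. The sample values are hypothetical, and the confidence interval uses a normal approximation for simplicity; with a sample this small, a t-based interval would usually be the better choice.

```python
import math
import statistics

# Hypothetical sample of order values drawn from a larger population.
sample = [12.0, 15.0, 11.0, 14.0, 13.0, 16.0, 10.0, 13.0]

# Descriptive analysis: summarize the sample itself.
mean = statistics.mean(sample)
median = statistics.median(sample)
stdev = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)

# Inferential analysis: estimate the population mean from the sample.
# Assumption: a normal approximation is acceptable for this illustration.
standard_error = stdev / math.sqrt(len(sample))
z = statistics.NormalDist().inv_cdf(0.975)  # two-sided 95% critical value
ci_low = mean - z * standard_error
ci_high = mean + z * standard_error

print(f"mean={mean}, median={median}, stdev={stdev}")
print(f"approx. 95% CI for the population mean: ({ci_low:.2f}, {ci_high:.2f})")
```

The first three values describe only the eight observations; the interval is a statement about the wider population those observations came from.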
Business question
• What is the stated business question?
• What is the intent underlying the question (e.g., what is the context, what is the impacted segment, and what are
stakeholders’ current thoughts about the underlying reasons)?
• What business considerations (e.g., stakeholders, timeline, and cost) are likely to impact the analysis?

Analysis plan
• What is the analysis goal?
• What hypotheses are to be tested?
• What data is required/available to test the hypotheses?
• What methodology(-ies) will you employ?
• What is the project plan (timeline and milestones, risks, phasing, prioritization, …)?

Data collection
• From where can the data be obtained?
• How must the data be cleansed and validated?

Insights
• What patterns do you see in the data?
• Is each of the hypotheses proven or disproven?
• How much confidence should stakeholders place in the results?
• How do you rank your findings in terms of quantified impact on the business?

Recommendation
• How can you most effectively present the results of your analysis to your stakeholders (in terms they can
understand and in alignment with information they’ll value)?
• Note: A generic template for a recommendation presentation or report might include:
  • Objective
  • Background (optional)
  • Scope (optional)
  • Approach (optional)
  • Recommendations
  • Key insights with impact
  • Next steps

CHALLENGE IN DATA ANALYTICS

• The amount of data being collected
• Data Validation and Data Cleaning
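
To make the validation and cleaning challenge concrete, here is a minimal sketch of one possible approach: check each record against a few rules, reject the ones that fail, and normalize the rest. The records, field names, and rules (an age between 0 and 120, an email containing "@", unique ids) are all assumptions invented for this example, not rules taken from the source.

```python
# Hypothetical raw records, as they might arrive from a source system.
raw_records = [
    {"id": "1", "age": "34", "email": "a@example.com"},
    {"id": "2", "age": "", "email": "b@example.com"},       # missing age
    {"id": "3", "age": "-5", "email": "c@example.com"},     # age out of range
    {"id": "1", "age": "34", "email": "a@example.com"},     # duplicate id
    {"id": "4", "age": "41", "email": "not-an-email"},      # malformed email
    {"id": "5", "age": " 29 ", "email": " E@Example.com "}, # needs normalizing
]


def validate(record):
    """Return a list of problems found in one record (empty if it is valid)."""
    problems = []
    try:
        age = int(record["age"].strip())
    except ValueError:
        problems.append("age is missing or not an integer")
    else:
        if not 0 <= age <= 120:  # assumed plausible range
            problems.append("age out of range")
    if "@" not in record["email"]:
        problems.append("email lacks '@'")
    return problems


def clean(records):
    """Split records into normalized valid rows and rejected rows with reasons."""
    cleaned, rejected, seen_ids = [], [], set()
    for record in records:
        problems = validate(record)
        if record["id"] in seen_ids:
            problems.append("duplicate id")
        if problems:
            rejected.append((record["id"], problems))
            continue
        seen_ids.add(record["id"])
        cleaned.append({
            "id": int(record["id"]),
            "age": int(record["age"].strip()),
            "email": record["email"].strip().lower(),
        })
    return cleaned, rejected


cleaned, rejected = clean(raw_records)
print(cleaned)
print(rejected)
```

Keeping the rejected rows together with their reasons, rather than silently dropping them, makes it easier to report data-quality issues back to whoever owns the source.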
