data parsing, merging, extracting, sorting, filtering, clustering andclassification. They tend to be computer scientists, programmers,database managers, data miners, and mathematicians. They oftenwork with Big Data that is compiled, validated, and processed inreal time.
favor methods involving data description,hypothesis testing, and predictive modeling. They tend to bestatisticians, business analysts, quality managers, risk assessors,and predictive modelers. They usually work with static datasetsthat have been extensively scrubbed in preparation for analysis.
There are many different methods a data scientist can rely on, bethey programming languages or analysis techniques. In practice,each data scientist tends to have a set of core methods that he or she uses routinely. Usually, the methods are what they learned inschool or have found to be successful in their work. Sometimes, the methods are researchfavorites or specialties they offer for an advantage over business competitors. That leads to twotypes of data scientist on a spectrum of work practice
generalists and specialists.
will use a variety of methods and software, even going so far as to learn new analysis techniquesor programming languages that might be applicable to a given dataset.
will rely ontechniques they know well and have used extensively in the past, modifying design elements andmethod specifications to find the best result for a dataset.
Data scientists also have a tendency to focus on either the domain of the
data or a method’s
experts honor the sources, meanings, and limitations of thedata elements they are studying. They tend to be goal oriented and methodologically flexible.
They are often willing to “bend the rules” a bit in order to conduct an analysis. They will use
data transformations and other model optimization techniques. They will examine violations of assumptions to assess the severity of impact and possible corrective measures before foregoing a
planned analysis. They’ll even consider using unconventional and controversial
approaches if they believe the action is warranted.
experts understandthe mathematical foundation of their analysis technique and how itis implemented by software. They often write their own code, evenfor routine tasks. They tend to follow rigorous plans and procedures. They
lay by the rules.”
They avoid deleting outliersand using transformations and stepwise techniques that mightcapitalize on chance. They will switch to alternative analyticalmethods upon
violation of a method’s assumption
Credentials are embodied in education and experience, the more the
better, at least in general. Beyond that, it’s impossible to quantify
credentials. Education stresses theory; experience stressesapplication. A good education involves brief exposure to a widevariety of ideas; experience involves a much longer exposure tofewer ideas. A degree represents a package of learning that may or
I'm an Analyzer.
I have my methods.