You are on page 1of 11

Name: Ehsan Raza

Roll No: F21RDOCS1M08054


Department: BSCS
Semester: 6th(M2)
Subject: Data Science
Assignment:
Question 1:
Describe the job roles of Data Science?

Answer:

Job roles of Data Science:


A new generation of analytical data professionals, the data scientist is a relatively new vital actor
in enterprises. They are a mix of mathematicians and computer scientists who govern the big
data world. Businesses nowadays are grappling with massive amounts of unstructured data,
which may be a virtual gold mine if found and used to generate income. They do, however,
require people who can delve into the data and extract useful business insights, sifting through
the chaff to discover the gold nuggets. That is what a data scientist does, which is why they are
in high demand and well compensated.
Organizations are increasingly relying on data in their daily operations. A data scientist interprets
raw data and pulls meaningful information from it. They then analyze data to look for patterns
and propose solutions that will help an organization to grow and compete.

1. Data Scientist:
Data scientists are in charge of comprehending business difficulties and proposing the best
solutions through data analysis and processing. They will be required to undertake predictive
analysis and go over "unstructured/disorganized" data with a fine-toothed comb in order to
provide actionable insights. They can also do so by spotting trends and patterns that might aid
businesses in making better judgments.
Few Important Roles and Responsibilities of a Data Scientist include:
 Identifying data collection sources to meet business requirements
 Data processing, cleansing, and integration 
 Data collection and management process automation
 Improving processes with Data Science techniques/tools 
 Analyzing massive volumes of data to identify trends and deliver reports with
recommendations
 Collaborating with the product, engineering, and business teams
What Does it Take to Become a Data Scientist?
You must be proficient in R, MatLab, SQL, Python, and other complementary technologies in
order to work as a data scientist. If you have a higher degree in mathematics, computer
engineering, or a related field, it can also help.

2. Data Analyst:
Data analysts are in charge of a variety of tasks, including visualizing, transforming, and
manipulating data. They're also in charge of web analytics and A/B testing analysis on occasion.
Data analysts are often in charge of preparing data for corporate communications since they are
in control of visualization. They must also run queries against databases from time to time.
Optimization is one of a data analyst's most significant skills. Analysts create reports that
successfully display the trends and insights gained from their research in a way that non-experts
can comprehend.
Few Important Roles and Responsibilities of a Data Analyst include:
 Using automated techniques to extract data from primary and secondary sources
 Creating and maintaining databases
 Analyzing data and generating reports with recommendations
 Collaborating with other team members to improve data gathering and quality processes 
 Analyzing data and projecting patterns that affect the organization/project
How do you go about becoming a data analyst?
SQL, R, SAS, and Python are some of the most popular data analysis tools. As a result,
accreditation in these areas can readily increase your job applications. You should also be able to
solve problems effectively.

.3. Data Engineer:


Data engineers are in charge of planning, constructing, and maintaining data streams. They must
test business ecosystems and prepare them for data scientists to run their algorithms on stable
and highly optimized data systems. Data engineers also work on batch processing of collected
data and ensuring that it is formatted correctly for storage. To boost database performance, data
engineers also update existing systems with newer or improved versions of current technologies.
Few Important Roles and Responsibilities of a Data Engineer include:
 Designing and maintaining data management systems 
 Data collection, acquisition, and management 
 Primary and secondary research 
 Using data to uncover hidden patterns and forecast trends 
 Collaborating with other teams to understand organizational goals 
 Creating and maintaining data management systems 
What Does it Take to Become a Data Engineer?
Hive, NoSQL, R, Ruby, Java, C++, and Matlab are all technologies that require hands-on
knowledge if you want to work as a data engineer. Working with popular data APIs and ETL
tools, for example, would be advantageous.

4. Data Architect:
Data architects develop and build new database systems to meet the needs of a certain business
model. Architects are responsible for both the functional and administrative aspects of these
database systems. Architects, in other words, keep track of the data and select who has access to
view, utilize, and alter certain areas of it. A data architect is a person who provides the blueprints
for data management so that databases may be readily integrated, consolidated, and protected
using the most up-to-date security procedures. They also make certain that the data engineers
have the most up-to-date tools and systems with which to operate.
Few Important Roles and Responsibilities of a Data Architect include:
 Identifying data collecting sources in line with data strategy 
 Developing and implementing overall data strategy in line with business/organization
 Working cooperatively with cross-functional teams and stakeholders to ensure smooth
operations of database systems
 End-to-end data architecture planning and management
 Keeping database systems/architecture up to date in terms of efficiency and security
 Auditing the functioning of the data management system on a regular basis and making
modifications to improve systems accordingly.
What Does it Take to Become a Data Architect?
Expertise in data warehousing, data modelling, extraction transformation and loan (ETL), and
other areas is required for a job in data architecture. You should also be familiar with Hive, Pig,
and Spark, among other things.

5. Data Storyteller:
Data storytelling is not simply about displaying the data and creating reports to share
information; It's about identifying the narrative that best reflects the data and generating creative
ways to express that narrative. Data storytelling is a hybrid of pure, unprocessed data analysis
with human-centered communication. A data storyteller must take data, simplify it so that it can
be focused on a single component of it, study its behaviour, and then utilize their own insights to
create a captivating tale that helps people better comprehend a phenomenon.
6. Machine Learning Scientist:
A machine learning scientist looks for innovative ways to manipulate data in order to create new
algorithms. They're usually part of the R&D department, and their work usually results in
research papers being published. Rather than working in industry, most machine learning
scientists work in academia. Machine learning scientists are sometimes referred to as research
scientists or research engineers.

7. Machine Learning Engineer:


Today, machine learning engineers are in high demand. They must be acquainted with numerous
machine learning algorithms such as clustering, categorization, and classification, as well as stay
current on the field's latest research developments.
Machine learning engineers must have excellent statistics and programming abilities, as well as a
basic understanding of software engineering, in order to do their jobs properly. Machine learning
engineers must execute tests (such as A/B tests) while monitoring the performance and operation
of the various systems in addition to creating and building them.
In addition to having in-depth understanding of some of the most powerful technologies
such as SQL, REST APIs, etc. machine learning engineers are expected to perform A/B testing,
design data pipelines, and implement common machine learning algorithms such as
classification, clustering, and others. 
Few Important Roles and Responsibilities of a Machine Learning Engineer include:
 Extending existing Machine Learning frameworks and libraries 
 Exploring and visualizing data for better understanding 
 Training and retraining systems 
 Designing and developing Machine Learning systems 
 Researching on Machine Learning Algorithms 
 Testing Machine Learning systems 
 Developing apps/products based on client requirements
What Are the Steps to Becoming a Machine Learning Engineer?
To begin, you must have a thorough understanding of some of the technologies, such as Java,
Python, and JS. Second, you must have a solid understanding of statistics and mathematics. It's a
lot easier to succeed a job interview once you've mastered both.

8. Database Administrator:
Many companies currently design database systems based on specific business requirements, but
the company that purchases the product manages the system. A company will pay a person (or a
team) to handle the database in such instances. A database administrator will keep track of the
data flow and monitor the database for proper operation while creating backups and recoveries.
Security is also overseen by administrators, who grant different permissions to employees based
on their job requirements and level of employment.
A database administrator's job description is fairly self-explanatory: they are responsible for
the proper operation of all of an enterprise's databases and give or revoke services to corporate
personnel based on their needs. They're also in charge of database backups and restores.

9. Technology Specialized Roles:


Data science is a rapidly evolving discipline, and as it matures, more specialized technologies,
such as artificial intelligence (AI) and machine learning (ML), will emerge. As a result, as the
sector matures, more specialized job categories will develop. For instance, AI experts, deep
learning experts, NLP (natural language processing) experts, and so on. Data scientists and
analysts are included in this growing specialty subject. For instance, a transportation DS
specialist or a marketing storyteller. These positions will be tailored to the needs of the company
and will likely reduce the workload of generalist scientists and engineers. The demand for data
scientists is growing as the area of data science develops, and firms are creating new
employment on a daily basis to satisfy the industry's massive expectations. Data science
professions come with a wide range of duties.

10. Statistician:
As the name implies, a statistician is well-versed in statistical theory and data organization. They
not only extract and provide significant insights from data clusters, but they also assist engineers
in developing new approaches.
Few Important Roles and Responsibilities of a Statistician include:
 collecting, analyzing, and interpreting data 
 analyzing data, assessing results, and predicting trends/relationships using statistical
methodologies/tools 
 designing data collection processes 
 communicating findings to stakeholders 
 advising/consulting on organizational and business strategy based on data
How do you go about becoming a statistician?
A statistician must have a strong interest in reasoning. They are also proficient in a range of
database systems, including SQL, data mining, and machine learning.

11. Business Analyst:


Business analysts have a slightly distinct role than other data scientists. They understand how
data-oriented technologies function and how to handle massive volumes of data, but they also
know how to distinguish high-value data from low-value data. To put it another way, they figure
out how Big Data can be linked to valuable business insights to help companies grow.
Few Important Roles and Responsibilities of a Business Analyst include:
 Conducting extensive business analysis – highlighting issues, opportunities, and
solutions 
 Improving existing business processes 
 Pricing analysis
 Budgeting and forecasting

Question 2:
Explain various applications of Data Science?

Answer:
Applications of Data Science:
Data science has now overtaken almost every business on the earth. There isn't a single industry
on the planet that doesn't rely on data these days. As a result, data science has turned into a
source of energy for companies. Data science, often known as data-driven science, is a branch of
statistics and computers that transforms data into useful knowledge.
Data science brings together tools from a variety of fields to collect data, analyse it, produce new
insights, and apply it to make decisions. The technological disciplines that make up the data
science field include data mining, statistics, machine learning, data analytics, and some
programming. Data science is swiftly becoming one of the most sought-after specialties, having
applications across a wide range of industries. We all know it's changed the way we think about
data.
1. Healthcare
Data science applications are extremely beneficial to the healthcare industry. In the healthcare
industry, data science is making significant progress. In health care, data science is employed in
a range of areas. Image Analysis in Medicine, Genetics and Genomics, and Drug Development
are among them.
 Medical Image Analysis:
Procedures including detecting malignancies, artery stenosis, and organ delineation use a
number of methods and frameworks like MapReduce to find suitable parameters for
duties like lung texture categorization. It employs machine learning techniques such as
support vector machines (SVM), content-based medical picture indexing, and wavelet
analysis to classify solid textures.

 Genetics & Genomics:


Data Science applications also provide a better level of therapy customisation through
genetics and genomics research. The goal is to uncover particular molecular linkages
between genetics, disorders, and pharmaceutical response in order to gain a better
understanding of how DNA affects human health.
 Drug Development:
Data science applications and machine learning algorithms simplify and shorten the
process, offering a new perspective at each stage, from the first screening of medicinal
substances to the prediction of the success rate based on biological characteristics.
Instead of "lab tests," these algorithms use sophisticated mathematical modelling and
simulations to forecast how a chemical would behave in the body. The purpose of
computational drug discovery is to create computer model simulations in the form of a
biologically suitable network, making it easier to accurately predict future events.

2. Targeted Advertising:
From display banners on various websites to digital billboards at airports, data science
algorithms are utilised to determine nearly anything. This is why digital ads have a far higher
CTR (Call-Through Rate) than traditional ads. They can be customised based on a user's
previous behaviour. That's why, in the same region, you can see an advertisement for Data
Science Training Programs while someone else would see an advertisement for clothes.

3. Digital Advertisements:
Digital advertising also makes use of data science techniques. Despite the fact that internet
browsing is one of the most important uses of data science and machine learning, the full digital
marketing spectrum is another. Banners on various websites and digital billboards at airports are
shown using data science techniques. As a result, digital ads have a higher click-through
rate than traditional ads.

4. Website Recommendations:
Many firms have taken advantage of this engine to promote their products based on user interest
and information relevance. To improve the customer experience, firms like Amazon, Twitter,
Google Play, Netflix, LinkedIn, IMDb, and others adopt this strategy. The suggestions are based
on the results of a subscriber's previous searches.

5. Internet Searching:
Data science algorithms are used by all of these search engines, including Google, Yahoo, Bing,
Ask, AOL, and others, to deliver the best result for our searched query in a matter of seconds.
Because Google handles about 20 petabytes of data on a daily basis. Google would not be the
company it is today if data science had not been invented.

6. E-Commerce:
Natural language processing (NLP) and recommendation systems are examples of data science
and machine learning ideas that aid the e-commerce industry. E-commerce platforms could
employ such methodologies to analyse consumer purchases and comments in order to acquire
vital information for their business development. They analyse texts and online questionnaires
using natural language processing (NLP). It is used in collaborative and content-based filtering to
evaluate data and provide better services to its customers. Data science has influenced the data
science sector in a variety of ways, including recognising the consumer base, anticipating goods
and services, identifying the style of popular items, optimising pricing structures, and more.

7. Transport:
The emergence of self-driving autos is the most significant breakthrough or evolution that data
science has brought us in the world of transportation. Data science has developed a foothold in
transportation through a detailed examination of fuel usage patterns, driver behaviour, and
vehicle monitoring. It is establishing a name for itself by making driving circumstances safer for
drivers, enhancing vehicle performance, granting drivers more autonomy, and much more. Using
reinforcement learning and integrating autonomy, vehicle manufacturers can create smarter cars
and better logistical routes.

8. Text and Advanced Image Recognition:

Data science algorithms are in charge of speech and image recognition. We may see the excellent
work of these algorithms in our daily lives such as virtual speech assistants like Google
Assistant, Alexa, or Siri with their speech recognition engine working in the background, striving
to understand and assess your words and offering relevant results as a result of your use. Among
other social media platforms, image recognition can be found on Facebook, Instagram, and
Twitter. 

9. Gaming:

As the player passes through the stages, machine learning algorithms are increasingly being
employed to develop games that grow and upgrade. Your opponent (computer) in motion gaming
likewise studies your previous acts and adapts its game accordingly. Data science has been
employed by EA Sports, Zynga, Sony, Nintendo, and Activision-Blizzard to take gaming to the
next level.

10. Security:

Data science can be used to strengthen your company's security and protect sensitive data. For
example, banks deploy advanced machine-learning algorithms to detect fraud based on a user's
typical financial behaviour. These algorithms can detect fraud faster and more precisely than
individuals because of the vast volume of data collected every day. Even if you don't work in a
financial institution, you can utilise such algorithms to protect sensitive information.
Understanding data privacy can assist your company avoid misusing or revealing sensitive
consumer information including credit card numbers, medical records, Social Security numbers,
and contact information.

11. Finance:

Finance was one of the first industries to use data applications. Businesses were fed up with poor
loans and losses year after year. They did, however, have a lot of information gathered during the
initial loan application. To assist them recoup from their losses, they decided to hire data
scientists. Because both finance and data science deal with data, they are closely intertwined.
Companies used to have a lot of paperwork to start authorising loans, keep them current, lose
money, and go into debt. As a result, data science methods have been presented as a possible
solution. In order to estimate risk possibilities, they learned to separate the data by customer
profile, previous spending, and other essential factors. It also assists in the promotion of banking
products based on consumer purchasing power. Another example is customer portfolio
management, which evaluates data trends using business intelligence tools for data science. Data
science also provides algorithmic training, allowing financial institutions to make data-driven
decisions using rigorous data analysis. As a result, consumers will have a better customer
experience since financial institutions will be able to develop a personalised relationship with
their clients based on detailed research and adjustment of preferences.

12. Customer Insights:

Client data may reveal a lot about their habits, demographics, interests, and aspirations, among
other things. With so many different sources of consumer data, having a rudimentary
understanding of data science can help you make sense of it all. You might collect information
about a customer every time they visit your website or physical store, add an item to their basket,
make a purchase, read an email, or interact with a social media post, for example. After you've
double-checked that the data from each source is correct, you'll need to combine it in a data
wrangling process.

One example of this is matching a customer's email address to their credit card information,
social media handles, and transaction identifications. By merging the data, you can make
assumptions and uncover trends in their behaviour. Understanding your customers and what
motivates them can help you ensure that your product meets their demands and that your
marketing campaigns are successful.

13. Augmented Reality:

This is the final data science application that appears to have the greatest future promise. The
word "augmented reality" refers to one of the most interesting applications of technology. Data
Science and Virtual Reality are linked because a VR headset uses computer knowledge,
algorithms, and data to provide you the best viewing experience possible. Pokemon GO, a
popular game, is a small step in the right way.

14. Banking:
One of the most common uses of Data Science is in banking. Banks have been able to stay up
with the competition because to Big Data and Data Science. Banks may better manage their
resources with Data Science, and they can make smarter decisions through fraud detection,
customer data management, risk modelling, real-time predictive analytics, and customer
segmentation, among other things. Banks also calculate the client lifetime value, which allows
them to keep track of how many customers they have. It gives them a number of forecasts that
the business bank will make based on their consumers. When it comes to fraud detection, banks
make it possible for businesses to detect scams involving credit cards, insurance, and accounting.
Banks can also analyse consumer investment habits and cycles and present you with a variety of
offerings that are tailored to your needs. Furthermore, banks can use data science to risk model
their overall performance and assess their overall performance. Banks can use Data Science to
provide targeted marketing that is tailored to their customers' needs. Banks utilise machine
learning algorithms to improve their analytics approach in real-time and predictive analytics.
Banks also employ real-time data to figure out what's causing their performance troubles.

You might also like