You are on page 1of 3

ACQUIRE ACTIVITY WORKSHEET - USER INFORMATION

Date Dec 03, 2023

First Name Erickson

Last Name Gibson

Email gibso231@purdue.edu

Major Game Design

Course (Choose one) CGT 27000

Term Fall

Visualization Assignment Final Project

ACQUIRE ACTIVITY: Generate/Identify appropriate data sources

https://www.kaggle.com/datasets/jeffgallini/college-football-
URL to Data Source 1
team-stats-2019?select=cfb13.csv

Data Source 1 (Public)


Data availability Data Source 2 (NA)
Data Source 3 (NA)

Describe your search


methods used to locate the
data. Where did you look for
I used a website called kaggle. I searched up college
the data (web, research
football on the site.
articles, etc.)? What search
phrases did you use (if web
search performed)?

ACQUIRE ACTIVITY: Describe the data

Data format: what format is


Data Source 1 (A combination of quantitative (numerical
the data in? Structured vs
values) and qualitative (text) data)
instructed? All text, a
Data Source 2 (NA)
combination, multiple
Data Source 3 (NA)
sources?

Data format: Is the data Data Source 1 (Secondary)


primary, secondary or Data Source 2 (NA)
tertiary? Data Source 3 (NA)

ACQUIRE ACTIVITY: Describe Data Types

What data types are in Data


strings, integers, floats
Source 1?
What data types are in Data
N/A
Source 2?

What data types are in Data


N/A
Source 3?

Data Source 1 (Txt, csv)


Describe access to the data. Data Source 2 (NA)
Data Source 3 (NA)

Data Source 1 (Text/CSV)


What structure holds the
Data Source 2 (NA)
data?
Data Source 3 (NA)

ACQUIRE ACTIVITY: Evaluate the data

List of Variables. In the


space below, list the data
variables. DO NOT list the
actual data, just the variable
name. If your data is in table Team, Off.Yards, Total.TDs, Def.Rank, Yards.Allowed,
form, variable names are Off.TDs.Allowed
usually the column
headings (for example: First
name, Last name, zip code,
etc.)

Audience. Who is your


Anyone who interested in college football stats
intended audience?

Assumptions. List three (3)


assumptions you are
making about the data you 1. All the data is correct
have acquired. Keep in 2. The data follows a normal distribution
mind, observations are NOT 3. The variance of the data is equal
assumptions. Number your
assumptions.

ACQUIRE ACTIVITY: Examine the data

What real life behavior does


the data reflect? Does it
show patterns of activity, It reflects the behavior of football teams and how good the
regularity of events, a teams really are.
timeline, population data,
etc.? Explain.

What are the weaknesses of DS #1 (Might not be available in future)


the data source? Is it likely DS #2 (Other)
that the data source (DS) DS #3 (Other)
will be available in the
future? Is the data
complete? What is the
quality of the data? Is it
specific to your needs for
the current project? Is the
data in the format you
need? Are there missing
data? Explain.

If you answered "Other" to


"What are the weaknesses
N/A
of the data source?" in the
space below Explain.

What information is
What football teams has the better stats. This data shows
emphasized? What is the
certain stats of college football teams, and you could use
central focus of the data?
the stats to determine who the best team may be stat wise.
Explain.

ACQUIRE ACTIVITY: Data granularity

At what level of granularity DS #1 (Summarized)


is the data provided for DS #2 (NA)
each Data Source (DS)? DS #3 (NA)

ACQUIRE ACTIVITY: Scope of the data

There are many topics that can be covered such as what


What topics can be covered team has the most wins or most losses, what team has the
using the data? Explain. most offensive yards, what team has the best defensive
rank, etc.

Is there a time range/frame?


(example, 1940 to 1970) 2013-2020
Explain.

Is the data for a specific


area/discipline/demographic No
etc.? Explain.

You might also like