
Data Warehousing and Data Mining

IKRAMUDDEEN Z
RRN:220171601032

B.TECH A.I&D.S
DATASET

Source: https://www.kaggle.com/datasets/nehalbirla/vehicle-dataset-from-cardekho (file 3)

The dataset is titled "Vehicle Dataset from CarDekho" and is hosted on Kaggle. Here's a summary of the
dataset based on the information available on Kaggle:

Content: The dataset contains information about used cars listed on CarDekho. It includes various
attributes/features associated with each car listing, such as:

- Car model

- Year of manufacture

- Selling price

- Present price (current market value)

- Kilometers driven

- Fuel type (e.g., petrol, diesel, CNG)

- Seller type (e.g., dealer, individual)

- Transmission type (e.g., manual, automatic)

- Number of owners

- Location

VISUALIZATION TECHNIQUES:

This dataset is a rich resource for anyone studying the used-car market, including data analysts, machine
learning practitioners, and people working in the automotive industry. Beyond its individual attributes, it
offers a view of how the used-car market behaves and can support better-informed pricing and purchasing
decisions.

Visualization techniques such as bar graphs, line plots, and scatter plots are crucial for exploring and
understanding this dataset. Here is why each technique is important:

1. Bar Graphs:

Make/Model Distribution: A bar graph can show the distribution of car makes and models in the dataset,
helping to understand which brands and models are most prevalent.

Fuel Type Distribution: Bar graphs can display the distribution of different fuel types among the cars,
providing insights into the popularity of various fuel options.

Seller Type Distribution: Bar graphs can show the distribution of seller types (e.g., dealer, individual),
which helps in understanding market dynamics.
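
The counts behind such a bar graph can be tallied before any plotting is done. The following is a minimal sketch in Python using only the standard library. The column names `Fuel_Type` and `Seller_Type` and the sample rows are assumptions made for illustration; they are not values taken from the Kaggle file.

```python
from collections import Counter

# Hypothetical sample rows; the real file could be read with csv.DictReader.
rows = [
    {"Fuel_Type": "Petrol", "Seller_Type": "Dealer"},
    {"Fuel_Type": "Diesel", "Seller_Type": "Dealer"},
    {"Fuel_Type": "Petrol", "Seller_Type": "Individual"},
    {"Fuel_Type": "CNG", "Seller_Type": "Individual"},
    {"Fuel_Type": "Petrol", "Seller_Type": "Dealer"},
]


def category_counts(records, column):
    """Count how many records fall into each category of `column`."""
    return Counter(record[column] for record in records)


fuel_counts = category_counts(rows, "Fuel_Type")
seller_counts = category_counts(rows, "Seller_Type")

# Each (category, count) pair becomes one bar: the category is the label
# and the count is the bar height.
for fuel, count in fuel_counts.most_common():
    print(f"{fuel:<8}{count}")
```

The resulting category labels and counts can then be handed to whichever plotting tool is used to draw the bars.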

2. Line Plots:

Yearly Trends: Line plots can illustrate trends in car prices over the years, helping to identify whether prices
have been increasing, decreasing, or remaining stable over time.

Kilometers Driven vs. Price: Line plots can show the relationship between the number of kilometers driven
and the selling price of cars, revealing any patterns or correlations.
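
A yearly trend line needs one value per year, so the listings are usually aggregated first. The sketch below averages selling price by year of manufacture using only the Python standard library. The (year, price) pairs are hypothetical and serve only to show the shape of the computation.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (year, selling price) pairs, used only for illustration.
listings = [
    (2014, 3.35),
    (2013, 4.75),
    (2014, 4.60),
    (2017, 7.25),
    (2013, 2.85),
    (2017, 9.25),
]


def mean_price_by_year(pairs):
    """Return [(year, mean price)] sorted by year, ready for a line plot."""
    prices = defaultdict(list)
    for year, price in pairs:
        prices[year].append(price)
    return [(year, mean(values)) for year, values in sorted(prices.items())]


trend = mean_price_by_year(listings)
for year, avg in trend:
    print(year, round(avg, 2))
```

Sorting by year matters here: a line plot connects points in the order given, so unsorted years would produce a zigzag rather than a trend.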

3. Scatter Plots:

Price vs. Kilometers Driven: Scatter plots serve as effective tools for illustrating the correlation between the
selling price of cars and the distance they've traveled. This visualization aids in comprehending depreciation
trends over time.

Price vs. Year: Scatter plots also offer insights into the relationship between the selling price of cars
and their manufacturing year. This shows how pricing varies with the age of the vehicle.

1.BAR GRAPH

1. Bar Graph Overview:

This bar graph shows various vehicle models and the distance each has traveled, in kilometers. Each horizontal
bar corresponds to a specific vehicle model, and its length represents the distance traveled by that
particular vehicle.

For clarity, the bars are color-coded to denote the different attributes: Year, Selling Price, Present Price,
Kms Driven, Fuel Type, Seller Type, Transmission, and Owner. This color scheme makes the different aspects of
the data quick to tell apart, which eases interpretation and analysis.

2. Key Elements:

The vertical axis lists the vehicle models, including "Ritz," "Alto 800," "K10," "Brio," "Amaze," and "City."

A bar extends horizontally from each model name to show the distance that vehicle has traveled, which allows
a direct comparison of mileage across models.

X-Axis: Represents the distance traveled (ranging from 0 to 500,000 kilometers).

Y-Axis: Displays the names of the vehicle models.
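
The layout described above, with model names on the vertical axis and bar length proportional to distance, can be sketched as a plain-text chart. This is only an illustration using the Python standard library: the model names are taken from the text, while the distances are invented and do not come from the dataset.

```python
# Hypothetical (model, kilometers driven) pairs; the distances are invented
# for illustration only.
distances = [
    ("Ritz", 28000),
    ("Alto 800", 44000),
    ("Brio", 12000),
    ("City", 60000),
]


def text_bar_chart(pairs, width=30):
    """Render one horizontal bar per model, scaled to the largest value."""
    longest = max(value for _, value in pairs)
    lines = []
    for name, value in pairs:
        # Bar length is proportional to the value; the largest gets `width`.
        bar = "#" * round(width * value / longest)
        lines.append(f"{name:<10}|{bar} {value}")
    return lines


chart = text_bar_chart(distances)
print("\n".join(chart))
```

Scaling every bar against the largest value is what makes the lengths directly comparable across models.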

3. Uses of Bar Graphs:

Utilizing bar graphs allows for effective comparison of data across different categories or groups, making
them invaluable tools in analyzing the traveled distances of various vehicles.

Comparing the lengths of the bars makes trends easy to identify. For instance, it is straightforward to see
which vehicles have covered longer distances, which gives insight into usage patterns and durability.

This data serves as a valuable resource for vehicle owners, buyers, or dealerships, empowering them to make
informed decisions. Whether considering purchasing a used car, evaluating trade-in options, or strategizing
inventory management, leveraging this information can lead to more confident and prudent choices.

Comparing Selling Prices: Analyze how prices vary with distances traveled.

Assessing Fuel Efficiency: Understand mileage by examining distances covered.

Planning Maintenance: Use distance data to schedule maintenance.

Understanding Resale Value: See how distances affect resale prices.

2.LINE PLOT

1. Graph Overview:

- The image displays a line plot graph that illustrates the relationship between various vehicle
models and their corresponding traveled distances.

- Each line on the graph represents a specific parameter, color-coded for clarity.

- The x-axis lists specific vehicle names, while the y-axis represents the distance traveled in kilometers.

2. Key Elements:

Parameters Represented:

Selling Price: Indicates the price at which each vehicle was sold.

Present Price: Reflects the current market value of the vehicles.

Kilometers Driven: Represents the total distance covered by each vehicle.

Fuel Type, Seller Type, Transmission, and Owner: Additional attributes influencing vehicle valuation.
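
Selling price, present price, and kilometers driven are measured on very different scales, so drawing them as lines on one shared axis usually needs some rescaling first. The sketch below shows one common option, min-max scaling to the range 0 to 1. It is an assumption about how such a plot could be prepared, not a description of how the figure in this report was produced, and the sample values are hypothetical.

```python
def min_max_scale(values):
    """Rescale numbers linearly so the smallest maps to 0 and the largest to 1."""
    low, high = min(values), max(values)
    if high == low:
        # A constant series carries no variation; map every value to 0.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


# Hypothetical values for four vehicles, in the same order in each list.
selling_price = [3.0, 5.0, 7.0, 4.0]
kms_driven = [20000, 40000, 10000, 50000]

scaled = {
    "Selling Price": min_max_scale(selling_price),
    "Kilometers Driven": min_max_scale(kms_driven),
}
for name, series in scaled.items():
    print(name, [round(x, 2) for x in series])
```

After scaling, each series keeps its shape but no single parameter flattens the others, which is what makes several lines readable on one plot.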

3. Practical Uses:
Buyers and Sellers: Individuals in the market for vehicles or looking to sell can leverage this dataset.

Estimating Value: Evaluate how traveled distance impacts selling price and present price.

Condition Assessment: Kilometers driven offer insights into a vehicle's wear and tear.

Decision-Making: Consider fuel type, seller type, and transmission when making informed choices.

Market Insights: Dealerships and manufacturers can glean valuable insights into market trends.

Optimizing Fleet Management: Companies overseeing vehicle fleets can enhance maintenance scheduling and
management.

3.SCATTER PLOT

1. Graph Overview:
The scatter plot shows the correlation between the distance traveled by vehicles and their respective selling
prices. Each dot on the plot represents a single vehicle, combining that vehicle's attributes with its selling
price.

2. Key Insights from the Scatter Plot:


Year: Vehicles spanning different production years are dispersed across a range of selling prices.

Selling Price & Present Price: The clustering of data points at lower price points indicates a prevalence of
older vehicles; those with higher usage tend to have lower selling prices.

Kilometers Driven: A noticeable trend emerges as kilometers driven increase, indicating a decline in selling
price. Higher mileage tends to diminish resale value.

Fuel Type & Seller Type: Clusters of data points unveil price variations influenced by fuel type (e.g., petrol,
diesel) and seller type (individual vs. dealer).

Transmission: Clear distinctions in price ranges are observed among vehicles with automatic or manual
transmission.

Owner: The number of owners significantly influences selling prices; single-owner vehicles often command
higher prices.
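
The downward trend between kilometers driven and selling price can be summarized with a single number, the Pearson correlation coefficient. The following is a minimal sketch using only the Python standard library. The sample values are hypothetical and chosen so that price falls as distance rises; they are not taken from the dataset.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical sample in which price falls as distance rises.
kms_driven = [10000, 30000, 50000, 70000, 90000]
selling_price = [8.0, 6.5, 5.5, 4.0, 3.5]

r = pearson(kms_driven, selling_price)
print(round(r, 3))
```

A coefficient close to -1 corresponds to the pattern described above, where points on the scatter plot slope downward from left to right. A value near 0 would mean the plot shows no clear linear trend.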

3. Practical Uses:

Pricing Guidance: Buyers can derive estimates of fair prices by considering attributes such as distance
traveled, aiding in informed decision-making during negotiations.

Negotiation: Armed with insights into how various factors impact resale value, sellers can negotiate more
effectively, ensuring fair deals and maximizing returns.

Fleet Management: Companies managing vehicle fleets can optimize replacement schedules based on usage and
depreciation.

Data-Driven Decisions: Gain insights into how various attributes influence vehicle valuation, facilitating
informed decision-making regarding pricing and inventory management. Utilize this understanding to devise
effective inventory and marketing strategies tailored to market trends and consumer preferences.
