You are on page 1of 3

LajakLaku Tablet Production Analysis for April 2023

Introduction
This report presents an analysis of the LajakKuruih tablet production data for April 2023. The analysis aims to
provide insights into the production process and product characteristics, including the weight and tensile
strength of the tablets. The report also examines whether there are significant differences in tablet weights
across different shifts and predicts the tensile strength of tablets with specific relative density values. This report
provides a comprehensive analysis of the LajakKuruih tablet production for April 2023.

Methodology

For
The analysis is performed using Python programming language with the help of libraries such as pandas for
data manipulation, numpy for numerical computation, matplotlib and seaborn for data visualization, and scipy
for statistical analysis:
Pandas: Used for data manipulation and analysis. It provides data structures and functions needed to
manipulate structured data.

preview
NumPy: Used for numerical computations. It provides a high-performance multidimensional array object and
tools for working with arrays.
Matplotlib and Seaborn: Used for data visualization. They provide functions for creating a variety of plots for
data exploration.
Scikit-learn: Used for machine learning. It provides simple and efficient tools for predictive data analysis.

only
SciPy: Used for scientific computing. It provides functions for statistical testing and other mathematical tools.

The dataset used in this analysis is obtained from the LajakLaku's production records for the month of April
2023. The dataset contains information about the weight, tensile strength, and relative density of the tablets
produced, along with the shift during which the tablets were produced.
The data was loaded from a CSV file into a Pandas DataFrame, a two-dimensional tabular data structure with
labeled axes.

Data Cleaning

Before proceeding with the analysis, it is important to clean the data to ensure its quality and reliability. This
involves handling missing values, outliers, and incorrect data entries. A random sample of 200 records was
selected for the analysis.

index Unnamed: 0 Weight (mg) Relative Density (%) Tensile Strength (Mpa) Shift

225 226 326.5 88.8 7.3 Night

122 123 346.07 87.2 6.9 Evening

92 93 314.49 89.6 7.1 Evening

157 158 341.25 84.1 6.8 Night


index Unnamed: 0 Weight (mg) Relative Density (%) Tensile Strength (Mpa) Shift

154 155 345.36 83.7 6.9 Night

The dataset does not contain any missing values. Outliers can significantly affect the results of our analysis and
predictive modeling. We will use boxplots to visualize the distribution of the numerical variables and identify
any outliers.

For
preview
only
From the boxplots, it can be observed that there are no significant outliers in the dataset. Therefore, no outlier
treatment is required.
Descriptive Analysis
In this part we will be providing a detailed description and summary of the sample data in numerical, tabular,
and graphical forms.Let's start by generating the descriptive statistics for the numerical variables in the sample
data. The sample data includes the following variables:
The 'Shift' variable contains three unique values: 'Morning', 'Evening', and 'Night', which are all valid.
Therefore, there are no incorrect data entries in the dataset. With the data cleaning process completed, we can
now proceed to the next step of the analysis, which involves selecting a random sample of 200 data points from
the dataset for further analysis.
Weight (mg): The weight of the tablet in milligrams.
Relative Density (%): The relative density of the tablet, expressed as a percentage.
Tensile Strength (Mpa): The tensile strength of the tablet in megapascals.
Shift: The shift during which the tablet was produced (Morning, Evening, or Night).
A numerical summary of the sample data was generated using the describe function in Pandas, which provides
count, mean, standard deviation, minimum, 25th percentile, median, 75th percentile, and maximum values for
each numerical variable.

Here is a numerical summary of the sample data:

index

count

mean

std
Unnamed: 0

200

122.515

72.3515
For
Weight (mg)

200

349.442

14.8167
Relative Density (%)

200

88.08

1.95294
Tensile Strength (Mpa)

200

7.0995

0.184989

min

25%

50%
1

59.5

122.5
preview
314.49

339.51

348.24
83.3

86.9

88.1
6.5

7.1

75%

max
183.25

250
only
359.2

389.48
89.3

92.8
7.2

7.6

The descriptive statistics provide the following insights into the sample data:
● The average weight of the tablets is approximately 349.44 mg, with a standard deviation of
approximately 14.82 mg.
● The average relative density of the tablets is approximately 88.08%, with a standard deviation of
approximately 1.95%.
● The average tensile strength of the tablets is approximately 7.10 Mpa, with a standard deviation of
approximately 0.18 Mpa.
The data was also visualized using histograms and pair plots to examine the distributions of the variables and
the relationships between them.

You might also like