Professional Documents
Culture Documents
Code:
import pandas as pd
import numpy as np
s=pd.Series(np.random.randint(6,size=6),index=['p','q
','r','n','t','v',])
print(s)
output:
2. Write a program to create a data series and then change the indexes
of the series object in any random order.
Code:
import pandas as pd
import numpy as np
s1=pd.Series(data=[100,200,300,400,500],
index=['I','J','K','L','M'])
print("original data series:")
print(s1)
s1=s1.reindex(index=['K','I','M','L','J'])
print("data series after changing the order of index:
")
print(s1)
output:
Code:
import pandas as pd
s1=pd.Series(data=[6700,5600,5000,5200],index=[1,2,3,
4])
s3=s1.sort_index(ascending=False)
print("Series object s1:")
print(s1)
print("Series object s3:")
print(s3)
Output:
Code:
import pandas as pd
n=eval(input("Enter any 3 numbers of your choice:"))
s4=pd.Series(n)
print("Series object s4:")
print(s4)
print("Cubes of s4 values:")
print(s4**3)
Output:
Q5. Write a program to create series object that stores the table of
number 7.
Code:
import pandas as pd
import numpy as np
n=np.arange(1,11)
s5=pd.Series(n*7)
print("Series displaying table of 7:")
print(s5)
Output:
Code:
import pandas as pd
n={'Name':['Raj','Kunal','Sam','Joy','Shiv'],
'Sales':[1000,2000,4000,8000,16000]}
i=['Zone1','Zone2','Zone3','Zone4','Zone5']
df1=pd.DataFrame(data=n,index=i)
print("DataFrame containing sales man details:")
print(df1)
Output:
Code:
import pandas as pd
SaleDict={2014:{'Madhu':100.5,'Kusum':150.8,'Kinshuk'
:200.9,'Ankit':30000,'Shriti':40000},
2015:{'Madhu':12000,'Kusum':18000,'Kinshuk':22000,'An
kit':30000,'Shriti':45000},
2016:{'Madhu':20000,'Kusum':50000,'Kinshuk':70000,'An
kit':100000,'Shriti':125000},
2017:{'Madhu':50000,'Kusum':60000,'Kinshuk':70000,'An
kit':80000,'Shriti':90000}}
sales=pd.DataFrame(SaleDict)
print("DataFrame containing Year-wise sales:")
print(sales)
Output:
Code:
a) sales.index
b) sales.columns
c) sales.dtypes
d) sales.ndim,sales.shape,sales.size,sales.values
e) sales.iloc[3: , ]
f) sales.iloc[ : , :2]
Output:
d)
(2 (5, 4) 20 [[1.005e+02 1.200e+04 2.000e+04 5.000e+04]
[1.508e+02 1.800e+04 5.000e+04 6.000e+04]
[2.009e+02 2.200e+04 7.000e+04 7.000e+04]
[3.000e+04 3.000e+04 1.000e+05 8.000e+04]
[4.000e+04 4.500e+04 1.250e+05
9.000e+04]])
e)
2014 2015 2016 2017
Ankit 30000.0 30000 100000 80000
Shriti 40000.0 45000 125000 90000
f)
2014 2015
Madhu 100.5 12000
Kusum 150.8 18000
Kinshuk 200.9 22000
Ankit 30000.0 30000
Shriti 40000.0 45000
Q9. Create a dictionary using the following data. Use this dictionary to
create a DataFrame Sales2.
2018
Madhu 160000
Kusum 110000
Kinshuk 500000
Ankit 340000
Shruti 900000
Check if Sales2 is empty or it contains data.
Code:
import pandas as pd
d1={2018:[160000,110000,500000,340000,900000]}
sales2=pd.DataFrame(d1,index=['Madhu','Kusum','Kinshu
k','Ankit','Shruti'])
print(sales2)
print(sales2.empty)
Output:
2018
Madhu 160000
Kusum 110000
Kinshuk 500000
Ankit 340000
Shruti 900000
False
Code:
a) sales=pd.append(sales2)
print(sales)
b) sales=sales.T
print(sales)
c) sales.loc[ : ,2017] # with original dataframe
Tsales.loc[2017, : ] #with Transposed dataframe
Output:
a)
2014 2015 2016 2017 2018
Madhu 100.5 12000 20000 50000 160000
Kusum 150.8 18000 50000 60000 110000
Kinshuk 200.9 22000 70000 70000 500000
Ankit 30000.0 30000 100000 80000 340000
Shriti 40000.0 45000 125000 90000 900000
b)
Madhu Kusum Kinshuk Ankit Shruti
2014 100.5 150.8 200.9 30000.0 40000.0
2015 12000.0 18000.0 22000.0 30000.0 45000.0
2016 20000.0 50000.0 70000.0 100000.0 125000.0
2017 50000.0 60000.0 70000.0 80000.0 90000.0
2018 160000.0 110000.0 500000.0 340000.0 900000.0
c)
Madhu 50000 #with original
dataframe
Kusum 60000
Kinshuk 70000
Ankit 80000
Shriti 90000
Name: 2017, dtype: int64
Madhu 50000.0
Kusum 60000.0
Kinshuk 70000.0
Ankit 80000.0
Shriti 90000.0
Name: 2017, dtype: float64
a) Display the sales made by Madhu and Ankit in the years 2017
and 2018.
b) Display the sales made by Shruti 2016.
c) Add data to Sales for salesman Sumeet where the sales made are
[196.2, 37800, 52000, 78438, 38852] in the years [2014, 2015,
2016, 2017, 2018] respectively.
Code:
a) sales.loc[['Madhu','Ankit'],[2017,2018]]
Tsales.loc[[2017,2018],['Madhu','Ankit']]
b) sales.loc[['Shruti'],[2016]]
Tsales.loc[[2016],['Shruti']]
c) sales.loc['Sumeet',:]=[196.2,37800,52000,78438,38
852]
print(sales)
Output:
a)
2017 2018
Madhu 50000.0 160000.0 #with original dataframe
Ankit 80000.0 340000.0
Madhu Ankit
2017 50000.0 80000.0 #with transposed dataframe
2018 160000.0 340000.0
b)
2016
Shruti 125000
Shruti
2016 125000.0
c)
2014 2015 2016 2017 2018
Madhu 100.5 12000 20000 50000 160000
Kusum 150.8 18000 50000 60000 110000
Kinshuk 200.9 22000 70000 70000 500000
Ankit 30000.0 30000 100000 80000 340000
Shriti 40000.0 45000 125000 90000 900000
Sumeet 196.2 37800 52000 78438 38852
Code:
a) sales.drop(['Kinshuk'])
b) sales=sales.rename(index={'Ankit':'Vivaan','Madhu
':'Shailesh'})
print(sales)
c) sales.loc['Shailesh',2018]=100000
Output:
a)
2014 2015 2016 2017 2018
Madhu 100.5 12000 20000 50000 160000
Kusum 150.8 18000 50000 60000 110000
Ankit 30000.0 30000 100000 80000 340000
Shriti 40000.0 45000 125000 90000 900000
Sumeet 196.2 37800 52000 78438 38852
b)
2014 2015 2016 2017 2018
Shailesh 100.5 12000 20000 50000 160000
Kusum 150.8 18000 50000 60000 110000
Kinshuk 200.9 22000 70000 70000 500000
Vivaan 30000.0 30000 100000 80000 340000
Shriti 40000.0 45000 125000 90000 900000
Sumeet 196.2 37800 52000 78438 38852
c)
2014 2015 2016 2017 2018
Shailesh 100.5 12000 20000 50000 100000
Kusum 150.8 18000 50000 60000 110000
Kinshuk 200.9 22000 70000 70000 500000
Vivaan 30000.0 30000 100000 80000 340000
Shriti 40000.0 45000 125000 90000 900000
Sumeet 196.2 37800 52000 78438 38852
Code:
import pandas as pd
import csv
df=pd.DataFrame({'Item':['Books','Pens','Erasers'],
'sales':[100,200,300]})
print(df)
df.to_csv("d:\\Users\\ABC\\Desktop\\IP\\sales.csv")
print("sucessfully stored")
Output:
Item sales
0 Books 100
1 Pens 200
2 Erasers 300
sucessfully stored
Q14. Write a program that read from CSV file where the separator
character is ‘$’. Read only first 5 rows in your DataFrame.
Code:
import pandas as pd
df=pd.read_csv("d:\\Users\\ABC\\Desktop\\IP\\data.csv
",
sep='$',names=['Itemname','Quantity','Price'],header=
None,nrows=5)
print(df)
Output:
Code:
import pandas as pd
import mysql.connector as sqltor
mycon=sqltor.connect(host=”localhost”,user=”root”,
passwd=1234,database=”nupur12s”)
if mycon.is_connected( ):
print(“Successfully connected to MySQL
database”)
Output:
Q16. Write a program to draw line chart from given financial data of
ABC Co. for 5 days in form a DataFrame namely fdf as shown:
Code:
import pandas as pd
import matplotlib.pyplot as plt
a={'day1':[74.25,76.6,69.50,72.55],
'day2':[56.03,68.71,62.89,56.42],
'day3':[59.30,72.07,77.65,66.46],
'day4':[69.00,78.47,65.53,76.85],
'day5':[89.65,79.65,80.75,85.08]}
i=[0,1,2,3]
fdf=pd.DataFrame(a,i)
print(fdf)
fdf.plot()
plt.show()
Output:
Code:
Output:
Q18. Prof Raj is doing some research in the field of environment. For
some plotting purpose, he has generated some data are:
mu=100
sigma=15
x=mu+sigma*numpy.random.randn(10000)
y=mu+30*np.random.randn(10000)
Write a program to plot this data on a cumulative barstacked
horizontal with both x and y-axes.
Code:
import numpy as np
import matplotlib.pyplot as plt
mu=100
sigma=15
x=mu+sigma*np.random.randn(10000)
y=mu+30*np.random.randn(10000)
plt.hist([x,y],bins=100,histtype='barstacked',cumulat
ive=True)
plt.title('research data histogram')
plt.show()
Output:
19. Write a program to plot a bar chart from the medals won by top
four countries. Make sure that bars are separately visible.
Code:
Output:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
24.SELECT POW(2,3);
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
29.Display the names of students whose names end with the character
‘a’. Also, arrange the students in alphabetical order.
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output:
Code:
Output: