Professional Documents
Culture Documents
A PROJECT REPORT
Submitted By
BACHELOR OF ENGINEERING
in
i
BONAFIDE CERTIFICATE
SIGNATURE SIGNATURE
ii
ACKNOWLEDGEMENT
First and foremost, we thank the almighty for showering his abundant
blessings on us to successfully complete the project. Our sincere thanks to,
our beloved “Kalvivallal” Late Thiru T. Kalasalingam, B.Com., Founder
for his keen interest and affection towards us.
We thank entire Staff Members of our department and our friends for
helping us by providing valuable suggestions and timely ideas for successful
completion of the project.
Last but not the least our Family Members and Friends have been a
great source of inspiration and strength to us during the course of this project
work and our sincere thanks to them.
iii
ABSTRACT
amount. Existing system uses mysql data for datasets and mathematical
calculation for the prediction process which takes longer time for processing and
lesser. The above problem can be resolved by using big data processing for large
collection of data and with that data it is been used for training the machine using
the random forest algorithm for predicting the electricity consumption. Through
that we enter the application’s usage time accordingly in the GUI which takes the
input and checks with the machine and gives the predicted range of the electricity
iv
TABLE OF CONTENTS
ABSTRACT iv
LIST OF TABLES ix
LIST OF ABBREVIATIONS x
1 INTRODUCTION 1
1.1 OBJECTIVE 1
1.2 SCOPE 2
2 LITERATURE SURVEY 3
3 ANALYSIS 8
v
3.2.3 Feasibility Study 11
3.3.1 Hadoop 12
3.3.2 Sqoop 14
4 DESIGN 16
5 IMPLEMENTATION 22
5.1 MODULES 22
vi
6 TESTING 25
8 USER MANUAL 29
9 CONCLUSION 31
10 FUTURE ENHANCEMENT 32
APPENDICES
APPENDICES 2 SCREENSHOTS
APPENDICES 3 PUBLICATION
REFERENCES
vii
LIST OF FIGURES
viii
LIST OF TABLES
ix
LIST OF ABBREVIATIONS
SYMBOLS ABBREVATIONS
x
CHAPTER – 1
INTRODUCTION
1
1.1 OBJECTIVE:
1.2 SCOPE:
The system can be used to predict the household electricity bill, for the
home applications.
2
CHAPTER-2
LITERATURE SURVEY
Concept discussed
Work Done
Countries all over the world are trying to promote a revolution in energy industry
by exploring alternative ways of energy supply, consumption and storage.
Problem Identified
Knowledge Gain
Therefore, fast and accurate formation of the mathematical model and calculating
optimal dispatch of MES become vital issues in the MES study.
3
Title :”Benefits and Limitations of Tapping into Stored Energy for
Datacentres”
Author : SriramGovindan, AnandSivasubramaniam, BhuvanUrgaonkar
Publication: IEEE 2011 Conference
Concept Discussed
Power and energy consumption of datacenters have come under much scrutiny in
recent times due to their contribution to cost, design and reliability constraints,
and environmental concerns. In existing datacenter studies, the two terms - energy
and power - are often used rather interchangeably.
Work Done
However, these have very different connotations- energy is the integral of power
over time - and can have very different implications for datacenter design,
operation , and costs.
Problem Identified
For the same energy consumption, one could have different power draw profiles,
each having a different consequence on the design and reliability issues as well
as on the monthly electricity bill. These solutions can have adverse performance
consequences depending on the workload behaviour.
Knowledge Gain
This paper proposes a complementary solution to this problem of peak power that
does not have any performance consequence, by exploiting already existing
energy storage (UPS) facilities within thedatacenter. It can be combined with
existing throttling andworkload management techniques to further reduce the
peakpower draw and/or the duration of this draw.
4
Title :”Power Provisioning for a Warehouse-sized Computer”
Author : Xiaobo Fan, Wolf-Dietrich Weber,Luiz André Barroso
Publication : IEEE 2017 Conference
Concept Discussed
Large-scale Internet services require a computing infrastructure that can be
appropriately described as a warehouse-sized computing system. The cost of
building datacenter facilities capable of delivering a given power capacity to such
a computer can rival the recurring energy consumption costs themselves.
Work Done
Therefore, there are strong economic incentives to operate facilities as close as
possible to maximum capacity, so that the non-recurring facility costs can be
best amortized.
Problem Identified
That is difficult to achieve in practice because of uncertainties in equipment
power ratings and because power consumption tends to vary significantly with
the actual computing activity.
Knowledge Gain
Effective power provisioning strategies are needed to determine how much
computing equipment can be safely and efficiently hosted within a given power
budget.
5
Title :”Personality and ectoparasitic mites (Hemipteroseiusadleri) in
Firebugs”
Author :Eniko˝ Gyuris, JúliaFruzsinaHankó, OrsolyaFeró, ZoltánBarta
Publication: IEEE 2008 Conference
Concept Discussed
Individuals of the same species often consistently differ in their behaviour across
time and context. These stable differences are usually termed ‘animal
personality’. Parasitism is known to significantly influence the evolution of
animal personality at least in part because more explorative individuals may meet
parasites more frequently than less explorative ones.
Work Done
Previously, we have demonstrated that consistent individual differences (i.e.
boldness, activity, exploration) can be measured in firebugs. As continuation, we
examined here the relationship between firebug personality traits and their
ectoparasitic mite loads in a wild population.
Problem Identified
We showed that bugs that behaved in a more explorative way have more mites
and we also found a marginally significant interaction between sex, boldness and
activity: bolder and more active female firebugs were more infected. In addition,
we experimentally tested whether an artificial infection causes any alteration in
the bug’s behaviour and whether there is any relationship between firebug
personality and immune response.
Knowledge Gain
This treatment did not induce any alteration in bugs’ personality. We found that
bugs become more explorative but less active when repeating the experiment, but
at the same time all personality traits (boldness, activity and exploration) were
repeatable. Furthermore, firebugs with a stronger immune response behaved more
boldly but also less actively.
6
Title :”Providing Green SLAs in High Performance Computing
Clouds”
Author :Md.E.Haque, Kien Le,InigoGoiri, Ricardo Bianchini
Publication: IEEE 2013 Conference
Concept Discussed
Demand for clean products and services is increasing as society is becoming
increasingly aware of climate change. In response, many enterprises are setting
explicit sustainabilitygoals and implementing initiatives to reduce carbon
emissions.
Work Done
Quantification and disclosure of such goals and initiatives have become important
marketing tools. As enterprises and individuals shift their workloads to the cloud,
this drive toward quantification and disclosure will lead to demand for
quantifiable green cloud services.
Problem Identified
Thus, we argue that cloud providers should offer a new class of green services,
in addition to existing (energy-sourceoblivious) services.
Knowledge Gain
This new class would provide clients with explicit service-level agreements
(which we call Green SLAs) for the percentage of renewable energy used to run
their workload.
7
CHAPTER – 3
ANALYSIS
In the present days the consumptions of the electricity is getting higher and
it must also be conserved for the future. So, by predicting the electric bill we be
cost effective and it may also be helpful to conserve them according to their use.
Therefore, to achieve this the large amount of the data is been used for higher
accuracy in prediction.
Disadvantages
• Limitation of data.
• Time consumption is more and maintenance cost is very high.
• Accuracy is lesser.
8
3.1.3 Proposed System
Proposed concept deals with providing database by using Hadoop
tool it can analyse with no limitation of data and simply add number of machines
to the cluster and get the results with less time, high throughput and maintenance
cost is very less and we are using partitions and bucketing techniques in Hadoop.
For the prediction and analyses part uses random forest algorithm it calculates
faster with higher accuracy. The random forest algorithm is a supervised learning
where we could train the machine with desired training data for the desired output.
Advantages
Usability
It is easy to understand, learn, and operate the software system and is easily
usable and can be deployed in any environment.
9
Efficiency
Security
Reliability
The system runs completely on a separate system and all the functionality
is implemented using custom algorithms, so the system is highly reliable.
Compatibility
The program modules are able to execute without problems or conflict. The
system is also compatible with both 32bit and 64bit architectures.
Flexibility
Safety
Maintains integrity and processed data is not shared outside the system.
Portability
The software can be easily deployed to any windows system (32 or 64 bit)
without any dependency issues.
10
Performance
The system provides acceptable results and the performance is scalable and
can be upgraded easily.
Maintainability
• Technical Feasibility
• Economic Feasibility
• Operational Feasibility
Technical Feasibility
It is carried out to check the technical feasibility, that is, the technical
requirements of the system. Any system developed must not have a high demand
on the available technical resources. This will lead to high demands on the
available technical resources. This will lead to high demands being placed on the
client. The system that has been proposed to be developed has been analyzed
against several technical criteria and it will have a modest requirement.
Economic Feasibility
It is carried out to check the economic impact that the system will have on
the organization. The amount of fund that the company can pour into the research
11
and development of the system is limited. The expenditures must be justified.
Thus the system to be developed will be well within the budget and this can be
achieved because most of the technologies used are freely available. Only the
customized products will have to be purchased.
Operational Feasibility
Operation feasibility is a measure of how well the solution will work in the
organization. It is also a measure of how people feel about the system project.
The proposed system is operationally feasible because the level of acceptance by
the users solely depends on the methods that are employed to educate the user
about the system and to make him familiar with it. The proposed approach has
been designed with at most user friendliness and this will make the system that
will be developed socially feasible too.
Hadoop
12
computers. Hadoop is designed to scale up from single server to thousands of
machines, each offering local computation and storage.
HDFS :
Hadoop File System was developed using distributed file system design.
It is run on commodity hardware. Unlike other distributed systems, HDFS is
highly fault tolerant and designed using low-cost hardware. HDFS holds very
large amount of data and provides easier access. To store such huge data, the
files are stored across multiple machines. These files are stored in redundant
fashion to rescue the system from possible data losses in case of failure. HDFS
also makes applications available to parallel processing.
Map Reduce:
13
Sqoop:
When Big Data storages and analyzers such as MapReduce, Hive, HBase,
Cassandra, Pig, etc. of the Hadoop ecosystem came into picture, they required a
tool to interact with the relational database servers for importing and exporting
the Big Data residing in them. Here, Sqoop occupies a place in the Hadoop
ecosystem to provide feasible interaction between relational database server and
Hadoop’s HDFS.Sqoop is a tool designed to transfer data between Hadoop and
relational database servers. It is used to import data from relational databases
such as MySQL, Oracle to Hadoop HDFS, and export from Hadoop file system
to relational databases. It is provided by the Apache Software Foundation.
Random Forest:
14
3.3 REQUIREMENTS
HARDWARE REQUIREMENTS
PROCESSOR : PENTIUM IV 2.6 GHz, Intel Core 2 Duo.
RAM : 4GB DD RAM
MONITOR : 15” COLOR
HARD DISK : 40 GB
SOFTWARE REQUIREMENTS
Framework : Hadoop
Database : MYSQL 5.5
Languages : HQL
Data Access Tool : Sqoop
Operating System : Cent Os, Windows
Tool : Anaconda with Jupyter Notebook
15
CHAPTER – 4
DESIGN
16
4.2 Detailed Design
Use case diagrams are used to model the behavior of the system, a
subsystem or the class. Use cases are applied to capture the intended behavior of
the system developed, without specifying how the behavior is implemented. A
use case diagram contains the components.
• The use cases, the roles played by the actors in the system.
• The relationships between and among the actors and the use cases.
17
Use case Actors
• System
• user
• Dataset
• Store in HDFS
• Analyze with Hive
• Bucketing of data
• Training set
• Predicted bill
18
In our use case diagram we analyze data, which will be easy to predict.
Let us see the process of use case diagram in which whatever data analyzation
done is to be stored in hdfs (Hadoop distribution file system) by different type of
tools like sqoop and hive then the data is inserted for training set by the system
then the user send the data for which the system sends to the result of prediction.
In our class diagram the user login to his/her own account. In our class diagram
the client can be analysis the dataset, which will be easy to predicate. Let us see
the process of use class diagram in which whatever data analyzation done to be
19
stores in hdfs (hadoop distribution file system) by different type of styles like
hive, then the data is sent through the system for training set then the data is
predicted to user.
Data set
system
System Hadoop tool
organized large data set
result
Input
data
20
The data set is been given as a input in the system to the Hadoop tool where the
data is been sent to the hive system for classification and they are ordered grouped
according to their set values and sent in as a input to the training data set for the
algorithm to train the machine for the desired output.
21
CHAPTER 5
IMPLEMENTATION
5.1 MODULES
The proposed Electric bill prediction system has different modules for working
its efficient functionality. These modules help the system to work efficient and
also well in performance as well as in accuracy. The modules are listed below:
22
5.2.2 Migration and Storage
In here the dataset are been transferred into hadoop(HDFS) for storage,
Sqoop is a command-line interface application for transferring data between
relational databases and Hadoop. we fetch the dataset into hadoop (HDFS) using
sqoop Tool. Using sqoop we have to perform lot of the function, such that if we
want to fetch the particular column or if we want to fetch the dataset with specific
condition that will be support by Sqoop Tool and data will be stored in hadoop
(HDFS).
In here we analysis the dataset using HIVE tool which will be stored in
Hadoop (HDFS). For analysis dataset HIVE using SQL Language. Using hive,
we perform Tables creations, joins, Partition, Bucketing concept. Hive analysis
the only Structure Language. Through this hive the data is been analyzed and they
are been grouped according to their column name by the previous modules Sqoop
technique and been ordered as a dataset.
Validation techniques in machine learning are used to get the error rate of
the Machine Learning (ML) model, which can be considered as close to the true
error rate of the dataset. If the data volume is large enough to be representative of
the population, you may not need the validation techniques. However, in real-
world scenarios, to work with samples of data that may not be a true
representative of the population of given dataset. To finding the missing value,
duplicate value and description of data type whether it is float variable or integer.
The sample of data used to provide an unbiased evaluation of a model fit on the
23
training dataset while tuning model hyper parameters. Then the validation set is
used to evaluate the given model, this set used for frequent evaluation. This
validation data set allows us to choose which algorithm can be suitable for build
the model
The validation set data is been taken in as input for training the machine
using the Random forest algorithm then the trained set is been checked using test
set of data where the accuracy is been checked after testing the input from the
user is been asked from the GUI, then it is used as input and the machine runs
according to its trained data set and it gives the output in the GUI application.
24
CHAPTER – 6
TESTING
Testing is finding out how well something works and tells what level of
knowledge or skill has been acquired. Software testing is an investigation
conducted to provide stakeholders with information about the quality of the
product or service under the test. Software testing can also provide an objective,
independent view of the software to allow the business to appreciate and
understand the risk of software implementation.
Unit Testing
Integration Testing
Integration testing is any type of testing that seeks to verify the interface
between components against a software design. Software components may be
integrated in an iterative way or altogether.
System Testing
25
Table 6.1 Test Case Design
5 T05 Verify the 1. Enter the data Enter the Should Final output
output 2. Click submit required generate shows the
data the bill amount
output of range
predicted
bill
26
Table 6.2 Test Case Log Design
27
CHAPTER 7
The results obtained by comparing the efficiency of the proposed system with
the existing system. The purpose of the system is to predict the electricity bill to
conserve electrical energy and save money. By using the GUI application, we
made cost efficient system for the community to conserve electricity by
predicting the electricity bill.
28
CHAPTER – 8
USER MANUAL
Installing Python
Step 2: Ensure that the Install for all user radio button is pressed.
Step 4: A new Python 3.6.4 Setup pop-up window will appear (Select Destination
Directory).
Step 8: Use the default customization, which selects the Python Interpreter and
all its libraries (about 50 Mb).
Step 10: In a few seconds a pop-up window titled User Account Control will
appear, posing the question, “Do you want the following program to install
software on this computer?”
Step 12: A new Python 3.6.4 Setup pop-up window will appear.
29
Installation Hadoop
Step 3: Configure the correct version with the system os and jdk.
30
CHAPTER– 9
CONCLUSION
Then we use Random forest method for the prediction through machine learning
by using the data from the Hadoop ecosystem as input and analysis for prediction.
31
CHAPTER – 10
FUTURE ENHANCEMENT
In future, the proposed system can add some renewable source of electrical
energy into the system and predict how much the current can be saved, also able
to suggest some less electricity consuming application and also predict when
these applications can be switched off or kept idle to stop the wastage of the
electricity.
32
APPENDICES
33
APPENDIX – 1
BASE PAPER
34
CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019 249
Abstract—Smart grid enables consumers to control and sched- grid, which transfer information between two nodes. This pro-
ule the consumption pattern of their appliances, minimize energy cess is very much crucial to control and manage various smart
cost, peak-to-average ratio (PAR) and peak load demand. In grid components efficiently under variable demand situation.
this paper, a general architecture of home energy management
system (HEMS) is developed in smart grid scenario with novel To reduce the cost of infrastructure, environmental impacts and
restricted and multi-restricted scheduling method for the residen- increase the reliability of the system, distributed generations
tial customers. The optimization problem is developed under the like solar and wind can be integrated with the smart grid [4].
time of use pricing (TOUP) scheme. To optimize the formulated This helps to manage the energy consumption of smart home
problem, a powerful meta-heuristic algorithm called grey wolf appliances by energy management system (EMS), which is
optimizer (GWO) is utilized, which is compared with particle
swarm optimization (PSO) algorithm to show its effectiveness. A also the aspect of smart grid [5].
rooftop photovoltaic (PV) system is integrated with the system In DSM, various techniques and algorithms have been
to show the cost effectiveness of the appliances. For analysis, adopted to minimize the cost of electricity billing based
eight different cases are considered under various time scheduling on TOUP tariffs and incentives. Consumers can generate
algorithms. renewable energy to supply their appliances and if there is
any excess production, they will sell it to the utility grid
Index Terms—Demand side management, GWO, home energy
management system, PSO, peak-to-average ratio. based on the grid codes and TOUP, which varies through the
day [6]. A novel approach of power hubs is presented in [7]
for demand side management in smart homes. These power
hubs control the loads individually. Shah et al. [8] proposed an
energy management system for smart building, integrated with
I. I NTRODUCTION energy storage system by using multi-agent system (MAS).
This research ignored the PAR that controls the horizontal load
E VER increasing electricity demand, rising energy genera-
tion cost and growing renewable energy generation posed
the limit on the production of energy from the conventional
distribution within a day. Mahmood et al. [9] utilized BPSO
based realistic scheduling mechanism (RSM) to schedule the
energy sources [1]. All these challenges motivate the electric home appliances and minimize user frustration and maximize
utilities to focus on demand side management (DSM) tech- utilization of appliance under the given constraints. Huang
niques. Electricity usage report in United States suggested that et al. [10] proposed a hybrid PSO-DE algorithm in order to
at least 30% of electric power is wasted from the 72% of the manage appropriate allocation of energy resources to the end
total power that is consumed by the residential and commercial users.
users [2]. Further, European Union (EU) is also decided to In [11], a cooperative PSO has been used to optimize the
enhance the renewable energy production up to 20% of the energy consumption of both time-shiftable and power-shiftable
total electricity production till 2020. Along with that EU is also home appliances. Yang et al. [12] developed a new interactive
targeted to enhance the generation by improving the energy teaching–learning optimization (ITLO) method for voltage
efficiency up to the level of 20% [3]. High quality and reliable source converter based high voltage direct current (VSC-
power supply is available with the advancement of information HVDC) systems with the offshore wind farm integration.
and communication technology (ICT) in the field of energy Kazemi et al. [13] developed the EMS to manage energy
sector. ICTs are the most important component in the smart usage of appliances, by GWO, which is followed by genetic
algorithm (GA), but the system is not integrated with ESS
Manuscript received February 15, 2019; revised April 1, 2019; accepted and energy-shiftable scheduling. Yang et al. [14] proposed
May 17, 2019. Date of publication June 30, 2019; date of current version a new grouped GWO (GGWO) technique for getting the
May 20, 2019. optimum value of interactive proportional-integral controllers’
T. Molla, B. Khan (corresponding author, e-mail: baseem.khan04@gmail.
com), and B. Moges are with Hawassa University, Hawassa, Southern Nations parameters of doubly-fed induction generator based wind
1530, Ethiopia. turbines. An efficient HEMS has been introduced by Zhao
H. H. Alhelou is with Tishreen University, 2230 Lattakia, Syria. et al. [3] to minimize electricity cost and PAR. In this
R. Zamani is with Tarbiat Modares University, Tehran, Iran.
P. Siano is with The Department of Management & Innovation Systems, research GA is used to solve the formulated minimization
University of Salerno, Fisciano, Campania 84084, Italy. problem and the inclining block rate (IBR) model is adopted to
DOI: 10.17775/CSEEJPES.2019.00340 restrict over energy consumption of home appliances. Rahim
2096-0042 © 2019 CSEE
35
250 CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019
et al. [15] introduced the HEMS, which is formulated via be determined experimentally at equal duration of operation
multiple knapsacks and ant colony optimization (ACO). time. It is assumed that all the appliances are working with
However, due to the unpredictable nature of human behavior their maximum power rating specified in their Specification
and variable performance of most of the home appliances Manuel.
with non-linear and complex energy consumption pattern,
Photo voltaic (PV) generation profile
the majority of the techniques listed cannot tackle HEMS 5
problem efficiently. To accomplish energy cost minimization,
4
PAR reduction and peak load minimization sometimes these
Power (kW)
techniques ignored the comfort level of the users. In addition 3
to that, when the number of appliances increased to a certain
2
extent, the stated algorithms converged slowly. Therefore, in
this work the following methods are taken: 1
1) GWO is implemented to minimize the energy billing and 0
PAR, without highly affecting the comfort level of the user. 0 20 40 60 80 100 120
Hours of the day
2) An optimal control model is developed for smart building 1 unit=60 minutes
appliances to schedule the load. Most of the literatures sim-
plified these models as linear models. This work has extended Fig. 1. PV generation profile of Hawassa city.
these models as binary non-linear optimization problem.
3) Different constrained problems are solved simultane-
B. Electricity Tariff Model
ously. The peak demand, PAR and cost of electricity consump-
tion has been monitored without highly affecting the comfort The effectiveness of the proposed system, while solving the
level of the users by scheduling of the appliances to convenient fitness function, is explained by taking Hawassa city, Ethiopia
times within a day. as a case study and implementing the TOUP model.
4) Two novel techniques called ‘restricted’ and ‘multi- In order to promote the production of electricity by the
restricted’ time range scheduling are proposed for scheduling users, the electric utility gives incentives for the customers
the appliances in an efficient way. those are generating power from the renewable energy sources.
Beyond satisfying their energy demand, when they have sur-
plus production, they tend to sell that surplus energy to the
II. LOAD CATEGORIZATION AND E LECTRICITY TARIFF
utility grid and benefit from the net metering.
In EMS of residential building, scheduling of different loads In actual situation, to attract the users and produce energy
can be achieved by specifying the type of loads to be scheduled from renewable sources, the feed in tariff should be higher than
and characteristics of the given loads. These characteristics the grid tariff. But, in this study, to show the effectiveness
include the operation duration and average energy utilization of the proposed system model, the grid tariff (TOUP) and
of each appliance. Generally, home appliances are classified feed in tariff have taken the same value. In addition, to let
into three major groups, i.e. base line (non-shiftable), uninter- the users shift their energy consumption from peak hour to off-
ruptable and interruptable flexible loads [3]. peak hours, the electricity cost in off-peak duration must be
lowered than that in the off peak hours. This can reduce the
A. Home Appliances Used higher energy demand and following stress and instability
In this paper, for the analysis of the proposed problem a problems in the utility grid.
mid-size home is considered and the load profiles of each The cost of the electricity in Ethiopia for residential con-
home appliance are discussed in Table I. A 5 kW roof top sumers for the first 50 kWh is 0.2730 Birr/kWh. By assuming
solar PV on-grid system has integrated into this work. For the electricity cost during peak duration is 50% higher than
that purpose the PV generation profile of Hawassa city is that of off-peak duration then, the TOUP model designed in
presented in Fig. 1. Each of the appliances has a definite this work is shown in Table II.
interval of time for the completion of the operation and thus TABLE II
has a definite power usage vector that has to be developed THE PROPOSED TOUP MODEL
either from the specification of the given appliance or can
Hours Price (TOUP) in Birr/kWh
10:00 PM–7:00 AM 0.2730
TABLE I 7:12 AM–9:48 PM 0.4095
APPLIANCES USED IN THIS STUDY
36
MOLLA et al.: INTEGRATED OPTIMIZATION OF SMART HOME APPLIANCES WITH COST-EFFECTIVE ENERGY MANAGEMENT SYSTEM 251
perform the problem by GWO. Therefore, 12 minutes is the where, gs is the feed-in tariff, ρs ij represents the power
shortest operation time of any appliance. The integer multiples produced by the roof top solars PVs system ins sth time slot.
of the 12-minute time intervals is fixed to the length of By replacing the variable P Y with P and ρs Y s
ak ak sch,ak ak ak
operation time interval (LOT) of each schedulable appliance. with G sschm,ak , the objective function for reduction of con-
The unit of LOT in this study is assumed to be the number sumers’ electricity bill without including the solar PV system
of time slots. It represents very small errors of few seconds, is presented by:
120
which can be ignored. The power consumption scheduling
vector Pa is given by: min ss
sch ) (7)
(C
s=1P
Pa � P , P , . . . , P
1 2 120
(1)
s.t. αa ≤ ta ≤ (βa − la)
where, P as represents the power consumption of ath appliance
With the incorporation of the solar PV system, the objective
for sth time slot in kWh. The power consumption value per
function will be updated as:
hour is assumed to be fixed for all appliances based on their
specifications. For appliance a, the energy consumption per 120
37
252 CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019
TABLE III
PARAMETERS OF SCHEDULABLE LOAD APPLIANCES
Appliances Power rating Actual OTD Daily Energy Energy Number of slots
(kW) (min) consumption consumption per assigned
(kWh) slot (kWh)
Washing machine with dryer 3 180 9 0.6 15
Electric oven-1 (morning hours) 2.15 45 1.72 0.43 4
Electric oven-2 (evening hours) 2.15 45 1.72 0.43 4
Refrigerator 0.225 1380 5.175 0.045 115
Electric iron 1.5 24 0.6 0.3 2
Water heater 1.5 60 1.5 0.3 5
Table fan 0.025 120 0.05 0.005 10
Coffee grinder 0.1 12 0.02 0.02 1
38
MOLLA et al.: INTEGRATED OPTIMIZATION OF SMART HOME APPLIANCES WITH COST-EFFECTIVE ENERGY MANAGEMENT SYSTEM 253
leaders in the pack and known as alpha (α). The second levels A. Shiftable Load Appliances Scheduled with Fixed Time
of the grey wolves (subordinate wolves) are known as beta Range
(β). The tasks of the beta grey wolves are helping the alpha In this scheduling, the operation time range for each appli-
wolves in the decision making and other activities in the pack. ance is adjusted by the user on the assigned time slots. There
The wolves at third level are known as delta (δ) those have is one or more than one operation cycle available for each
to submit to alphas and betas, but control the lowest rank schedulable appliance in a day. Users always have to schedule
grey wolves i.e. omega (ω) in the hierarchy. These omega (ω) and adjust the parameters of the appliances manually, followed
wolves act as scapegoat in the pack. Fig. 2 shows the flow by the utility electricity pricing (TOUP) signal. In Table IV
chart of GWO for proposed smart home appliances scheduling parameters of residential load appliances for fixed time range
problem. scheduling are provided.
Figure 3 shows the residential daily load demands under
Start fixed time scheduling with TOUP. It also shows the maximum
load demand within 24 hours in a day. Fig. 4 shows the
Initialization of Population size, Coefficients vectors, Maximum
iterations, Number of generations and their limits residential daily load demands in the fixed time range without
scheduling. The maximum peak load in a day is found at time
Define the initial cost function slot 38 up to 39, i.e. 7:24 AM to 7:36 AM, which is not
preferable in terms of cost minimization. For example, rather
Generation of Initial population randomly than using electric iron during this period of time, the user
can shift it to 12:00 AM to 12:48 AM, which is the off-peak
Calculate Xα, Xβ, Xδ duration. But it might not be good in terms of maximizing the
Iter+1
comfort of the user who is not interested in using it early in
Update the position of current search agent X(t+1)
X(t+1)=Xα+Xβ+Xδ/3 the morning. Most of the time, slots from 1 up to 15, which
are off-peak hours during night, are occupied.
Update A and C. Calculate Fitness value
Daily load demand without PV
5
Daily load demand (kW/slot)
The proposed smart home appliances scheduling problem is Fig. 3. Residential daily load demand pattern in fixed time scheduling (1unit
solved in eight different scenarios by using GWO algorithm. = 12 minutes).
Further, a comparison is presented with PSO [18] technique to
show the effectiveness of the GWO algorithm. The different To measure the PAR during the day, the mean value of the
scenarios for shiftable load appliances are introduced in the scheduled load demand is calculated. For minimizing the peak
following subsections. load demand, the appliances scheduled should be distributed
The maximum load constraint, control the peak load demand to all the time slots within a day, without disturbing the
in home and it is less than or equal to 5.5 kW in the above comfort level of the resident. To simulate the peak load, the
cases. fitness function in (9) is optimized within 24 hours of the day
TABLE IV
PARAMETERS OF SCHEDULABLE APPLIANCES FOR FIXED TIME SCHEDULING
Appliances Power rating Energy Number of Start time End time OTI (time
(kW) consumption per slots assigned (hour) (hour) slot)
slot (kWh)
Washing machine with dryer 3 15 15 12:00 AM 2:48 AM 1–15
Electric oven-1 2.15 4 4 7:00 AM 7:45 AM 36–39
Electric oven-2 2.15 4 4 7:15 PM 8:00 PM 97–100
Refrigerator 0.225 115 115 12:00 AM 10:48 PM 1–115
Electric iron 1.5 2 2 7:15 AM 7:30 AM 37–38
Water heater 1.5 5 5 10:00 AM 11:00 AM 51–55
Table fan 0.025 10 10 2:00 PM 3:00 PM 71–80
Coffee grinder 0.1 1 1 12:15 PM 12:20 PM 62
39
254 CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019
Load demand
4 AM–2:48 AM, is 3.225 kW by GWO; and from time slot 35–
Toup
3.5 36, i.e. 6:48 AM–7:00 AM, is 3.875 kW by PSO schedulers.
3
2.5 Daily load demand without Daily load demand without
PV using PSO PV using GWO
2 10 10
1.5 9 Load demand 9 Load demand
TABLE V
OPERATION START AND END TIME FOR RESTRICTED TIME SCHEDULING
40
MOLLA et al.: INTEGRATED OPTIMIZATION OF SMART HOME APPLIANCES WITH COST-EFFECTIVE ENERGY MANAGEMENT SYSTEM 255
TABLE VI
OPERATION START AND END TIME FOR MULTI-RESTRICTED SCHEDULING
Daily load demand without Daily load demand without Daily load demand without Daily load demand without
PV using PSO PV using GWO PV using PSO PV using GWO
10 10 10 10
9 Load demand 9 Load demand 9 Load demand 9 Load demand
PV generation profile PV generation profile PV generation profile PV generation profile
Load demand (kW)
8 8 8 8
slot and less than or equal to the end time slot minus the
starting time slot. The starting operation time slots are within of the loads are occupied between those time ranges, which
their available operation cycle range for an appliance, which increases the peak demand. Using PSO, the calculated peak
has to be scheduled. Table VII shows parameters used for the load demand within a day is 3.25 kW, which is from the time
simulation of schedulable load appliances under variable time slots 25–39 (i.e. 4:48 AM–7:36 AM). The drawback of this
range scheduling. type of scheduling mechanism is that it assigns some of the
The daily load pattern under variable time scheduling appliances to inconvenient time slots to operate. The main
scheme is shown in Fig. 7. concern in this scheduling is the minimization of peak load
At maximum iteration, which is 500, the load demand while maintaining the load distribution within a day as smooth
pattern during the day is shown in Fig. 7. The peak load as possible. The PAR obtained on variable time scheduling
demand with GWO scheduler is 3.225 kW, which is from at maximum iteration using GWO and PSO are 3.9121 and
time slot 35 to 49 (i.e. from 6:48 AM to 9:36 AM). 3.9424, respectively. The cost of electricity calculated by using
Even if most of the time slots within a day are occupied, few GWO and PSO are 5.8497 and 6.0906 Birr/day, respectively.
TABLE VII
PARAMETERS OF SCHEDULABLE APPLIANCES
41
256 CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019
Load demand
6 Energy Export with PV generation using fixed time range scheduling is
4 15.78% lower than the energy cost of unscheduled load.
2 Similarly, for a system integrated with PV, by using fixed
0
time range scheduling, the peak load and PAR are higher than
−2
−4
unscheduled load. Most of the load demands in unscheduled
−6 load profile are compensated with PV than that of fixed time
−8 range scheduling.
−10 Rather than compensating the load demand fully, most of the
0 20 40 60 80 100 120
Hours of the day PV generation in fixed time scheduling is exported to utility
grid. The cost of energy that utility should pay the users in a
Fig. 8. The net daily load profile pattern integrated with PV energy under system integrated with PV using fixed time range scheduling
fixed time scheduling (1 unit = 12 minutes). is 2.35% higher than the energy cost in unscheduled load. A
comparative analysis is presented in Table VIII.
As seen from Fig. 9, the net peak load demand during
the day is 3.225 kW, while the net peak PV generation after E. Restricted Time Scheduling Integrated with PV
satisfying the load demand is−4.3566 kW per slot in 24 hours
time horizon. PAR is 3.9121. The total energy demand per day In restricted time range scheduling, the same parameters
of appliance is 19.785 kWh. The energy amount of 8.69852 used in case 2 and Table V is used with the integration of
kWh can be sold to utility grid based on TOUP tariff. The net 5 kW PV generation. Fig. 10 shows the net daily load demand
energy imported from utility grid in kWh is 0. For example, pattern. The upper portion, which is greater than zero, shows
if the shifting of the position of washing machine in day time the import from the utility grid. The lower portion, which is
during peak hours from 9:00 AM to 11:48 AM is done then negative, shows the export to utility grid.
the cost of electricity is rise to−3.82 Birr/day, which is 149% In a system, integrated with PV, energy cost in restricted
higher than the unscheduled load without PV. The PAR is time range scheduling is 4.86% and PAR is 1% lower than
3.7523, which is 34% lower than the unscheduled load without fixed time range scheduling, respectively by GWO scheduler.
integrating with PV energy production. The net daily peak power demand from utility grid with PV
Daily demand pattern for un-scheduled load is shown in integration using GWO is 3.1909 kW and by PSO scheduler is
Fig. 9. Energy cost, peak load and PAR are lower in home 3.225 kW within a day. The minimum PAR with PV using
TABLE VIII
DETAIL SUMMERY AND RESULT COMPARISON BETWEEN FIXED TIME RANGE SCHEDULING AND A SYSTEM WITHOUT SCHEDULING
Type of scheduling mechanism Working Mechanism Peak load PAR Energy cost (Birr/day)
Utility paid for the user User paid to the grid
Unscheduled System without integration of PV 4.725 5.7316 0 7.7853
System integrated with PV 3.0933 3.7523 3.82 0
Fixed time scheduling System without integration of PV 3.875 4.7005 0 6.5568
System integrated with PV 3.225 3.9121 3.9121 0
42
MOLLA et al.: INTEGRATED OPTIMIZATION OF SMART HOME APPLIANCES WITH COST-EFFECTIVE ENERGY MANAGEMENT SYSTEM 257
Daily load demand with Daily load demand with VII. C ONCLUSION
PV using PSO PV using GWO
10 10 In this work, a smart home appliance scheduling problem
8 Net Load demand 8 Net Load demand
Net energy export Net energy export is formulated and optimized using novel restricted and multi-
6 6
Load demand (kW)
43
258 CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, VOL. 5, NO. 2, JUNE 2019
44
APPENDIX – 2
SCREEN SHOTS
45
Customer Details
46
Bill Details
47
Appliance Details
48
GUI without data entered
49
GUI with predicted result
50
APPENDIX-3
PUBLICATIONS
51
ISSN (Online) 2278-1021
IJARCCE ISSN (Print) 2319-5940
Abstract:
Conversation of Electricity is a must and it can be done by start predicting the electricity consumption. The electricity
consumption is higher than the required amount. Existing system uses mysql data for datasets and mathematical
calculation for the prediction process which takes longer time for processing and prediction through mathematical
calculations takes more time and accuracy is lesser. The above problem can be resolved by using big data processing for
large collection of data and with that data it is been used for training the machine using the random forest algorithm for
predicting the electricity consumption. Through that we enter the application’s usage time accordingly in the GUI which
takes the input and checks with the machine and gives the predicted range of the electricity consumption in units with
the price.
Keywords: Big Data, Hadoop tool kit, Random forest algorithm, GUI.
I. INTRODUCTION
Energy conservation is the effort made to reduce the consumption of energy by using less of an energy service. This can
be achieved either by using energy more efficiently (using less energy for a constant service) or by reducing the amount
of service used (for example, by driving less). Energy conservation is a part of the concept of Eco-sufficiency. The
electricity is also a type of energy which should be conserved by using low electric consumption application or application
that consumes electricity efficiently by storing the excess power in a battery and use them for future usage. Due to increase
in global warming every where they have started to adopt renewable energy like solar, wind where the half of the energy
is been lost while converting the solar or wind energy into electricity. Electricity can be conserved by predicting the
electricity usage and check with the application independently so that the application can be used according to the need
and by reducing its usage efficiently. For the prediction the data are been analysed and processed in a data set format for
the machine to use the data as input for training and testing. The Hadoop tool is been used for pre-processing and analyse
the large set of data. The random forest algorithm is been used for training the machine, where the random forest algorithm
is a supervised learning method. The supervised learning is a method where the input and the output is been given to the
system for the training and then it is compared to the test data for checking its accuracy.
Existing System supervises the given backend by utilizing MySQL which contains heap of disadvantages i.e.
information prerequisite is that preparing time is high when the information is tremendous and once information is lost,
it can't recoup . In the prediction part it uses normal mathematical calculations it causes higher time consumption when
large data is been processed it gives lesser accuracy.
.
Proposed concept deals with providing database by using Hadoop tool it can analyse with no limitation of data
and simply add number of machines to the cluster and get the results with less time, high throughput and maintenance
cost is very less and we are using partitions and bucketing techniques in Hadoop. For the prediction and analyses part
uses random forest algorithm it calculates faster with higher accuracy. The random forest algorithm is a supervised
learning where we could train the machine with desired training data for the desired output.
52
ISSN (Online) 2278-1021
IJARCCE ISSN (Print) 2319-5940
This system is designed to predict the electricity consumption of home applications. The system creates a table and
loads the data from the csv and the hdfs filesystems through the hive execution engine the data is been ordered and
analysed the true data only then they are been partitioned accordingly to their columns and they are bucketed for
their values and stored as a csv file data. The data set is been analysed and the required dataset is been taken as a input
accordingly to train the system for the required output. The trained dataset is been predicted accordingly to their
trained data then the results are been checked and then it gets as the output in the GUI application.
V. SYSTEM IMPLEMENTATION
53
ISSN (Online) 2278-1021
IJARCCE ISSN (Print) 2319-5940
• Prediction:
The pre-processed data is been taken in as input for training the machine using the Random forest algorithm
then the trained set is been checked using test set of data where the accuracy is been checked after testing the
input for the user is been used as input and the machine runs according to its trained data set and it gives the
output in the GUI application.
VI. CONCLUSION
Here the Power Distribution to Datacenters data is help to give awareness to how much power consumption in different
datacenters. To run a analysis on the Power Distribution to Datacenters data in hadoop ecosystem. Then the local data
set of electricity consumption is been added into the Hadoop ecosystem for analysis. Hadoop ecosystem uses hive, pig,
map reduce to analysis faster than those data are been organized and been used in for the random forest for the bill
prediction.
REFERERENCES
[1] H. P. Khomami and M. H. Javidi, “An efficient home energy management system for automated residential demand
response,” in Proceedings of the 13th International Conference on Environment and Electrical Engineering, Wroclaw,
Poland, 2013, pp. 307–312.
[2] S. Shah, R. Khalid, A. Zafar, S. M. Hussain, H. Rahim, and N. Javaid, “An optimized priority enabled energy
management system for smart homes,” in Proceedings of the 31st International Conference on Advanced Information
Networking and Applications (AINA), Taipei, China, 2017, pp. 1035–1041
[3] Z. Zhao, W. C. Lee, Y. Shin, and K. B. Song, “An optimal power scheduling method for demand response in home
energy management system,” IEEE Transactions on Smart Grid, vol. 4, no. 3, pp. 1391–1400, Sep. 2013.
[4] Z. R. Dogaheh and M. R. Dogaheh, “Optimizing the performance of smart grids in relation with residential energy
centers equipping with Solar Power Units (PV),” Sindhological Studies, no. 1, pp. 14–22, 2017.
54
E-Certificates
55
56
57
58
REFERENCES
[1] S. D. Beigvand, H. Abdi, M. La Scala, “A general model for energy hub economic
dispatch,” Appl. Energy, vol. 190, pp. 1090-1111, Mar. 2017.
[2] M. Geidl. Integrated modeling and optimization of multi-carrier energy systems. Ph.D.
Thesis. Austria: TU Graz, Styria; 2007.
[3] M. Geidl and G. Andersson, “Optimal power flow of multiple energy carriers,” IEEE Trans.
Power Syst., vol. 22, no. 1, pp. 145-155, Feb.2007..
[4] R. Lasseter and P. Piagi, “Microgrid: A Conceptual Solution,” in IEEE Annual Power
Electronics Specialists Conference, 2004
[5] Microsoft looking to test grid-independent data center,
http://www.datacenterdynamics.com.
[6] Q. Deng, D. Meisner, A. Bhattacharjee, T. F. Wenisch, and R. Bianchini, “Coscale:
Coordinating cpu and memory system dvfs in server systems,” in Proceedings of the
IEEE/ACM International Symposium on Microarchitecture, December 2012.
[7] R. Teodorescu and J. Torrellas, “Variation-Aware Appli-cation Scheduling and Power
Management for Chip Multi-processors,” in International Symposium on Computer
Architecture, June 2008.
[8] Rangan, G. Wei and D. Brooks, “Thread Motion: Fine-Grained Power Management for
Multi-Core System,” in International Symposium on Computer Architecture, June 2009.
[9] D. J. Sorin, O. Matthews and M. Zhang, “Architecting Dynamic Power Management to be
Formally Verifiable,” in Annual Design Automation Conference, June 2014.
[10] S. Kirkpatrick, “Optimization by Simulated Annealing: Quantitative Studies,” Journal of
Statistical Physics, vol. 34, no. 5, 1984.
59