Professional Documents
Culture Documents
Preoject Phase 1 Final Report
Preoject Phase 1 Final Report
submitted in the partial fulfilment of the requirement for the award degree of
BACHELOR OF ENGINEERING
IN
COMPUTER SCIENCE AND ENGINEERING
Submitted By
CERTIFICATE
I express my heartfelt sincere gratitude to Dr. SRIDHAR, Professor & Head of Department
of Computer Science and Engineering, AMRUTA INSTITUTE OF ENGINEERING AND
MANAGEMENT SCIENCES, for his valuable suggestions and support.
Finally, I would like to thank all the Teaching Technical faculty and supporting staff members
of Department of Computer Science and Engineering, AMRUTA INSTITUTE OF
ENGINEERING & MANAGEMENT SCIENCES, for their support.
Our project aims to improve the accuracy of Earthquake Early Warning systems by means of machine
learning. These are designed to detect and characterize medium and large earthquakes before their
damaging effects reach a certain location. Traditional EEW methods based on seismometers fail to
accurately identify large earthquakes due to their sensitivity to the ground motion velocity. The recently
introduced high-precision GPS stations, on the other hand, are ineffective to identify medium
earthquakes due to its propensity to produce noisy data. In practice, EEW can be seen as a typical
classification problem in the machine learning field: multisensory data are given in input, and
earthquake severity is the classification result. It is distinguished from earthquake prediction in that the
earthquake has nucleated to provide detectable ground motion when and EEW is issued. In this project,
we introduce the Distributed Multi-Sensor Earthquake Early Warning (DMSEEW) system, a novel
machine learning-based approach that combines data from both types of sensors (GPS stations and
seismometers) to detect medium and large earthquakes.
INDEX
Acknowledgement
Abstraction
01 Introduction 1-2
04 Objectives 7-9
05 Methodology 10-12
06 Expected Results 13
Conclusion 14
References 15
Seismowatch Advance Guard Using Machine Learning 2023-24
CHAPTER 1
INTRODUCTION
Earthquakes cause substantial loss of life and damage to the built environment across areas
spanning hundreds of kilometers from their origins. These large ground motions often lead to
hazards such as tsunamis, fires and landslides. Earthquake early warning (EEW) is the delivery
of ground shaking alerts or warnings. It is distinguished from prediction in that the earthquake
has nucleated to provide detectable ground motion when an EEW is issued. The warning time
available is the time between detection and when ground motion is experienced by a user.
Potential warning times are therefore seconds to minutes. Likewise, the time available to collect
and process geophysical data and deliver alerts is seconds to minutes, and the actions of users
must be possible in seconds to minutes. To mitigate the disastrous effects, a number of
Earthquake Early Warning (EEW) systems have been built around the world (Allen and Melgar
2019). These critical systems, operating 24/7, are expected to automatically detect and
characterize earthquakes as they happen, and to deliver alerts before the ground motion actually
reaches sensitive areas so that protective measures could be taken. A recent review by (Allen
and Melgar 2019) identified the detection of the whole spectrum of earthquakes with damaging
potential and particularly large earthquakes as an outstanding problem in the field of EEW. An
EEW system needs to be able to detect both medium (5 ≤ magnitude < 6, Richter scale) and
large earthquakes (6 ≤ magnitude, Richter scale). Depending on the distance from the origin of
the earthquake, both of these can cause serious damages. Seismometers, which have long been
the bulwark of seismology to detect earthquakes, have a difficulty to detect and characterize
large earthquakes (Melgar et al. 2013) due to a well-known saturation issue caused by their
sensitivity to ground motion velocity. As a result, earthquakes over magnitude 7.5 tend to be
underestimated. A promising solution to this issue (Melgar et al. 2015) emerged with novel high-
precision Global Positioning System (GPS) sensors, with their millimeter to centimeter
accuracy. Traditionally, inertial seismometers are used to detect primary waves. The inertial
mass is designed to remain stationary following sudden movements while the frame and drum
move with the ground to record waves.
Rapid growth in the quantity of seismic data has posed challenges for processing and analysis
techniques in modern seismology. Many popular techniques used in major data centres and
seismological research originated from the age when seismic data were small and computational
power was limited. Today machine learning provides a large collection of tools to extract valuable
information from voluminous data. Trained with sufficient data, machine learning algorithms have
human-like capability to recognize natural objects and make expert-level decisions in various
disciplines. In addition, although the computational cost for training is usually high, their online
operation is low cost (e.g., Perol et al., 2018). These advantages make machine learning particularly
suitable for applications in real-time seismology and earthquake early warning.
CHAPTER 2
LITERATURE SURVEY
CHAPTER 3
PROBLEM STATEMENT
➢ A recent review by (Allen and Melgar 2019) identified the detection of the whole spectrum
of earthquakes with damaging potential and particularly large earthquakes as an
outstanding problem in the field of EEW.
➢ An EEW system needs to be able to detect both medium (5 ≤ magnitude < 6, Richter scale)
and large earthquakes (6 ≤ magnitude, Richter scale).
➢ Depending on the distance from the origin of the earthquake, both of these can cause
serious damages. Seismometers, which have long been the bulwark of seismology to detect
earthquakes, have a difficulty to detect and characterize large earthquakes (Melgar et al.
2013) due to a well-known saturation issue caused by their sensitivity to ground motion
velocity. As a result, earthquakes over magnitude 7.5 tend to be underestimated.
➢ A promising solution to this issue (Melgar et al. 2015) emerged with novel high- precision
Global Positioning System (GPS) sensors, with their millimeter to centimeter accuracy
when measuring high ground motion velocity.
➢ However, GPS are unable to characterize medium earthquakes, as they are prone to
containing significant signals from a variety of noise sources, mostly of atmospheric origin.
➢ Integrating and processing high-frequency data streams from multiple sensors scattered
over a large territory in a timely manner requires high-performance computing techniques
and equipment’s.
➢ EEW systems are designed to detect and characterize medium and large earthquakes before
their damaging effects reach a certain location. Traditional EEW methods based
on seismometers fail to accurately identify large earthquakes due to their sensitivity to the
CHAPTER 4
OBJECTIVES
• Seismic Sensors: Networks of seismic sensors are strategically placed in regions prone
to earthquakes. These sensors detect ground motion caused by seismic waves.
• P-wave Identification: When an earthquake occurs, the sensors detect the initial P-
waves, which are characterized by their compressional motion and higher frequency
compared to other seismic waves.
➢ Data Processing:
• Establish a robust data processing pipeline to handle seismic information efficiently.
Clean and preprocess seismic data to ensure high-quality input for subsequent stages.
• Data Collection: The seismic sensors continuously collect and transmit raw data,
capturing the movement of the Earth in multiple directions. This data is time-stamped
and sent to centralized servers for analysis.
• Real-time Data Transmission: The data from multiple sensors are collected in real-time
and sent to a central processing system, ensuring swift access to information from
various locations.
• Feature Extraction: Various features of the seismic waves are extracted from the
preprocessed data, such as wave amplitude, frequency content, and arrival times of
different wave types (like P-waves and S-waves).
➢ Database Creation:
• Create comprehensive databases for training, testing, and validating neural networks.
Organize the datasets to reflect a diverse and representative range of seismic
conditions.
• Indexing and Organization: The database is organized and indexed based on various
parameters like time, location, magnitude, and waveform characteristics.
• Setting Thresholds: Thresholds are established based on the characteristics of the recorded
data. These thresholds determine the minimum level of ground motion required to trigger
an alert. They are set by analyzing the amplitude, frequency, and duration of seismic waves
during typical non-seismic activities.
• Event Confirmation: Once seismic activity surpasses the established thresholds and passes
initial filtering, the system proceeds with further analysis to confirm the event's nature,
location, depth, and magnitude.
➢ Prediction and Alert System:
• Integrate the neural network model with a real-time prediction system. Develop an alert
mechanism that triggers when seismic activity exceeds the established threshold.
➢ Communication with Authorities:
• Implement a communication protocol to send alerts promptly to the relevant
authorities. Ensure seamless integration with existing alert systems for efficient
dissemination of information.
• Alert Generation: When seismic activity is detected the early detection system generate
alerts based on the analysis of seismic data, estimating the earthquake's location,
magnitude, and potential impact.
• Notification Protocols: Alerts are transmitted via various communication channels such as
dedicated emergency alert systems, specialized software, emails, SMS, phone calls, mobile
apps, sirens, radio, television broadcasts, and social media platforms.
CHAPTER 5
METHODOLOGY
➢ This project involves the creation of seismic sensor which is the combination of ADXL
senser and Global Positioning System (GPS).
➢ Using these two combinations we are detecting P-wave signals.
➢ The GPs and Seismic sensor are used for data collection.
➢ The ADXL series of sensors, developed by Analog Devices, are accelerometers that measure
acceleration in one, two, or three axes.
➢ These sensors are commonly used in various applications to detect changes in acceleration,
tilt, and motion.
➢ Global Positioning System (GPS) technology has become ubiquitous in various applications,
providing accurate positioning and navigation information.
➢ The data can be processed using Microcontrollers and Microcontroller are versatile
integrated circuits that are designed to execute specific tasks in embedded systems. They
find application in a wide range of electronic devices due to their ability to process data,
control operations, and interact with other components.
➢ Data storage is used to collect real time data & train the system.
➢ Here we are using a CNN. The CNN (Convolutional Neural Networks) are a type of deep
learning algorithm specifically designed for image recognition and processing.
➢ "Alert" and "Broadcast" are critical components for notifying the public and relevant
authorities about potential seismic threats.
ADXL
Sensor Data
Micro Storage CNN Alert
controller Broad
GPS casting
Seismic Sensor
Fig 5.1: Block Diagra
ADXL Sensor: The ADXL sensors can contribute to earthquake monitoring and early warning
systems, it's important to note that specialized seismometers are typically more precise for detecting
seismic events. These sensors are designed specifically for earthquake monitoring and can provide
more accurate data for seismic analysis. Nonetheless, accelerometers like the ADXL series can still
play a role in earthquake monitoring systems, especially when integrated into larger sensor network
GPS: GPS stations are installed in seismically active regions to measure movements in the Earth's crust.
These stations continuously monitor ground displacement, which can be indicative of tectonic plate
movements or strain accumulation along faults.
Micro controller: Microcontrollers interface with various sensors like accelerometers, seismometers,
GPS modules, or other seismic sensors. They collect raw data from these sensors that detect ground
motion, seismic waves, or other earthquake-related parameters. Microcontrollers compare the observed
seismic activity against predefined thresholds. When the activity exceeds these thresholds, it can trigger
further analysis or initiate alerts.
Data Storage: To ensure data integrity and availability, redundant storage and backup systems are
often employed. Multiple copies of the seismic data might be stored in different physical locations or
on separate servers to prevent data loss due to system failures or disasters.
CNN: While direct earthquake prediction remains challenging, CNNs might be involved in statistical
modeling using historical seismic data. They could assist in identifying trends or patterns that could
contribute to probabilistic seismic hazard assessments.
Alert Broad casting: Alert broadcasting in earthquake early warning systems aims to provide timely
information to individuals and authorities, allowing for quick responses that can mitigate risks and
reduce the impact of seismic events on lives and infrastructure.
SYSTEM REQUIREMENTS
➢ HARDWARE REQUIREMENTS
• ADXL Sensor
• Seismic Sensor
• Micro controller
• Data Storage
➢ SOFTWARE REQUIREMENTS
CHAPTER 6
EXPECTED RESULT
The expected result of implementing the outlined steps for P-Wave detection and earthquake
prediction is the development of a robust and effective earthquake early warning system. The
system should be capable of accurately detecting P-waves, a type of seismic wave that typically
precedes the more damaging S-waves and surface waves associated with earthquakes. Here's a
breakdown of the expected outcomes:
CONCLUSION
The use of machine learning methods in seismology is still in an early phase. One area of
development where it demonstrated promising results is earthquake early warning (EEW), i.e.
the characterization of an earthquake before it reaches sensitive areas. Current state-of-the-art
methods based on seismometers data only demonstrated an applicability limited to medium
earthquakes. In contrast, GPS based methods are only suitable for large earthquake detection.
We propose DMSEEW, a novel stacking ensemble approach for characterizing the whole
spectrum of earthquakes with damaging potential by combining both GPS and seismometer
data. Our evaluation on a real-world dataset collected with domain experts demonstrates that
the proposed distributed stacking ensemble approach improves the detection of both medium
and large earthquakes compared to traditional seismometer only approach and the combined
sensors (GPS and seismometers) baseline approach that adopts the rule of relative strength (F1
score: +7% and +6% on medium earthquakes, +45% and +27% on large earthquakes). In
addition, DMSEEW detects all large earthquakes with a precision of 100%. While existing
solutions rely on fully centralized processing of the sensor data, our approach assumes
distributed data processing based on a geographically distributed cyberinfrastructure. This
design significantly reduces the volume of data transmitted in the network, meets the real-time
requirements while increasing reliability of the EEW system.
REFERENCES
➢ Allen, R. M., and Melgar, D. 2019. Earthquake Early Warning: Advances, Scientific
Challenges, and Societal Needs. Annual Review of Earth and Planetary Sciences 47:361–
388
➢ Baydogan, M. G., and Runger, G. 2014. Learning a Symbolic Representation for
Multivariate Time Series Classification. Data Mining and Knowledge Discovery 29:400–
422.
➢ Bergstra, J.; Yamins, D.; and Cox, D. 2013. Making a Science of Model Search:
Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In
Proceedings of the 30th International Conference on Machine Learning
➢ Bergstra, J.; Bardenet, R.; Bengio, Y.; and Kegl, B. 2011. ´ Algorithms for Hyper-
Parameter Optimization. In Proceedings of the 24th International Conference on Neural
Information Processing Systems.