This action might not be possible to undo. Are you sure you want to continue?
Gaurav Gupta1, Geetika Hans2, Tamanna Sehgal3
Sr. Lecturer in Deptt of Computer Sc. & Engg., RIMT – IET. Mandi Gobindgarh. email@example.com 2 Technical Associate, Tech Mahindra. firstname.lastname@example.org 3 Lecturer in Computer Sc. & Engg., CIET, Rajpura. email@example.com with data warehousing and database systems, the standardization of data mining languages, and data privacy protection and security.
The advent of computing technology has significantly influenced our lives and two major impacts of this effect are Business data Processing and Scientific Computing. During the early years of the development of computer techniques for business, computer professionals were concerned with designing files to store the data so that information could be efficiently retrieved. There were restrictions on storage size for storing data and on the speed of accessing the data. Needless to say, the activity was restricted to a very few, highly qualified professionals. Then came an era when Database Management System simplified the task. The responsibility of intricate tasks, such as declarative aspects of the programs was passed on to the database administrator and the user could pose his query in simpler languages such as query languages. Thus almost any business-small, medium or large scale began using computers for day-today activities. Now what is the use of all this data? Up to the early 1990’s the answer to this was “NOT much”. No one was really interested in utilizing data, which was accumulated during the process of daily activities. As a result a new discipline in computer science, Data Mining gradually evolved.
Data mining is the process of extraction of interesting (nontrivial, implicit, previously unknown and potentially useful) patterns or knowledge from huge amount of data. It is the set of activities used to find new, hidden or unexpected patterns in data or unusual patterns in data. Using information contained within data warehouse, data mining can often provide answers to questions about an organization that a decision maker has previously not thought to ask. • • • • • Which products should be promoted to a particular customer? What is the probability that a certain customer will respond to a planned promotion? Which securities will be most profitable to buy or sell during the next trading session? What is the likelihood that a certain customer will default or pay back a schedule? What is the appropriate medical diagnosis for this patient?
Data Mining is the exploration and analysis of large sets, in order to discover meaningful patterns and rules. The key idea is to find effective ways to combine computers power to process data with the human eye’s ability to detect patterns. The techniques of data mining are designed for work best with large data sets. Since Data Mining is a young discipline with wide and diverse applications, there is still a nontrivial gap between general principals of Data Mining and domain specific, effective Data Mining tools for particular applications. In this paper we will illustrate a few application domains of Data Mining (such as finance, the retail industry and telecommunication) and Trends in Data Mining which include further efforts towards the exploration of new application areas and new methods for handling complex data types, algorithms scalability, constraint based mining and visualization methods, the integration of data mining
These types of questions can be answered surprisingly easily if the information hidden among the petabytes of data in your databases can be located and utilized. In the following paragraphs we will discuss about the applications and trends in the fields of data mining.
As data mining matures, new and increasingly innovative applications for it emerge. Although a wide variety of data mining scenarios can be described. For the purpose of this paper the applications of data mining are divided in the following categories: • • • Healthcare Finance Retail industry
Web mining is especially exciting because it enables tasks that were previously difficult to implement. which facilitates systematic data analysis and data mining. especially due to the increasing ease. For example it can also help in fraud detection by detecting a group of people who stage accidents to collect on insurance money.• • • Telecommunication Text Mining & Web Mining Higher Education business involved. Some also offer insurance services and stock services. Finance Most banks and financial institutions offer a wide variety of banking services (such as checking. Retail industry provides a rich source for data mining. Healthcare The past decade has seen an explosive growth in biomedical research. identify telecommunication patterns. For example the search engines work on the principle of data mining. prevention and treatment. Data mining has quickly emerged as a highly desirable tool for using current reporting capabilities to uncover and understand hidden patterns in vast databases. They can be configured to monitor and gather data from a wide variety of locations and can analyze the data across one or multiple sites. Institutions can better address these students and alumni through the analysis and presentation of data. the telecommunication market is rapidly expanding and highly competitive. catch fraudulent activities. and improve the quality of service Text Mining and Web Mining Text mining is the process of searching large volumes of documents from certain keywords or key phrases. Recent research in DNA analysis has led to the discovery of genetic causes for many diseases and disabilities as well as approaches for disease diagnosis. By searching literally thousands of documents various relationships between the documents can be established. stock market. availability and popularity of the business conducted on web. It enhances the web site with intelligent behavior. pager. Internet and numerous other means of communication and computing are underway. saving. cellular phone. and the applications of data mining to solve large applications large application problems are important tasks for data mining researches and data mining system and application developers. An extension of text mining is web mining. Financial data collected in the banking and financial industry is often relatively complete. The design of a standard data mining languages. telecommunication and many more. goods transportation and consumption and service records and so on. make better use of resources. fraud detection. Retail Industry Retail industry collects huge amount of data on sales. Higher Education An important challenge that higher education faces today is predicting paths of students and alumni. Retail data mining can help identify customer behavior. ranging from the development of new pharmaceuticals and in cancer therapies to the identification and study of human genome by discovering large scale sequencing patterns and gene functions. The integration of telecommunication. But as data mining is becoming more popular it is gaining wide acceptance in other fields also such as biomedicine. the construction of interactive and integrated data mining environments. to overcome the competitors. improve the quality of customer service. computer network. achieve better customer retention and satisfaction. Web mining is an exciting new field that integrates data and text mining within a website. computer and web data transmission and other data traffic. or e-commerce. credit (such as business. the development of effective and efficient data mining methods and systems. and automobile loans). and business and individual customer transactions). fax. reliable and high quality. e-mail. discover customer shopping patterns and trends. Moreover. Using text mining however. This creates a great demand from data mining in order to help understand TRENDS As different types of data are available for data mining tasks. The quantity of data collected continues to expand rapidly. Telecommunication The telecommunication industry has quickly evolved from offering local and long distance telephone services to provide many other comprehensive communication services including voice. images. mortgage. enrollment management and time-todegree. customer shopping history. so data mining approaches poses many challenging research issues in data mining. enhance goods consumption ratios design more effective goods transportation and distribution policies and reduce the cost of business. continue to exert pressure on colleges to search for new and faster solutions. such as suggesting related links or recommending new products to the consumer. we can easily derive certain patterns in the comments that may help identify a common set of customer perceptions not captured by the other syrvey questions. And many . Which student will enroll in particular course programs? Who will need additional assistance in order to graduate? Meanwhile additional issues. with the deregulation of the telecommunication industry in many countries and the development of new computer and communication technologies. and investment services (such as mutual funds). Here we will discuss some of the trends in data mining that reflect the pursuit of these challenges: Application Exploration Earlier data mining was mainly used for business purpose.
Visual data mining It is rightly said a picture is worth a thousand words. the web also poses great challenges for efficient resource and knowledge discovery. data warehouse systems. Based on the above facts. advertisements. SGI Mineset. Mining and Visualization Core Concepts by George M. sequence and text data poses an important research area in field of data mining. the integration of data mining with data warehousing and database systems. But we are drowning in data but deficient of knowledge Data is useless. New methods for mining complex types of data The complex types of data like geospatial. algorithms scalability. providing a huge source for data mining. One important direction towards improving the repair efficiency of the timing process while increasing user interaction is constraint-based mining. multimedia. There is still a huge gap between the needs for these applications and the available technology. Combination of data mining with database systems. online analytical processing and online analytical mining should be integrated into one unified framework. science. SAS Enterprise Miner. effective data mining tools for particular applications. Data Mining Concepts and Techniques – Jiawei Han & Micheline Kamber Modern Data Warehousing. government. The data mining methods should be more interactive and user friendly. but as data is expanding at a massive rate. time series. In addition for data mining for business continues to expand as e-commerce and marketing becomes mainstream elements of the retail industry. Web mining The World Wide Web is huge collection of globally distributed collection of news. financial. NEED OF DATA MINING The massive growth of data from terabytes to perabytes is due to the wide availability of data in automated form from various sources as WWW. Visual data mining is an effective way to discover knowledge from huge amounts of data.new explorations are being done for this purpose. REFERENCES 1. Scalable data mining methods The current data mining methods capable of handling only a particular type of data and limited amount of data. there is a need to develop new data mining methods which are scalable and can handle different types of data and large volume of data. Marakas. and data privacy protection and security. and WWW are loaded with huge amounts of data and have thus become the major information processing systems. The desired architecture for data mining system is the tight coupling with database and data warehouse systems. improved interpretability among multiple data mining systems and functions. data warehouse systems. Business. A lot has been done in this field and lot more need to be done. the standardization of data mining languages. Society and many more. IBM Intelligent Miner. Transaction management query processing. and web database systems Database systems. Standardization of data mining language: Today few data mioning languages are commercially available in the market like Microsoft’s SQL server 2005. e-commerce and many other services. there is still a nontrivial gap between general principles of data mining and domain specific. . if it cannot deliver knowledge. the retail industry and telecommunication) and Trends in Data Mining which include further efforts towards the exploration of new application areas and new methods for handling complex data types. CONCLUSION Since data mining is a young discipline with wide and diverse applications. It is important to make sure that data mining serves as essential data analysis component that can be easily included in to such an information-processing environment. consumer records. The systematic study and development of visual data mining techniques will promote the use for data mining analysis. So if the result of the mined data can be shown in the visual form it will further enhance the worth of the mined data. A few application domains of Data Mining (such as finance. Clementine . The WWW also contains huge and dynamic collection hyper linked information. constraint based mining and visualization methods. This provide user with more control by allowing the specification and use of constraints to guide data mining systems in their search for interesting patterns. DBMiner and many more but a standard data mining language or other standardization efforts will provide the orderly development of data mining solutions. 2. That is why data mining is gaining wide acceptance in today’s world. education.
This action might not be possible to undo. Are you sure you want to continue?
We've moved you to where you read on your other device.
Get the full title to continue listening from where you left off, or restart the preview.