Practical Machine Learning: A New Look at Anomaly Detection
By Ted Dunning and Ellen Friedman
About this ebook
Finding Data Anomalies You Didn't Know to Look For
Anomaly detection is the detective work of machine learning: finding the unusual, catching the fraud, discovering strange activity in large and complex datasets. But, unlike Sherlock Holmes, you may not know what the puzzle is, much less what “suspects” you’re looking for. This O’Reilly report uses practical examples to explain how the underlying concepts of anomaly detection work.
From banking security to natural sciences, medicine, and marketing, anomaly detection has many useful applications in this age of big data. And the search for anomalies will intensify once the Internet of Things spawns even more new types of data. The concepts described in this report will help you tackle anomaly detection in your own project.
- Use probabilistic models to predict what’s normal and contrast that to what you observe
- Set an adaptive threshold to determine which data falls outside of the normal range, using the t-digest algorithm
- Establish normal fluctuations in complex systems and signals (such as an EKG) with a more adaptive probabilistic model
- Use historical data to discover anomalies in sporadic event streams, such as web traffic
- Learn how to use deviations in expected behavior to trigger fraud alerts
Ted Dunning
Ted Dunning is Chief Applications Architect at MapR Technologies and active in the open source community. He currently serves as VP for Incubator at the Apache Foundation, as a champion and mentor for a large number of projects, and as committer and PMC member of the Apache ZooKeeper and Drill projects. He developed the t-digest algorithm used to estimate extreme quantiles. T-digest has been adopted by several open source projects. He also developed the open source log-synth project described in the book Sharing Big Data Safely (O’Reilly). Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems, built fraud-detection systems for ID Analytics (LifeLock), and has been issued 24 patents to date. Ted has a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. Ted is on Twitter as @ted_dunning.