Big Data Black Book PDF

BIG DATA
Covers Hadoop 2, MapReduce, Hive, YARN, Pig,

R and Data Visualization
THIS BOOK AIMS TO:

 Acquaint the readers with the entire data analytics
lifecycle
 Familiarize the readers with the role and use of Big
Data in various relevant industries through case
studies
 Provide complete technical know-how of basic and
advanced Big Data analytics and data visualization
techniques used to analyze data, and provide
business insights
 Give hands-on experience of working with Big Data
analytics tools on datasets, including R and Hadoop
 Enable readers to develop MapReduce and Pig
programs, manipulate distributed files, and
understand APIs supporting MapReduce programs
ISBN: 9789351197577 | Author: DT Editorial Services
ABOUT THE AUTHOR

ABOUT THE BOOK DT Editorial Services has seized the market of computer books,
Big Data is one of the most bringing excellent content in software development to the fore.
popular buzzwords in The team is committed to excellence—excellence in the quality of
technology industry today. content, excellence in the dedication of its authors and
Organizations worldwide have editors, excellence in the attention to detail, and excellence
realized the value of the immense
in understanding the needs of its readers.
volume of data available, and are
trying their best to manage, analyse,
and unleash the power of data to build THE BOOK COVERS:
strategies and develop a competitive  Overview of Big Data
edge. At the same time, the advent of
 Big Data in Business Context
the technology has led to the evolution of a
variety of new and enhanced job roles.  Hadoop Ecosystem
 MapReduce Fundamentals
The objective of this book is to create a new  Big Data Technologies
breed of versatile Big Data analysts and
developers, who are thoroughly conversant with
 Data Processing with MapReduce
the basic and advanced analytic techniques for  YARN, Hive, and Pig
manipulating and analysing data, the Big Data  Data manipulation using R
platform, and the business and industry  Functions and Packages in R
requirements to be able to participate productively in  Graphical Analyses in R
Big Data projects.  Big Data Visualization Techniques
` 799/-
946 PAGES
/dtechpress /dtechpress /dreamtechpress dreamtechpress.wordpress.com
TABLE OF CONTENTS
1: Getting an Overview of Big Data 11: Testing and Debugging MapReduce Applications  Using the scan() Command
 What is Big Data?  Performing Unit Testing for MapReduce  Reading Multiple Data Values from Large Files
 History of Data Management – Evolution Applications  Reading Data from R Studio
of Big Data  Performing Local Application Testing with Eclipse  Exporting Data from R
 Structuring Big Data, Elements of Big Data  Logging for Hadoop Testing
21: Manipulating and Processing Data in R
 Big Data Analytics, Careers in Big Data  Application Log Processing
 Selecting the Most Appropriate Data Structure
 Future of Big Data  Defensive Programming in MapReduce
 Creating Data Subsets, Merging Datasets in R
2: Exploring the Use of Big Data in Business Context 12: Understanding Hadoop YARN Architecture  Sorting Data, Putting Your Data into Shape
 Use of Big Data in Social Networking  Background of YARN, Advantages of YARN  Managing Data in R Using Matrices
 Use of Big Data in Preventing Fraudulent  YARN Architecture, Working of YARN  Managing Data in R Using Data Frames
Activities  YARN Schedulers
22: Working with Functions and Packages in R
 Use of Big Data in Detecting Fraudulent  Backward Compatibility with YARN
 Using Functions Instead of Scripts
Activities in Insurance Sector  YARN Configurations, YARN Commands
 Using Arguments in Functions
 Use of Big Data in Retail Industry  Log Management in Hadoop 1
 Built‐in Functions in R, Introducing Packages
3: Introducing Technologies for Handling Big Data 13: Exploring Hive  Working with Packages
 Distributed and Parallel Computing for Big Data  Introducing Hive, Getting Started with Hive
23: Performing Graphical Analysis in R
 Introducing Hadoop  Data Types in Hive, Built‐In Functions in Hive
 Using Plots, Saving Graphs to External Files
 Cloud Computing and Big Data  Hive DDL, Data Manipulation in Hive
 In‐Memory Computing Technology for Big Data  Data Retrieval Queries, Using JOINS in Hive 24: Integrating R and Hadoop and Understanding Hive
 RHadoop―An Integration of R and Hadoop
4: Understanding Hadoop Ecosystem 14: Analyzing Data with Pig
 Text Mining in RHadoop
 Hadoop Ecosystem  Introducing Pig, Running Pig
 Data Analysis Using the MapReduce Technique in
 Hadoop Distributed File System  Getting Started with Pig Latin Rhadoop, Data Mining in Hive
 MapReduce, Hadoop YARN, Hbase, Hive  Working with Operators in Pig
 Pig and Pig Latin, Sqoop, ZooKeeper 25: Data Visualization‐I
 Working with Functions in Pig
 Flume, Oozie  Introducing Data Visualization
15: Using Oozie  Techniques Used for Visual Data Representation
5: Understanding MapReduce Fundamentals and  Introducing Oozie  Types of Data Visualization
HBase  Installing and Configuring Oozie  Applications of Data Visualization, Visualizing Big
 The MapReduce Framework  Understanding the Oozie Workflow Data, Tools Used in Data Visualization,
 Techniques to Optimize MapReduce Jobs  Oozie Coordinator, Oozie Bundle  Tableau Products
 Uses of MapReduce  Oozie Parameterization with EL
 Role of HBase in Big Data Processing 26: Data Visualization with Tableau (Data
 Oozie Job Execution Model
Visualization‐II)
6: Understanding Big Data Technology Foundations  Accessing Oozie, Oozie SLA
 Introduction to Tableau Software
 Exploring the Big Data Stack 16: NoSQL Data Management  Tableau Desktop Workspace
 Virtualization and Big Data  Introduction to NoSQL, Aggregate Data Models  Data Analytics in Tableau Public
 Virtualization Approaches  Key Value Data Model, Document Databases  Using Visual Controls in Tableau Public
7: Storing Data in Databases and Data Warehouses  Relationships, Graph Databases
27: Social Media Analytics and Text Mining
 RDBMS and Big Data  Schema‐Less Databases, Materialized Views
 Introducing Social Media
 Non‐Relational Database, Polyglot Persistence  Distribution Models, Sharding
 Introducing Key Elements of Social Media
 Integrating Big Data with Traditional Data  MapReduce Partitioning and Combining
 Introducing Text Mining
Warehouses  Composing MapReduce Calculations
 Understanding Text Mining Process
 Big Data Analysis and Data Warehouse 17: Understanding Analytics and Big Data  Sentiment Analysis
 Changing Deployment Models in Big Data Era  Comparing Reporting and Analysis  Performing Social Media Analytics and Opinion
8: Storing Data in Hadoop  Types of Analytics Mining on Tweets
 Introducing HDFS, Introducing HBase  Points to Consider during Analysis
28: Mobile Analytics
 Combining HBase and HDFS  Developing an Analytic Team
 Introducing Mobile Analytics
 Selecting the Suitable Hadoop Data  Understanding Text Analytics
 Introducing Mobile Analytics Tools
Organization for Applications 18: Analytical Approaches and Tools to Analyze Data  Performing Mobile Analytics
9: Processing Your Data with MapReduce  Analytical Approaches, History of Analytical Tools  Challenges of Mobile Analytics
 Recollecting the Concept of MapReduce  Introducing Popular Analytical Tools
29: Finding a Job in the Big Data Market
Framework  Comparing Various Analytical Tools, Installing R
 Importance and Scope of Big Data Jobs
 Developing Simple MapReduce Application 19: Exploring R  Big Data Opportunities
 Points to Consider while Designing MapReduce  Exploring Basic Features of R, Exploring RGui  Skill Assessment for Big Data Jobs
10: Customizing MapReduce Execution  Exploring RStudioHandling Basic Expressions in R  Roles and Responsibilities in Big Data Jobs
 Controlling MapReduce Execution with  Variables in R, Working with Vectors  Gaining a Foothold in the Big Data Market
InputFormat  Storing and Calculating Values in R  Basic Educational Requirements for Big Data Jobs
 Reading Data with Custom RecordReader  Creating and Using Objects  Basic Technological Requirements for Big Data
 Organizing Output Data with OutputFormats  Interacting with Users Jobs, Tools Supporting Big Data
 Customizing Data with RecordWriter  Handling Data in R Workspace  Consultants and In‐House Specialists in Big Data
 Optimizing MapReduce Execution with  Executing Scripts, Creating Plots  Tactics for Searching Big Data Jobs
Combiner  Accessing Help and Documentation in R  Preparing for Interviews
 Controlling Reducer Execution with Partitioners  Using Built‐in Datasets in R  Obtaining Big Data Jobs through Social Media
 Implementing a MapReduce Program for 20: Reading Datasets and Exporting Data from R
Sorting Text Data  Using the c() Command
Books are available on:
DREAMTECH PRESS WILEY INDIA PVT. LTD. Distributed by:
Published by: 19-A, Ansari Road, Daryaganj 4435-36/7, Ansari Road, Daryaganj
New Delhi-110 002, INDIA New Delhi-110 002, INDIA
Tel: +91-11-2324 3463-73, Fax: +91-11-2324 3078 Tel: +91-11-4363 0000, Fax: +91-11-2327 5895
Email: feedback@dreamtechpress.com Email: csupport@wiley.com
Website: www.dreamtechpress.com Website: www.wileyindia.com
Regional Offices: Bangalore: Tel: +91-80-2313 2383, Fax: +91-80-2312 4319, Email: blrsales@wiley.com
Mumbai: Tel: +91-22-2788 9263, 2788 9272, Telefax: +91-22-2788 9263, Email: mumsales@wiley.com
/dtechpress /dtechpress /dreamtechpress dreamtechpress.wordpress.com

Big Data Black Book PDF

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Big Data Black Book PDF

Uploaded by

Copyright:

Available Formats

BIG DATA

Covers Hadoop 2, MapReduce, Hive, YARN, Pig,

THIS BOOK AIMS TO:

ISBN: 9789351197577 | Author: DT Editorial Services

ABOUT THE AUTHOR

/dtechpress /dtechpress /dreamtechpress dreamtechpress.wordpress.com

You might also like