Transparent

Documents tagged with hadoop

Hadoop Performance Tuning

Impetus caters to diverse business needs using HPC based innovative solutions including software (Hadoop, Korus, Grids, Erlang) as well as hardware centric GPU (using CUDA) solutions. This paper ...
  • Pdf_16x16 13 pages
  • Impetus Technologies published this 1 day ago
  • 30 reads
  • 0 comments

Cloud Computing Without the Hype - An Executive Guide (1.00 Scribd)

Defining Cloud Computing and identifying the current players This document offers a high-level summary of Cloud Computing, targeted at Executives who find themselves bombarded with Cloud Computing...
  • Pdf_16x16 20 pages
  • lustratusrepama published this 2 days ago
  • 36 reads
  • 0 comments

Apache Pig -- Pittsburgh Hug

Presentation on Apache Pig for the Pittsburgh Hadoop User Group. Intro to language, join algorithm descriptions, upcoming features, pie-in-the-sky research ideas.
  • Powerpoint_16x16 30 pages
  • squarecog published this 11/03/2009
  • 119 reads
  • 0 comments

Hadoop Architecture

Hadoop, flexible and available architecture for large scale computation and data processing on a network of commodity hardware.
  • Pdf_16x16 34 pages
  • PhilippeJulio published this 10/31/2009
  • 107 reads
  • 0 comments

Business Intelligence Value

Business Intelligence and data warehousing architecture. SAP, Oracle, Greenplum, MySQL, Hadoop...
  • Pdf_16x16 17 pages
  • PhilippeJulio published this 10/31/2009
  • 134 reads
  • 0 comments

Security Compatibility Hadoop World 2009

Owen O'Mailley's slides from Hadoop World.
  • Pdf_16x16 16 pages
  • squarecog published this 10/04/2009
  • 185 reads
  • 0 comments

Cloud Computing - Market Landscape - REV 1 (0.93)

This presentation from Lustratus REPAMA presents a market segmentation model/taxonomy for cloud computing. It includes the infrastructure as a service, platform as a service and software as a servi...
  • Pdf_16x16 48 pages
  • lustratusrepama published this 09/20/2009
  • 378 reads
  • 0 comments

Information Platforms and the Rise of the Data Scientist

My chapter from O'Reilly's book, "Beautiful Data": http://oreilly.com/catalog/9780596157111/.
  • Pdf_16x16 12 pages
  • jhammerb published this 08/14/2009
  • 257 reads
  • 0 comments

Hadoop Operations: Managing Big Data Clusters

Do you manage big data for hungry users? Do you, or have you considered, using Hadoop? Come join the Cloudera team as we show you the tips and tricks we use to manage some of the largest Hadoop clu...
  • Pdf_16x16 59 pages
  • BestTechVideos published this 06/28/2009
  • 1,462 reads
  • 0 comments

HBase Goes Realtime

  • Pdf_16x16 21 pages
  • kovyrin published this 06/24/2009
  • 874 reads
  • 0 comments

Pig: Web Scale Data Processing

  • Pdf_16x16 32 pages
  • kovyrin published this 06/23/2009
  • 594 reads
  • 0 comments

Interpreting the Data: Parallel Analysis with Sawzall

Very large data sets often have a flat but regular structure and span multiple disks and machines. Examples include telephone call records, network logs, and web document repositories. These large ...
  • Pdf_16x16 33 pages
  • kpumuk published this 05/12/2009
  • 415 reads
  • 0 comments

Hadoop Benchmark Results 2009 - Winning a 60 Second Dash with a Yellow Elephant

  • Pdf_16x16 9 pages
  • hoisie published this 05/11/2009
  • 1,544 reads
  • 0 comments

Map Reduce using Cascading

Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.
  • Pdf_16x16 22 pages
  • parthpatil published this 05/11/2009
  • 130 reads
  • 0 comments

Benchmarks Sigmod09

  • Pdf_16x16 14 pages
  • zzztimbo published this 04/16/2009
  • 280 reads
  • 0 comments

Sigmod'08: Pig Latin: A Not-So-Foreign Language For Data Processing

  • Powerpoint_16x16 27 pages
  • kovyrin published this 04/10/2009
  • 643 reads
  • 0 comments

Hadoop Summit 2008: Pig: Web Scale Data Processing

Christopher Olston and many others Yahoo! Research
  • Pdf_16x16 15 pages
  • kovyrin published this 04/10/2009
  • 715 reads
  • 0 comments

Pig Latin: A Not-So-Foreign Language for Data Processing

There is a growing need for ad-hoc analysis of extremely large data sets, especially at internet companies where innovation critically depends on being able to analyze terabytes of data collected e...
  • Pdf_16x16 12 pages
  • kovyrin published this 04/09/2009
  • 595 reads
  • 0 comments

Hadoop Training #6: Introduction to Hive

Hive is a powerful data warehousing application built on top of Hadoop which allows you to use SQL to access your data. This lecture will give an overview of Hive and the query language. This lectu...
  • Pdf_16x16 27 pages
  • kpumuk published this 03/16/2009
  • 1,306 reads
  • 0 comments

Hadoop Training #5: MapReduce Algorithm

After we've introduced you to the tools, it's time to learn how to use them efficiently. Algorithms designed for running on MapReduce look a little different than those you've written before. We'll...
  • Pdf_16x16 31 pages
  • kpumuk published this 03/16/2009
  • 1,951 reads
  • 0 comments
1 2 Next >