Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Download
Standard view
Full view
of .
Save to My Library
Look up keyword
Like this
1Activity
0 of .
Results for:
No results containing your search query
P. 1
Democratic Big Data Drupal

Democratic Big Data Drupal

Ratings: (0)|Views: 356 |Likes:
Bonita

Drupal

Aegir BOA

Cloudera

Solr

Nutch

Ubuntu
Bonita

Drupal

Aegir BOA

Cloudera

Solr

Nutch

Ubuntu

More info:

Categories:Types, Research
Published by: Permaculture Cooperative on May 28, 2013
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less

05/28/2013

pdf

text

original

 
 
1
DEMOCRATIC BIG DATA DRUPALPROOF-OF-CONCEPT
May 27, 2013
AUTHOR: NICHOLAS ROBERTS
OVERVIEW
1. Problem Description
PROBLEM: Big Data is a code-word for Big Business & Big Brother. The tools are open-source but the knowledge islargely locked in massive multinational corporations and national government data centers. Maintenance up thesystem stack and across the workflow network is obscured by esoteric command-line.
2. Project Scope
 
Proof-of-concept
: live, web-based, self-hosted working proof-of-concept system.Next stage is to put test system into live servers at cloud host such as Linode or Amazon.
3. High-Level Requirements
Commercial Open Source:
free software, download with potential enterprise support 
The new system must include the following:
  Ability to allow both internal and external users to access the application without downloading any software
  Ability to integrate with existing commercially supported stacks for business process modelling, big dataprocessing, web crawling, search indexing & web content management system hosting
  Ability to incorporate automated routing and notifications based on business rules
4. Deliverables
Proof-of-concept
: live, web-based, self-hosted working proof-of-concept system.Commercial Open-Source Stacks included;
 Ubuntu Precise 12.04.2:Host physical server, 4 node Hadoop/Mapreduce server cluster, web-server 
 
 
2
 VMWare Workstation 9:virtualization software running clone Ubuntu 12 guest cluster 
 Cloudera 4 Free Edition:big data stack (Apache; Hadoop, Mapreduce, Impala, Pig, Hive, Hue, Zookeeper etc)
 Bonita Open Solution:business process modelling & script task automation
 BOA Aegir ;Drupal hosting turn-key system with Nginx Drupal & Jetty Solr search engines
5. Stake-holders
 
Who should be interested?
: search-driven domain-specific web portal developers
Web-hosting companies & organizations, especially Drupal
Developers of search-driven domain-specific web-portals
Big data developers & administrators
Business process modelling, automation & integration specialists
 
 
3
6. Business Systems
Business processes & systems:
search-driven domain-specific web portal developers
6.2 Business Systems
Bonita Open Solution Studio 

You're Reading a Free Preview

Download
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->