
SOLR - Enterprise Search Platform based on Lucene.

Competition - Elasticsearch, Vivisimo / Watson, Exalead, etc.
Marketing (developing tools to help marketing teams in SEO and SEM efforts)
Integrating with APIs (like Google Maps APIs etc.) for Android, iOS and Windows / Mobile Apps for m-commerce
Customer cohorts
Multivariate Testing
Web Services, SOA (Service Oriented Architecture), Graph Databases, REST API
Google Analytics / Web Analytics
Marketing Analytics: Spend optimization, channel ROI
Product analytics: User behavior, A/B testing, Design optimization
Category analytics: Performance scorecards for categories, brands and merchants
Large scale Data Processing: MapReduce, Hadoop, Hive
Programming Languages: Java, JSP / JSTL, Servlets, C/C++, ASP.NET, SharePoint, MVC architecture (Spring 3.0 & Spring MVC), Hadoop
Database: SQL Server 2012
Application Servers: Understanding of Tomcat, IIS Server
Apache Storm: Stream data engine - deals with each event individually - uses Apache ZooKeeper as the coordinator and does not depend directly on Hadoop clusters
Apache Kafka: Messaging system
Apache Spark: In-memory distributed data analysis platform - near real time, processes data as batches of events
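The difference between handling each event individually (the Storm entry) and handling batches of events (the Spark entry) can be sketched in plain Python. This is only an illustration under stated assumptions: the function names are hypothetical, no Storm or Spark API is used, and real engines typically form batches by time interval rather than by a fixed count.

```python
from typing import Iterable, Iterator, List


def per_event(events: Iterable[int]) -> Iterator[int]:
    """Per-event style: emit one result as soon as each event arrives."""
    for event in events:
        yield event * 2


def micro_batches(events: Iterable[int], batch_size: int) -> Iterator[List[int]]:
    """Group events into fixed-size batches (assumption: count-based, not time-based)."""
    batch: List[int] = []
    for event in events:
        batch.append(event)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # flush the final, possibly smaller, batch
        yield batch


def per_batch(events: Iterable[int], batch_size: int) -> Iterator[int]:
    """Micro-batch style: emit one aggregated result per batch."""
    for batch in micro_batches(events, batch_size):
        yield sum(event * 2 for event in batch)


events = [1, 2, 3, 4, 5]
per_event_results = list(per_event(events))      # one output per event
per_batch_results = list(per_batch(events, 2))   # one output per batch of up to 2 events
```

With per-event handling, each result is available as soon as its event arrives. With micro-batching, a result has to wait until its batch closes, which is why the notes call the batch approach near real time.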
HBase / Cassandra: Non-relational distributed DBs used for sparse data. Both are NoSQL. Cassandra has a query language, CQL, that is modeled on SQL. Both are column-oriented (wide-column) DBs
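To show why a column-oriented layout suits sparse data, here is a minimal sketch that models a table as a map from row key to only the columns that row actually has. The class and method names are hypothetical and are not the HBase or Cassandra API; the point is only that absent cells take no storage.

```python
from collections import defaultdict
from typing import Dict, Optional


class SparseTable:
    """Toy wide-column table: each row stores only the columns it has."""

    def __init__(self) -> None:
        self._rows: Dict[str, Dict[str, str]] = defaultdict(dict)

    def put(self, row_key: str, column: str, value: str) -> None:
        """Write one cell; the row is created on first write."""
        self._rows[row_key][column] = value

    def get(self, row_key: str, column: str) -> Optional[str]:
        """Read one cell; a missing row or column returns None."""
        return self._rows.get(row_key, {}).get(column)

    def stored_cells(self) -> int:
        """Count the cells actually stored (absent cells cost nothing)."""
        return sum(len(columns) for columns in self._rows.values())


table = SparseTable()
table.put("user:1", "email", "a@example.com")
table.put("user:2", "phone", "555-0100")
table.put("user:2", "city", "Pune")
```

Here two rows share three possible columns, but only the three written cells are stored. A fixed relational schema would instead carry a NULL for every missing value.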
Oozie: Job scheduling / workflow scheduler
Hue: Visualization tool / web interface for Hadoop data
Aerospike: In-memory NoSQL DB
Pentaho: DW / BI tool
Vertica: Columnar MPP analytics DB, in the same space as HANA (in-memory), Netezza, Greenplum, Teradata
Graph DB Vendors: Neo4j, FlockDB by Twitter, GraphDB, Oracle Spatial and Graph, Teradata Aster
Document Oriented DB: Lotus Notes, MongoDB
Key Value Stores :
