Data engineering company focused on building solutions that create value from clients' data.
FOCUS ON CLOUD, BIG DATA AND EVENT DRIVEN ARCHITECTURES
TRANSFORMING BUSINESS EXPERTISE INTO MEASURABLE IMPACT
PROVEN PROJECT METHODOLOGY AND FAST RESULTS
CERTIFIED IN DATA INTEGRATIONS, BIG DATA & DATA WAREHOUSES
VENDOR AGNOSTIC APPROACH
PROTOTYPE
Automation level
Quick iteration to prove value
Value proof in real business situation
Stabilize code, extend reach (UI / interface)
BAM, monitor closely
Transfer insights into IT system
OVERVIEW OF SERVICES FOR ANALYTICS

| Category | AWS | Azure | Google Cloud | Open source |
|---|---|---|---|---|
| Batch Ingest | AWS Data Transfer Services (various options) | Import/Export Service; Data Factory | Cloud Dataflow | Sqoop; File Transfer; Flume |
| Persistent Storage | S3, Glacier; RDS | Storage Blob; Data Lake Store; SQL Database | Persistent Disk; Google Cloud Storage; Cloud SQL | HDFS; RDBMS |
| Transient Storage | Kinesis | Event Hubs; IoT Hub; HDInsight (Kafka) | Cloud Pub/Sub; Cloud IoT Core | Kafka |
| Batch Processing | EMR Spark; EMR Hadoop; EMR Presto; AWS Batch; Redshift | Azure Batch; HDInsight (Spark/MapReduce); SQL Data Warehouse; Data Lake Analytics; Azure Functions | Cloud Dataflow (open source Apache Beam); Cloud Dataproc (Spark, Hadoop) | Hive; Flink, Spark; MapReduce; PostgreSQL |
| Stream Processing | Amazon Kinesis Streams; Amazon Kinesis Analytics; EMR Spark | Stream Analytics; HDInsight (Storm, Spark) | Cloud Dataflow (open source Apache Beam); Dataproc (Spark, Hadoop) | Flink; Spark; Beam |
| Machine Learning | Lex; Polly; Rekognition; Amazon Machine Learning | Azure ML; Cognitive Services | Natural Language; Speech; Translation; Vision; Video; TensorFlow; ML Engine | Scikit; TensorFlow; Spark MLlib; etc. (huge number of libraries) |
| Serving Storage Graph | Neptune | Cosmos DB | N/A | JanusGraph |
| Serving Storage BI/EDW | Redshift; Athena | SQL Data Warehouse; Analysis Services (OLAP Cubes) | BigQuery | Impala + Kudu |
| Serving Storage Search (keywords + …) | Amazon CloudSearch; Amazon Elasticsearch | Azure Search | N/A (Marketplace, e.g. Solr) | Solr |
| Serving Storage RDBMS | RDS | SQL DB | Cloud SQL | PostgreSQL |
| Serving Storage NoSQL | DynamoDB | HDInsight (HBase); Cosmos DB | Bigtable; Spanner; Datastore | HBase |
| Sandboxes Notebook | EMR Zeppelin | Azure Notebooks | Cloud Datalab | Zeppelin |
| Sandboxes Data Science or … | N/A (Marketplace only, e.g. Dataiku DSS) | N/A (Marketplace only, e.g. Dataiku DSS) | Cloud Dataprep (Trifacta) | Dataiku DSS Community Edition (not open source) |
| Clients/Data Apps | QuickSight | Power BI | Google Data Studio | Superset (BI) |
| Orchestration | AWS Data Pipeline | Data Factory | N/A (Marketplace) | Airflow |
| ETL Tool | AWS Glue | Data Factory | N/A (Marketplace) | N/A |
| MDM Hub | N/A (Marketplace) | N/A (Marketplace) | N/A (Marketplace) | N/A |
| Lineage | AWS Glue | N/A | N/A | N/A |
| Catalog | AWS Glue Data Catalog | N/A (Marketplace) | N/A | |
Source: https://www.whizlabs.com/blog/big-data-and-cloud-computing/
Copyright © 2018 Syntio. All Rights Reserved.
KEY QUESTIONS
marko.krajnik@syntio.hr