• https://streamsets.com/
– Data Collector. Open source. https://streamsets.com/products/dataops-platform/open-source/
– Control Hub. https://streamsets.com/try-dataops/control-hub-trial/
▪ Jobs
▪ Repository
▪ Scheduler
▪ Security
▪ Topologies
• Similar tools
– Apache NiFi
– Apache Storm
– Apache Flink
– Apache Spark
– Amazon Kinesis, Google Dataflow …
• Concepts:
– Offsets
– Consumer groups
– Partitions
– Replication factor
– Retention policy
– See commands: show topic, partitions, reset offsets, etc.
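To make offsets, partitions and consumer groups concrete, here is a minimal in-memory sketch. It is a toy model, not a real client API: the class `ToyTopic`, its methods, and the group names are hypothetical, and `crc32` only stands in for whatever key hashing a real broker client uses. Replication factor and retention policy are not modelled.

```python
from collections import defaultdict
from zlib import crc32


class ToyTopic:
    """In-memory stand-in for a topic: a fixed number of append-only partitions."""

    def __init__(self, num_partitions):
        self.partitions = [[] for _ in range(num_partitions)]
        # committed[group][partition] -> next offset that group will read
        self.committed = defaultdict(lambda: defaultdict(int))

    def produce(self, key, value):
        # Records with the same key always land in the same partition
        # (crc32 is an assumption used here for a deterministic hash).
        p = crc32(key.encode("utf-8")) % len(self.partitions)
        self.partitions[p].append(value)
        return p, len(self.partitions[p]) - 1  # (partition, offset of new record)

    def poll(self, group, partition, max_records=10):
        # Read from the group's committed offset, then advance (commit) it.
        start = self.committed[group][partition]
        records = self.partitions[partition][start:start + max_records]
        self.committed[group][partition] = start + len(records)
        return records

    def reset_offsets(self, group, partition, offset=0):
        # Rewind (or skip ahead) so the group reads again from `offset`.
        self.committed[group][partition] = offset


topic = ToyTopic(num_partitions=3)
p, first = topic.produce("sensor-1", "t=20.1")
_, second = topic.produce("sensor-1", "t=20.4")

print(topic.poll("billing", p))    # both records
print(topic.poll("billing", p))    # nothing new for this group
print(topic.poll("analytics", p))  # a different group keeps its own offsets
topic.reset_offsets("billing", p)
print(topic.poll("billing", p))    # replayed from offset 0
```

The point of the sketch is that an offset is just a per-group, per-partition position in an append-only log: reading does not delete records, so a second group or a reset offset sees the same data again.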
• See https://www.confluent.io/blog/apache-kafka-vs-enterprise-service-bus-esb-friends-enemies-or-frenemies/