[Cloud Computing] Kafka
What is the paper trying to do? The paper is trying to introduce a novel messaging system for log processing, called Kafka. Kafka is a combination of the benefits of traditional log aggregator and messaging systems. Kafka is useful in processing huge volumes of log data streams, such as a messaging system. It is able to achieve higher throughput than other messaging systems because it focuses on log processing applications. There is a also integration for distributed support and can scale out. What do you think is the contribution of the paper? I think the main contribution of Kafka is the decisions the developer made to implement Kafka. For example, to make the system efficient, Kafka has a very simple storage layout, where each partition of a topic corresponds to a logical log. Another interesting decision is by making the consumer keep track of how much each consumer has consumed, as opposed to traditional methods where it is done by the broker. This reduces a lot of complexity...