Introduction简介HBase is a high-performance, distributed NoSQL key-value store built on top of Hadoop Distributed File System (HDFS). It is designed for high-throughput, low-latency read/write access to large datasets and is implemented in Java. HBase is optimized for real-time data...

Chapter 4第 4 章Hive OperatorsHive 运算符Operators in Hive are special symbols or keywords used in expressions to perform operations on data values (columns, literals, etc.). Hive 中的运算符是用于在表达式中对数据值(列、字面量等)执行操作的特殊符号或关键字。 They are categorized by their purpose — arithmetic, comparison, logical, relational,...

SELECT Statement Syntax:SELECT 语句语法:12345678SELECT [ALL | DISTINCT] select_expr, select_expr, ...FROM table_reference[WHERE where_condition][GROUP BY col_list [HAVING condition]][ CLUSTER BY col_list| [DISTRIBUTE BY col_list] [SORT BY| ORDER BY col_list]][LIMIT number] ALL and DISTINCT clausesALL 和...

Chapter 1第 1 章Topic: a topic is like a category/an index, it groups messages together. Topic:主题就像一个类别或索引,它将消息分组在一起。 Producer: processes that push messages to Kafka topics. Producer:向 Kafka 主题推送消息的进程。 Consumer: processes that consume messages from Kafka topics. Consumer:从 Kafka...

Chapter 6第 6 章Stream is an abstraction representing an unbounded dataset. Unbounded means infinite and ever growing. 流(Stream)是一个代表无界数据集的抽象。无界意味着无限且不断增长。 Event streams are ordered, each event is immutable in event streams and Event streams are replayable. 事件流是有序的,事件流中的每个事件都是不可变的,并且事件流是可重放的。 In Kafka...

Chapter 1第 1 章Topic: a topic is like a category/an index, it groups messages together. Topic:主题就像一个类别或索引,它将消息分组在一起。 Producer: processes that push messages to Kafka topics. Producer:向 Kafka 主题推送消息的进程。 Consumer: processes that consume messages from Kafka topics. Consumer:从 Kafka...

Chapter 2第 2 章Kafka Console Tool: - Kafka offers command-line tools to manage topics, consumer groups, to consume and publish Messages and so forth. Kafka 控制台工具:- Kafka 提供命令行工具来管理主题、消费者组,以及消费和发布消息等。 Kafka console scripts are different for Unix-based and Windows platforms. Kafka 控制台脚本在基于 Unix 和...

Chapter 3第 3 章Kafka producers are used to write messages to Kafka. Kafka 生产者用于向 Kafka 写入消息。 ProducerConfig class helps to set configuration for the Kafka Producer. ProducerConfig 类有助于设置 Kafka 生产者的配置。 The buffer.memory controls the total amount of memory available to the producer for...

Chapter 4第 4 章Kafka is comparable to traditional messaging systems such as P2P, ActiveMQ. Kafka 可与传统的各种消息系统(如 P2P、ActiveMQ)相媲美。 Once the consumer subscribes to topics, the poll loop handles all details of A. coordination, B. partition rebalances, C. heartbeats. 一旦消费者订阅了主题,poll 循环就会处理 A. 协调,B....

Kafka Chapter 5Kafka 第 5 章Kafka provides a command line utility called bin/kafka-topics.sh to operate topics on the Kafka server. Kafka 提供了一个名为 bin/kafka-topics.sh 的命令行工具,用于操作 Kafka 服务器上的主题。 The replication factor for a topic controls how many servers will replicate each message that is...