课程名称:1.3 A Brief History About the TiDB database platform(TiDB 发展简史)
学习时长:8min
课程收获: TiDB 的发展历史
课程内容: TiDB;HTAP;数据中台;TiSpark;TiFlash
1,1.0.0 GA版本得特性
A freely scalable (computing, storage) database
Compatible with MySQL syntax and protocol
Transparent Data Spliting Policy - Range Spliting
Strongly consistent, distributed transaction support
TiDB 1.0.0原始架构
具备很强得数据中台能力
通过coprocessor再tikv中进行聚合,生成报表
此时的TiDB存在如下问题:
TP Scenarlo:
CUSTOMER: There are still some problems though… Smells good!
AP Scenario:
Client 1: Complex statements are so slow!
Client 2:Always OOM!
Client 3: Can’t integrate with a big data platform!
于是TiSpark诞生了
TiSpark的优点:
Spark helps us do distributed computing:
A mature distributed computing platform
Faster (?), more stable (?).
Complete succession to the Apache Spark ecosystem:
Painlessly integrating into the big data ecosystem
Scripting, Python,R,Apache Zeppelin,Hadoop…
TiSpark的缺点:
Apache Spark can only provide low concurrency computation:
Heavy computational model and high resource consumption
Better for Reports and Heavyweight Adhoc Queries
Users still need high concurrency, small to medium-sized AP capacity in many situations:
Complex query capability with low consumptiono
TiDB is far simpler to maintain than Spark clusters.
为此引入了TiFlash:
至此为目前TiDB的整体架构: