[TiDB 4.0 PCTA Study Notes] 1.3 A Brief History About the TiDB Database Platform @ Class 4 + 袁童

Course name: 1.3 A Brief History About the TiDB Database Platform

Study time: 8 min

Key takeaway: the development history of TiDB

Course content: TiDB; HTAP; data middle platform; TiSpark; TiFlash

1. Features of the 1.0.0 GA release
A freely scalable database (both computing and storage)
Compatible with MySQL syntax and protocol
Transparent Data Splitting Policy - Range Splitting
Strongly consistent, distributed transaction support
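
To make "range splitting" concrete, here is a minimal toy sketch in Python. It is not TiDB or TiKV code: the class name RangeSharder, the max_keys_per_region threshold, and the split-at-the-midpoint rule are all assumptions made for illustration. The idea it shows is that the sorted keyspace is cut into contiguous ranges, and a range is split in two when it grows too large, without the client having to know.

```python
from bisect import bisect_right, insort


class RangeSharder:
    """Toy model of range-based splitting (hypothetical, not TiKV's implementation)."""

    def __init__(self, max_keys_per_region=4):
        self.max_keys = max_keys_per_region
        # split_points[i] is the first key of region i + 1; region 0 starts at the lowest key.
        self.split_points = []
        # Each region is a sorted list of the keys it currently holds.
        self.regions = [[]]

    def region_index(self, key):
        # Binary search over the split points finds the region whose range contains the key.
        return bisect_right(self.split_points, key)

    def put(self, key):
        idx = self.region_index(key)
        region = self.regions[idx]
        if key in region:
            return
        insort(region, key)
        # Split transparently once the region exceeds the size threshold.
        if len(region) > self.max_keys:
            self._split(idx)

    def _split(self, idx):
        region = self.regions[idx]
        mid = len(region) // 2
        left, right = region[:mid], region[mid:]
        self.regions[idx] = left
        self.regions.insert(idx + 1, right)
        # The first key of the right half becomes the new range boundary.
        self.split_points.insert(idx, right[0])


if __name__ == "__main__":
    sharder = RangeSharder(max_keys_per_region=4)
    for i in range(10):
        sharder.put(f"k{i:02d}")
    print(sharder.split_points)
    print(sharder.regions)
```

Because keys stay sorted inside contiguous ranges, a range scan only has to touch the regions whose boundaries overlap the scan, which is the main design reason for choosing range splitting over hash splitting in this sketch.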

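Original architecture of TiDB 1.0.0

It has strong data middle platform capabilities

Aggregation is performed inside TiKV through the coprocessor to generate reports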
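
The coprocessor idea above can be sketched as a two-phase aggregation: each storage node computes a partial result over its own data, and the SQL layer only merges the small partial results. The Python below is a toy model under that assumption; the function names coprocessor_partial_agg and merge_partials and the (group, amount) row shape are hypothetical and are not TiKV's actual coprocessor API.

```python
from collections import defaultdict


def coprocessor_partial_agg(region_rows):
    """Simulates the work done 'inside' one storage node.

    region_rows is an iterable of (group, amount) pairs.
    Returns {group: (partial_sum, partial_count)} for that node's rows only.
    """
    partial = defaultdict(lambda: [0, 0])
    for group, amount in region_rows:
        partial[group][0] += amount
        partial[group][1] += 1
    return {group: tuple(values) for group, values in partial.items()}


def merge_partials(partials):
    """Simulates the SQL layer: merges per-node partial results into final values."""
    total = defaultdict(lambda: [0, 0])
    for partial in partials:
        for group, (part_sum, part_count) in partial.items():
            total[group][0] += part_sum
            total[group][1] += part_count
    # AVG is derived from the merged SUM and COUNT, never from per-node averages.
    return {
        group: {"sum": s, "count": c, "avg": s / c}
        for group, (s, c) in total.items()
    }


if __name__ == "__main__":
    # Three hypothetical regions, each holding part of the table.
    regions = [
        [("a", 10), ("b", 5)],
        [("a", 30)],
        [("b", 15), ("b", 10)],
    ]
    report = merge_partials(coprocessor_partial_agg(rows) for rows in regions)
    print(report)
```

The point of the design is that only one small tuple per group crosses the network from each node instead of every row, which is why this style of pushdown suits report generation.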

At this point TiDB had the following problems:
TP Scenario:
CUSTOMER: There are still some problems though… Smells good!

AP Scenario:
Client 1: Complex statements are so slow!
Client 2: Always OOM!
Client 3: Can’t integrate with a big data platform!

And so TiSpark was born

Advantages of TiSpark:
Spark helps us do distributed computing:
A mature distributed computing platform
Faster (?), more stable (?).

Complete succession to the Apache Spark ecosystem:
Painlessly integrating into the big data ecosystem
Scripting, Python, R, Apache Zeppelin, Hadoop…

Disadvantages of TiSpark:
Apache Spark can only provide low-concurrency computation:
Heavy computational model and high resource consumption
Better suited to reports and heavyweight ad hoc queries

Users still need high-concurrency, small to medium-sized AP capacity in many situations:
Complex query capability with low resource consumption
TiDB is far simpler to maintain than Spark clusters.

TiFlash was introduced to address this:



This brings us to the current overall architecture of TiDB:

Other materials referenced during study