database-kernel-develop
database-kernel-develop copied to clipboard
Focus on Database kernel Development, include Basic Skill Content.
📔 Database Kernel develop RoadMap
Author is a database amateur, so the content is not authoritative, welcome to point out my error.
Since the database course is recommended, only good articles, papers or blogs are listed starting from part 3.
List of Knowledge
- List of Knowledge
- 1. Description
-
2. Recommended Basic Course and Books
- Pre-Skill
-
2.1 Computer Organization and Design
- 2.1.1 Courses
- 2.1.2 Books
-
2.2 Computer Operating System
- 2.2.1 Courses
- 2.2.2 Books
-
2.3 Computer Network
- 2.3.1 Courses
- 2.3.2 Books
-
2.4 Data Structure and Algorithmn
- 2.4.1 Courses
- 2.4.2 Books
-
2.5 Algorithms Design and Analysis
- 2.5.1 Courses
- 2.5.2 Books
-
2.6 Database
- 2.6.1 Course
- 2.6.2 Books
- 2.6.3 Papers
-
2.7 Distributed System
- 2.7.1 Courses
- 2.7.2 Books
- 2.7.3 Papers
- 2.7.4 Blogs
- 3. SQL & Relational Algebra
- 4. DDL & DML
- 5. Relational Model
-
6. Storage management
- 6.1 Buffer Pool
- 6.2 Index
- 6.3 B+Tree
- 6.4 B-Tree
- 6.5 Hash table
- 6.6 LSM-Tree
- 6.7 Bw-Tree
- 6.8 Storage engine
- 7. Query Processing
-
8. SQL Parser
- 8.1 Syntax Check
- 8.2 Semantic Check
- 8.3 Shared Pool Check
- 8.4 Actions
- 9. SQL Executor
-
9. SQL Optimization
- 9.1 SQL Query Optimization
- 9.2 Indexes Optimization
- 9.3 Table Optimization
- 9.4 Buffer Cache Optimization
- 9.5 Storage Optimization
-
9.7 table Structure Optimization
- 9.7.1 data reduction
- 9.7.2 data partition
- 9.8 Optimizer
- 9.9 SQL调优博客List
- 10. Transaction management
- Lock manager
- 11. Network
- 12. Serialization
- 13. Concurrency Control
- 14. Crash Recovery management
- 15. NoSQL
- 16. NewSQL
- 17. Distributed & Paralleled
-
15. OLAP、OLTP、HTAP
- 15.1 OLAP
- 15.2 OLTP
- 15.3 HTAP
- 15.4 HSAP
-
16. Graph Database
- 16.1 Courses
- 16.2 Books
- 16.3 Papers
- 16.4 Blogs
- 16. Project Source Code Analysis
- 17. Mini-Project Labs
- 18. AI4DB and DB4AI (frontier tech)
- Database Retrospective
1. Description
This repo will not include knowledge about basic programming language. it mainly focus on direction of database kernal development.
The tutorial will include content:
- Courses
- Blogs
- Papers
- Activities
- Talks
Ok, The repo include two language content:
- English Content
- Chinese Content.
If you have any good materials, welcome to submit a pull request for this repo.
Finally, I would like to thank the professor who provided the Course resource, papper resource Etc. Also thanks the blogger who provided the study notes. Thanks to all content resource providers.
2. Recommended Basic Course and Books
I will not update course notes and labs.
Pre-Skill
- MIT The Missing Semester of Your CS Education
- C/C++
- Go
- Rust
You only need to know a programming language.
2.1 Computer Organization and Design
2.1.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
CMU | 15-213/14-513/15-513 | Intro to Computer Systems(ICS) | Fall 2015 | CS:APP3e |
UC Berkeley | CS61C | Great Ideas in Computer Architecture (Machine Structures) | Spring 2017 | N/A |
UC Berkeley | CS 152/252A | Computer Architecture and Engineering | Spring 2022 | N/A |
N/A | N/A | Crash Course Computer Science | bilibili地址 |
2.1.2 Books
- Reference Course Resources
2.2 Computer Operating System
2.2.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
Stanford | CS110 | Principles of Computer Systems | Winter 2022 | |
MIT | 6.828 | Operating System Engineering | Fall 2018 | |
UC Berkeley | CS162 | Operating Systems and System Programming | Fall 2020 | |
MIT | 6.033 | http://web.mit.edu/6.033/www/index.shtml | N/A | covers four units of technical content: operating systems, networking, distributed systems, and security |
2.2.2 Books
- Reference Course Resources
2.3 Computer Network
2.3.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
Stanford | CS144 | Introduction to Computer Networking | Fall 2021 | |
CMU | CS 15-744 | Computer Networks | Spring 2018 | |
中科大USTC | N/A | 计算机网络 | N/A | undergraduate stage |
中科大USTC | N/A | 高级计算机网络 | N/A | postgraduate stage |
2.3.2 Books
- Reference Course Resources
2.4 Data Structure and Algorithmn
2.4.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
UC Berkeley | CS 61B | Data Structures | Fall 2020 | |
CMU | CS 15-122 | Principles of Imperative Computation | Spring 2023 | Past Courses Page 404 |
CMU | CS 15-121 | Introduction Data Structures | Spring 2018 |
2.4.2 Books
- Reference Course Resources
2.5 Algorithms Design and Analysis
2.5.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
UC Berkeley | UC Berkeley | Efficient Algorithms and Intractable Problems | ||
MIT | 6.006 | Introduction to Algorithms | Spring 2020 | Introduction to Algorithmns |
CMU | CS 15-451/651 | Algorithm Design and Analysis | Fall 2021 | |
CMU | CS 15-850 | Advanced Algorithms | Fall 2020 |
2.5.2 Books
- Reference Course Resources
2.6 Database
2.6.1 Course
University | ID | Name | Time | Comment |
---|---|---|---|---|
CMU | CS 15-445/645 | Database Systems | Fall 2022 | Instructor: Andy Pavlo |
UC Berkeley | CS 186 | Introduction to Database Systems | Spring 2022 | |
Pennsylvania | CMPSC 431W | Database Management Systems | Fall 2015 | |
Stanford | CS346 | Database System Implementation | Spring 2015 | |
uwaterloo | CS 448/648 | Database Systems Implementation | Winter 2009 | |
CMU | CS 15-721 | Advanced Database Systems | Spring 2020 | Instructor: Andy Pavlo |
CMU | CS 15-799 | Special Topics: Self-Driving Database Management Systems | Spring 2022 | Instructor: Andy Pavlo |
uwaterloo | CS 856 | Distributed data management fundamentals (architectures, data placement, query optimization) Distributed transaction processing, concurrency control, recovery, interoperability |
Fall 2002 | Only need these two slides, nothing else |
N/A | N/A | Let's Build a Simple Database: Writing a sqlite clone from scratch in C | Thanks to cstack | |
Stanford | CS 345 | Topics in Database Management Systems | Winter 2014 | Paper Readings topic, similar to CMU CS15-721 |
2.6.2 Books
-
fundamentals of database systems 7th edition solutions, by Ramez Elmasri, Shamkant B. Navathe
-
Database System Concepts Seventh Edition, by Silberschatz, Korth and Sudarshan 中文版:《数据库系统概念》
-
Database Management Systems, by Ramakrishnan and Gehrke 中文版:《数据库管理系统原理与设计》
-
Database Internals A Deep-Dive into How Distributed Data System Work, by Alex Petrov 中文版:《数据库系统内幕》
-
Designing Data-Intensive Applications, by Martin Kleppmann
-
Database Design and Implementation, by Edward Sciore
-
Database System Implementation, by Hector Garcia-Molina, Jeff Ullman, and Jennifer Widom 中文版:《数据库系统实现》
- This book has been replaced by Database Systems: The Complete Book
-
Transaction Processing Concepts and Techniques by Jim Gray
-
Principles of Distributed Database Systems, by M. Tamer Özsu and Patrick Valduriez 中文版:《分布式数据库系统原理》
-
nosql distilled a brief guide to the emerging world of polyglot persistence, by Pramod J. Sadalage and Martin Fowler 中文版:《NoSQL精髓》
-
High Performance MySQL Third Edition, by Baron Schwartz, Peter Zaitsev, Vadim Tkachenko 中文版:《高性能MySQL》
-
MySQL High Availability: Tools for Building Robust Data Centers, by CharlesBell,MatsKindahl,LarsThalmann 中文版:《高可用MySQL:构建健壮的数据中心》
2.6.3 Papers
-
CS 15-721 Topics Papers
-
Stanford CS 345 Winter2014 Topic Paper
-
Paper 2017 : How to Build a Non-Volatile Memory Database Management System
-
Aticle : Main Memory Database Systems
-
Paper thesis 2018 : The Design and Implementation of a Non-Volatile Memory Database Management System
-
harvard Aticle : The Design and Implementation of Modern Column-Oriented Database Systems
2.7 Distributed System
This Section is for distributed database.
2.7.1 Courses
University | ID | Name | Time | Comment |
---|---|---|---|---|
MIT | 6.824 | MIT Distributed Systems | Spring 2021 | Golang |
N/A | N/A | Distributed Systems | ||
Columbia University | COMS 4113 | Distributed Systems Fundamentals | ||
CMU | CS 15-440 | CMU Distributed Systems | ||
Princeton | COS 418 | Princeton Distributed Systems | Fall 2019 | Golang |
Columbia University | Advanced Distributed Systems | Research Papers | ||
MIT | 6.852 | MIT Distributed Algorithms | ||
ETHZ | N/A | Principles of Distributed Computing (lecture collection) | ||
Stanford | CS244b | Distributed Systems | Spring 2020 | |
Washington | CSE 490H | Distributed Systems | Autumn 2010 |
2.7.2 Books
2.7.3 Papers
-
Reference :
- http://reiddraper.github.io/distreader/
- MIT6.824 : http://nil.csail.mit.edu/6.824/2015/schedule.html
- http://henryr.github.io/distributed-systems-readings/
-
Causality
- Paper 1978 : Time, Clocks, and the Ordering of Events in a Distributed Syste
- Paper 2008 : Interval Tree Clocks: A Logical Clock for Dynamic Systems , ITC2008
-
Consistency
-
Consensus
- consensus-bridging-theory-and-practice
- Raft Refloated: Do We Have Consensus?
- The Part Time Parliament
- Memory Coherence in Shared Virtual Memory Systems
- TreadMarks: Distributed Shared Memory on Standard WorkStations and Operating Systems
- Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System
- Resolving File Conflicts in the Ficus File System
- Consistency Analysis in Bloom: a CALM and Collected Approach
- Paper 2009 : Object Storage on CRAQ High-throughput chain replication for read-mostly workloads , USENIX2009
- Distributed SQLite: a replicated SQLite service powered by RAFT
-
Data Stores
- Simplified Harp File System
- Google File System 2.0: A Modern Design and Implementation
- Replication in the Harp File System
- Simplified GFS
- Chunky: a distributed GFS-based file store
- Append-only Datastore
- Key-Value Store Using Chain Replication
- ACID Compliant Distributed Key-Value Store
- RAFT based Key-Value Store with Transaction Support
-
General
- Paper 2010 : The Declarative Imperative: Experiences and Conjectures in Distributed Logic , SIGMOD2010
- Paper 2004 : MapReduce: Simplified Data Processing on Large Clusters , OSDI2004
- Paper : Availability in Globally Distributed Storage Systems
-
Fault Tolerance
- Remus: High Availability via Asynchronous Virtual Machine Replication
- Paxos Made Simple , ACM SIGACT News 2001
- In Search of an Understandable Consensus Algorithm
- Harvest, Yield, and Scalable Tolerant Systems
- Paxos Made Moderately Complex
- Distributed Multi-Agent Consensus for Fault Tolerant Decision Making
-
Atomicity
-
Application
- Paper 2003 : The Google File System , SOSP2003
- Paper 2003 : Bigtable: A Distributed Storage System for Structured Data , OSDI2006
- Paper 2007 : Dynamo: Amazon’s Highly Available Key-value Store , SOSP2007
- Peer-to-Peer Note Taking App
- factoryOS: A Distributed and Self-Organizing Planning System in a Supply-Chain Context
- Megastore: Providing Scalable, Highly Available Storage for Interactive Services , CIDR 2011
- PNUTS: Yahoo!’s Hosted Data Serving Platform , VLDB 2008
2.7.4 Blogs
-
Oracle7 Server Concepts Manual : Distributed Databases
-
University of Cambridge : Distributed Systems
-
分布式系统学习资料汇总 , Blogger : 木鸟杂记
-
分布式系统(Distributed System)资料 , Github Blogger : ty4z2008
3. SQL & Relational Algebra
-
SQL 知识点:Book《Database System Conceptsm》Chapter3、Chapter4、Chapter5
-
Relational Algebra 知识点:Book《Database System Conceptsm》Chapter6
-
w3cSchools SQL: https://www.w3schools.com/sql/
-
slide : Relational Algebra and SQL
4. DDL & DML
-
DDL : https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language
-
DML : https://cloud.google.com/bigquery/docs/reference/standard-sql/dml-syntax
5. Relational Model
6. Storage management
CS 15-445 课程 Lecture03、Lecture04
-
Microsoft Research Paper : Faster: A Concurrent Key-Value Store with In-Place Updates, SIGMOD2018
-
How Lock-free Data Structures Perform in Dynamic Environments: Models and Analyses
6.1 Buffer Pool
-
CS 15-445 课程 Lecture05
6.2 Index
-
wikiPedia : Database index
-
Google Cloud docs : Indexes
6.3 B+Tree
-
wikiPedia : B+ tree
-
CS 186 Spring notes pdf : B+ tree
-
University of Utah sliede : Database Systems Index: B+ Tree
-
Paper 2004 : Query and Update Efficient B-Tree Based Indexing of Moving Object
6.4 B-Tree
-
课程:《Let's Build a Simple Database》
-
Paper 2010 , Efficient B-tree Based Indexing for Cloud Data Processing, VLDB, National University of Singapore & IBM Watson Research Center
6.5 Hash table
-
wikipedia : Hash table
-
CMU CS15-445 slide : Hash table slide
-
GAtech.edu slide : Hash table & Extendible Hash & Linear Hash
-
Blog : An Introduction to B-Tree and Hash Indexes in PostgreSQL
6.6 LSM-Tree
-
slide : Log Structured Merge Tree, thanks to Pinglei Guo
-
AlibabaCloud Community Blog : Starting from Zero: Build an LSM Database with 500 Lines of Code
6.7 Bw-Tree
-
Paper : The Bw-Tree: A B-tree for New Hardware Platforms , Microsoft Research
-
Lock-free data structures. Inside. Memory management schemes
-
Blog : Bw-Tree技术解读
-
Blog : 微软提出的无锁 B 族树 —— Bw-Tree , 木鸟杂记
6.8 Storage engine
-
MySQL 8.0 docs : Chapter 15 The InnoDB Storage Engine
-
MySQL 8.0 docs : Chapter 16 Alternative Storage Engines
-
Hybrid storage engine for geospatial data using NoSQL and SQL paradigms
-
Oracle docs : Chapter 13. Storage Engines
-
Paper 2021 : Designing a persistent-memory-native storage engine for SQL database systems , , IEEE NVMSA 2021
- Paper : https://nvmsa2021.github.io/paper/NVMSA_2021_Session_5-2_paper.pdf
- Slide : https://nvmsa2021.github.io/slides/NVMSA_2021_Session_5-2_slides.pdf
7. Query Processing
-
Database SQL Tuning Guide : 3 SQL Processing
8. SQL Parser
8.1 Syntax Check
8.2 Semantic Check
8.3 Shared Pool Check
8.4 Actions
9. SQL Executor
9. SQL Optimization
9.1 SQL Query Optimization
-
Top 10 SQL Query Optimization Tips to Improve Database Performance, by Avishek Singh
-
SQL Query Optimization: How to Tune Performance of SQL Queries
-
Oracle docs : 5 Query Optimization
9.2 Indexes Optimization
-
Google Cloud docs : Optimizing Indexes
-
Optimize index maintenance to improve query performance and reduce resource consumption
-
How to create and optimize SQL Server indexes for better performance
9.3 Table Optimization
-
MySQL Optimize Table: How to Keep Your Database Running Smoothly
-
MySQL docs : 8.5 Optimizing for InnoDB Table
9.4 Buffer Cache Optimization
-
Buffer cache: What is it and how does it impact database performance?
-
EXPLAIN (ANALYZE) needs BUFFERS to improve the Postgres query optimization process
9.5 Storage Optimization
-
AWS Storage Blog : Storage for I/O-intensive SQL Server using Amazon EBS io2 Block Express
9.7 table Structure Optimization
9.7.1 data reduction
- MySQL docs : 15.9.1.5 How Compression Works for InnoDB Tables
9.7.2 data partition
9.8 Optimizer
Reference : https://zhuanlan.zhihu.com/p/363997416 , thanks henry liang
-
Paper : Orca: A Modular Query Optimizer Architecture for Big Data , SIGMOD 2014
-
Paper : An Overview of Cost-based Optimization of Queries with Aggregates
-
Paper : Optimizer plan change management: improved stability and performance in Oracle 11g , VLDB
-
Paper : Optimizing Queries over Partitioned Tables in MPP Systems , SIGMOD
-
Paper : Optimization of Common Table Expressions in MPP Database Systems , VLDB
9.9 SQL调优博客List
-
OceanBase SQL 调试优指南 , Alibaba Ant Group
-
PolarDB-X SQL 调优指南 , Alibaba Cloud
-
TiDB SQL性能调优 , PingCAP
-
了解数据库性能优化 , thanks to TiDB
-
MySQL 调优笔记 , Github Blogger : wardseptember
-
101 MySQL tuning and optimization tips , Alibaba Cloud
10. Transaction management
-
Google Cloud docs : Transactions
-
Paper 2006, Cost-based query transformation in Oracle, VLDB
-
Blog : 使用数据库事务
-
Blog : 11. 数据库事务
Lock manager
11. Network
12. Serialization
13. Concurrency Control
-
Mass Tree Paper : Cache Craftiness for Fast Multicore Key-Value Storage
-
Palm Tree Paper : PALM: Parallel Architecture-Friendly Latch-Free Modifications to B+ Trees on Many-Core Processors
-
ARTree Paper : The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases
14. Crash Recovery management
15. NoSQL
16. NewSQL
- Database · 发展前沿 · NewSQL数据库概述
- 我们是怎样打造一款分布式数据库的?
- 如何编写一个分布式数据库? , PingCAP CEO刘奇
- Understand the Differences Between NewSQL and Distributed SQL
17. Distributed & Paralleled
-
Experiences with a Distributed, Scalable, Methodological File System: AnalogicFS
-
Monarch: Google’s Planet-Scale In-Memory Time Series Database
-
Paper 2012 : Spanner: Google's Globally-Distributed Database , OSDI
15. OLAP、OLTP、HTAP
15.1 OLAP
-
WikiPedia : OnLine Analytical Processing
-
AWS Blog : What Is Online Analytical Processing?
-
Azure Blog : Online analytical processing (OLAP)
15.2 OLTP
-
wikiPedia : Online transaction processing
-
Azure Blog : Online transaction processing (OLTP)
15.3 HTAP
-
Tsinghua University databaseGroup Sigmod2022 slide : HTAP database : A Tutorial
-
A Common Database Approach for OLTP and OLAP Using an In-Memory Column DataBase
-
PingCAP : How We Build an HTAP Database That Simplifies Your Data Platform
-
StoneDB 文章
-
AlibabaCloud Community Blog : : 400x Faster HTAP Real-time Data Analysis with PolarDB
-
GreenPlum database Blog : World Class Open Source Distributed HTAP Database Based On PostgreSQL
-
Paper 2020 : TiDB: A Raft-based HTAP Database, VLDB
-
Paper 2022 : Kernel-Assisted Copy-on-Write Snapshots for Main-Memory HTAP Databases
15.4 HSAP
-
Paper : Alibaba Hologres: A Cloud-Native Service for Hybrid Serving/Analytical Processing
- VLDB Link : http://www.vldb.org/pvldb/vol13/p3272-jiang.pdf
- 《Alibaba Hologres: A Cloud-Native Service for Hybrid Serving/Analytical Processing 》论文导读
-
AlibabaCloud Community Blog : What Is the Next Stop for Big Data? Hybrid Serving/Analytical Processing (HSAP)
16. Graph Database
16.1 Courses
- YouTube Video : Introduction to Graph Databases Series
16.2 Books
-
Graph Database : New Oppotunities For Connected Data , Ian Robinson, Jim Webber & Emil Eifrem
-
Graph Databases For Beginners , Merkl Sasaki, Joy Chao & Rachel Howard
-
Graph-Databases-For-Dummies , Dr. Jim Webber & Rik Van Bruggen
-
The Definitive Guide to Graph Databases for the RDBMS Developer , Michael Hunger, Ryan Boyd & William Lyon
16.3 Papers
- Paper 2022 : ByteGraph: A High-Performance Distributed Graph Database in ByteDance , VLDB
16.4 Blogs
-
美团技术团队 Blog : 美团图数据库平台建设及业务实践
-
Graph Databases for Beginners: Why Graph Technology Is the Future
-
Graph Databases: How They Work, When to Use Them & the Advantages They Offer
-
Developing a Small-Scale Graph Database: A Ten Step Learning Guide for Beginners
-
白皮书 : 图数据库技术十大案例
16. Project Source Code Analysis
Just collect, some databases have not been read yet. Thanks to all Authors.
Database | DataBase Type | Blog | Github |
---|---|---|---|
SQLite | a small relational database management system | SQLite源码分析 | https://github.com/sqlite/sqlite |
LevelDB | fast key-value storage library | LevelDB 源码剖析 | https://github.com/google/leveldb |
MySQL | |||
PostGreSQL | PostGreSQL源码解读系列 | ||
Redis | in-memory database that persists on disk. | 1. 如何阅读 Redis 源码? 2. redis源码解析 |
https://github.com/redis/redis |
MongoDB | Cloud-Native Document Database | MongoDB 内核源码分析 | https://github.com/mongodb/mongo |
TiDB | cloud-native, distributed, MySQL-Compatible database | TiDB源码阅读分析 | https://github.com/pingcap/tidb |
TiKV | distributed key-value database | TiKV源码解析系列 | |
PolarDB-X | cloud native distributed SQL Database | PolarDB-X 源码解读 | https://github.com/polardb/polardbx-sql |
OceanBase | distributed relational database | https://github.com/oceanbase/oceanbase | |
openGauss | open source relational database management system | openGauss数据库源码解析 | https://github.com/opengauss-mirror |
StoneDB | A Real-time HTAP Database | StoneDB 源码解读系列 | |
RocksDB | A Persistent Key-Value Store for Flash and RAM Storage | 官方wiki文档 | https://github.com/facebook/rocksdb |
ToplingDB | RocksDB的增强分支 | N/A | https://github.com/topling/toplingdb |
Greenplum | open-source massively parallel data platform for analytics, machine learning and AI. | Greenplum 分布式数据库内核揭秘(上篇) Greenplum 分布式数据库内核揭秘(下篇) |
https://github.com/greenplum-db/gpdb |
YugabyteDB | high-performance, cloud-native, distributed SQL database that aims to support all PostgreSQL features. | 待更新 | https://github.com/yugabyte/yugabyte-db |
Neo4j | Graph Database | 待更新 | https://github.com/neo4j/neo4j |
JanusGraph | open-source, distributed graph database | 待更新 | https://github.com/JanusGraph/janusgraph |
OpenMLDB | an open-source machine learning database | 待更新 | https://github.com/4paradigm/OpenMLDB |
taobao MySQL 数据库内核月报 :http://mysql.taobao.org/monthly/
17. Mini-Project Labs
-
DeepDB : https://github.com/deepbodra97/Database-System-Implementation
-
Mini-OB : https://open.oceanbase.com/activities/4921877
-
CS 15-445 Labs
18. AI4DB and DB4AI (frontier tech)
- University of Magdeburg slides:
- slide 01 : https://www.dbse.ovgu.de/en/-p-578-EGOTEC-35lsmf8qh52t9v3rhrjdojrd46/_/5_ai-1.pdf
- slide 02 : https://www.dbse.ovgu.de/-p-578/_/5_ai-2.pdf
- Tsinghua University databaseGroup Github : https://github.com/TsinghuaDatabaseGroup/AIDB
- Tsinghua & MIT Paper : AI Meets Database: AI4DB and DB4AI , SIGMOD2021
- openGauss Blog : openGauss AI4DB and DB4AI
- MIT databaseGroup : http://dsg.csail.mit.edu/mlforsystems/papers/
- Blogger Github : https://github.com/LumingSun/ML4DB-paper-list
Database Retrospective
- Databases in 2021: A Year in Review , Andy Pavlo
- Databases in 2022: A Year in Review , Andy Pavlo