db-papers icon indicating copy to clipboard operation
db-papers copied to clipboard

A list papers of learning how to building database system

README autogen


Database Papers

This is a comprehensive list of papers on database theory for understanding and building database systems. It covers various aspects of database systems, including the essential theoretical background, classic system design, and multiple modules within the database.

The list is organized into different categories and subcategories for easy navigation. Each paper is accompanied by a title, author, and publication year, along with a link to the full text if available.

This collection serves as a learning and training resource primarily for the Tencent Cloud Database Team and is also open to external researchers, students, and learners interested in database systems.

In case you are reading this and making the effort to comprehend these papers, we would really like to have a conversation with you regarding opportunities at Tencent Cloud Database Team (@Henry L.).

Contribution

This list is generated from a Sheet document automatically. If you have any suggestions or would like to contribute to this list, please feel free to file an issue. And we will update our sheet to make the chagnes available for public.

Any contribution that can help improve this list and make it more comprehensive and useful to the community are welcome. Here are some ways you can contribute:

  • Add a new paper: If you have a paper that you think should be included in this list, please file an issue to provide the paper's title, author, publication year, and a link to the full text (if available).
  • Update an existing paper: If you find any errors or outdated information in the list, please file an issue to provide the correct information.
  • Remove a paper: If you think a paper is no longer relevant or useful, please file an issue to suggest its removal.
  • General suggestions: If you have any general suggestions or feedback on how to improve this list, please file an issue to share your thoughts.

Table of Contents

  • Basics
    • Essentials
    • Consensus
    • Consistency
  • System Design
    • Architecture
    • RDBMS
    • NoSQL
  • SQL Engine
    • Optimizer Framework
    • Transformation
    • Nested Query
    • Functional Dependencies
    • Join Order
    • Cost Model
    • Statistics
    • Probabilistic Counting
    • Execution Engine
    • Parallel Execution
  • Storage Engine
    • Storage Media
    • Storage Structure
    • Transaction
    • Scheduling
  • Miscellaneous
    • Workload
    • Network
    • Quality
    • Diagnosis and Tuning

Basics

Essentials

  • A Relational Model Of Data For Large Shared Data Banks (1970) - Codd, Edgar F.
  • Sequel: A Structured English Query Language (1974) - Chamberlin, Donald D., and Raymond F. Boyce.
  • Ingres: A Relational Data Base System (1975) - Held, G. D., M. R. Stonebraker, and Eugene Wong.
  • Extending The Database Relational Model To Capture More Meaning (1979) - Codd, Edgar F.
  • A Critique Of The Sql Database Language (1984) - Date, C. J.
  • A Critique Of Snapshot Isolation (2012) - Yabandeh M, Gómez Ferro D.

Consensus

Consistency

System Design

Architecture

  • Architecture Of A Database System. Foundations And Trends In Databases (2007) - Hellerstein J M, Stonebraker M, Hamilton J.

RDBMS

NoSQL

SQL Engine

Optimizer Framework

Transformation

Nested Query

Functional Dependencies

  • Fundamental Techniques For Order Optimization (1996) - Simmen, David, Eugene Shekita, and Timothy Malkemus.
  • [Thesis] Exploiting Functional Dependence In Query Optimization (2000) - Paulley, Glenn Norman.
  • An Efficient Framework For Order Optimization (2004) - Neumann, Thomas, and Guido Moerkotte.
  • Incorporating Partitioning And Parallel Plans Into The Scope Optimizer (2010) - Zhou, Jingren, Per-Ake Larson, and Ronnie Chaiken.
  • Accelerating Queries With Groupby And Join By Group Join (2011) - Moerkotte, Guido, and Thomas Neumann.

Join Order

Cost Model

Statistics

Probabilistic Counting

Execution Engine

Parallel Execution

Storage Engine

Storage Media

Storage Structure

Transaction

Scheduling

Miscellaneous

Workload

Network

Quality

Diagnosis and Tuning

  • Automatic Sql Tuning In Oracle 10G (2004) - Dageville B, Das D, Dias K, et al.
  • Automatic Performance Diagnosis And Tuning In Oracle (2005) - Dias K, Ramacher M, Shaft U, et al.