WL#6380: Formulate framework for API for DD

Affects: Server-5.7   —   Status: Complete

A Data Dictionary is a collection of information about database objects, such
as tables, views, stored programs. Also known as metadata. In other words,
Data Dictionary stores information about e.g. table structure -- which
columns, indexes, etc every table has.

Data Dictionary also stores information about tables in INFORMATION_SCHEMA
and PERFORMANCE_SCHEMA. These tables don't physically exist in a storage
engine, they populated by the server on the fly.

As we're developing a new Data Dictionary (where all data is stored in
transactional tables instead of just simple files or MyISAM tables
as it is now), we want to come up with a clean and straightforward API
to deal with the new Data Dictionary.

The Data Dictionary API means a few things in this WL:

  - mainly, it is the internal API the server code will use to get and
    update Data Dictionary information.

  - external API, which should be used for accessing Data Dictionary
    information from plugins (including 3rd party Storage Engines),
    will also be discussed.

  - discussing API (both internal and external) means discussing public
    interface and key implementation concepts.

The Data Dictionary API is an intraprocess API, i.e. it can be used
within the server process only.

Developing a new Data Dictionary is large project. This WL is intended to:

  - outline our vision on the new Data Dictionary API, including how
    Data Dictionary information will be represented in main memory.
    I.e. define how we would like it to be eventually.

  - describe every major aspect of the new API and discuss possible
    alternatives.

New Data Dictionary project will be implemented step by step. There is a
large legacy, which should be supported and maintained. Also, the current
server code does not have clear boundaries between various modules, which
means there is no "old API", that can be "just replaced" by the new API.
Implementing and using a new API means refactoring of the server to break the
code into different modules and make proper interfaces among them.
Most likely, the new API will be formulated and implemented iteratively.
This WL serves like a guideline in that process.

Strictly speaking the Data Dictionary consists of two parts:
  - "operational part" -- Data Dictionary information which is in memory (RAM);
  - "stored part" -- Data Dictionary information which is stored in tables
    (on disk);
This WL describes the "operational part".

See also:
  - WL#6379: Schema definitions for new DD