MySQL 9.0.0
Source Code Documentation
recv_sys_t Struct Reference

Recovery system data structure. More...

#include <log0recv.h>

Classes

struct  Encryption_Key
 
struct  Mlog_record
 Mini transaction log record. More...
 
struct  Space
 Every space has its own heap and pages that belong to it. More...
 

Public Types

using Pages = std::unordered_map< page_no_t, recv_addr_t *, std::hash< page_no_t >, std::equal_to< page_no_t > >
 
using Missing_Ids = std::set< space_id_t >
 
using Spaces = std::unordered_map< space_id_t, Space, std::hash< space_id_t >, std::equal_to< space_id_t > >
 
using Encryption_Keys = std::vector< Encryption_Key >
 
using Mlog_records = std::vector< Mlog_record, ut::allocator< Mlog_record > >
 

Public Member Functions

void save_rec (size_t rec_num, space_id_t space_id, page_no_t page_no, mlog_id_t type, const byte *body, size_t len)
 Save mlog record information. More...
 
bool get_saved_rec (size_t rec_num, space_id_t &space_id, page_no_t &page_no, mlog_id_t &type, const byte *&body, size_t &len)
 Return saved mlog record information, if there. More...
 

Public Attributes

ib_mutex_t mutex
 mutex protecting the fields apply_log_recs, decrements of n_pages_to_recover, and the state field in each recv_addr struct More...
 
ib_mutex_t writer_mutex
 mutex coordinating flushing between recv_writer_thread and the recovery thread. More...
 
os_event_t flush_start
 event to activate page cleaner threads More...
 
os_event_t flush_end
 event to signal that the page cleaner has finished the request More...
 
buf_flush_t flush_type
 type of the flush request. More...
 
bool apply_log_recs
 This is true when log rec application to pages is allowed; this flag tells the i/o-handler if it should do log record application. More...
 
bytebuf
 Buffer for parsing log records. More...
 
size_t buf_len
 Size of the parsing buffer. More...
 
ulint len
 Amount of data in buf. More...
 
lsn_t parse_start_lsn
 This is the lsn from which we were able to start parsing log records and adding them to the hash table; zero if a suitable start point not found yet. More...
 
lsn_t checkpoint_lsn
 Checkpoint lsn that was used during recovery (read from file). More...
 
ulint bytes_to_ignore_before_checkpoint
 Number of data bytes to ignore until we reach checkpoint_lsn. More...
 
lsn_t scanned_lsn
 The log data has been scanned up to this lsn. More...
 
uint32_t scanned_epoch_no
 The log data has been scanned up to this epoch_no. More...
 
ulint recovered_offset
 Start offset of non-parsed log records in buf. More...
 
lsn_t recovered_lsn
 The log records have been parsed up to this lsn. More...
 
lsn_t previous_recovered_lsn
 The previous value of recovered_lsn - before we parsed the last mtr. More...
 
lsn_t last_block_first_mtr_boundary {}
 Tracks what should be the proper value of first_rec_group field in the header of the block to which recovered_lsn belongs. More...
 
bool found_corrupt_log
 Set when finding a corrupt log block or record, or there is a log parsing buffer overflow. More...
 
bool found_corrupt_fs
 Set when an inconsistency with the file system contents is detected during log scan or apply. More...
 
bool is_cloned_db
 Data directory has been recognized as cloned data directory. More...
 
bool is_meb_db
 Data directory has been recognized as data directory from MEB. More...
 
bool dblwr_state
 Doublewrite buffer state before MEB recovery starts. More...
 
Spacesspaces
 Hash table of pages, indexed by SpaceID. More...
 
ut::Todo_counter n_pages_to_recover
 Number of unique unprocessed page ids in the spaces nested hash table. More...
 
dblwr::recv::DBLWRdblwr
 Doublewrite buffer pages, destroyed after recovery completes. More...
 
MetadataRecovermetadata_recover
 We store and merge all table persistent data here during scanning redo logs. More...
 
Encryption_Keyskeys
 Encryption Key information per tablespace ID. More...
 
Missing_Ids missing_ids
 Tablespace IDs that were ignored during redo log apply. More...
 
Missing_Ids deleted
 Tablespace IDs that were explicitly deleted. More...
 
Mlog_records saved_recs
 

Static Public Attributes

static constexpr size_t MAX_SAVED_MLOG_RECS = 8 * 1024
 While scanning logs for multi-record mini-transaction (mtr), we have two passes. More...
 

Detailed Description

Recovery system data structure.

Member Typedef Documentation

◆ Encryption_Keys

◆ Missing_Ids

◆ Mlog_records

◆ Pages

using recv_sys_t::Pages = std::unordered_map<page_no_t, recv_addr_t *, std::hash<page_no_t>, std::equal_to<page_no_t> >

◆ Spaces

using recv_sys_t::Spaces = std::unordered_map<space_id_t, Space, std::hash<space_id_t>, std::equal_to<space_id_t> >

Member Function Documentation

◆ get_saved_rec()

bool recv_sys_t::get_saved_rec ( size_t  rec_num,
space_id_t space_id,
page_no_t page_no,
mlog_id_t type,
const byte *&  body,
size_t &  len 
)
inline

Return saved mlog record information, if there.

Works only in single threaded recovery scanner.

Parameters
[in]rec_numrecord number in multi record group
[out]space_idspace ID for the log record
[out]page_nopage number for the log record
[out]typelog record type
[out]bodypointer to log record body in recovery buffer
[out]lenlength of the log record
Returns
true iff saved record data is found.

◆ save_rec()

void recv_sys_t::save_rec ( size_t  rec_num,
space_id_t  space_id,
page_no_t  page_no,
mlog_id_t  type,
const byte body,
size_t  len 
)
inline

Save mlog record information.

Silently returns if cannot save. Works only in single threaded recovery scanner.

Parameters
[in]rec_numrecord number in multi record group
[in]space_idspace ID for the log record
[in]page_nopage number for the log record
[in]typelog record type
[in]bodypointer to log record body in recovery buffer
[in]lenlength of the log record

Member Data Documentation

◆ apply_log_recs

bool recv_sys_t::apply_log_recs

This is true when log rec application to pages is allowed; this flag tells the i/o-handler if it should do log record application.

◆ buf

byte* recv_sys_t::buf

Buffer for parsing log records.

◆ buf_len

size_t recv_sys_t::buf_len

Size of the parsing buffer.

◆ bytes_to_ignore_before_checkpoint

ulint recv_sys_t::bytes_to_ignore_before_checkpoint

Number of data bytes to ignore until we reach checkpoint_lsn.

◆ checkpoint_lsn

lsn_t recv_sys_t::checkpoint_lsn

Checkpoint lsn that was used during recovery (read from file).

◆ dblwr

dblwr::recv::DBLWR* recv_sys_t::dblwr

Doublewrite buffer pages, destroyed after recovery completes.

◆ dblwr_state

bool recv_sys_t::dblwr_state

Doublewrite buffer state before MEB recovery starts.

We restore to this state after MEB recovery completes and disable the doublewrite buffer during MEB recovery.

◆ deleted

Missing_Ids recv_sys_t::deleted

Tablespace IDs that were explicitly deleted.

◆ flush_end

os_event_t recv_sys_t::flush_end

event to signal that the page cleaner has finished the request

◆ flush_start

os_event_t recv_sys_t::flush_start

event to activate page cleaner threads

◆ flush_type

buf_flush_t recv_sys_t::flush_type

type of the flush request.

BUF_FLUSH_LRU: flush end of LRU, keeping free blocks. BUF_FLUSH_LIST: flush all of blocks.

◆ found_corrupt_fs

bool recv_sys_t::found_corrupt_fs

Set when an inconsistency with the file system contents is detected during log scan or apply.

◆ found_corrupt_log

bool recv_sys_t::found_corrupt_log

Set when finding a corrupt log block or record, or there is a log parsing buffer overflow.

◆ is_cloned_db

bool recv_sys_t::is_cloned_db

Data directory has been recognized as cloned data directory.

◆ is_meb_db

bool recv_sys_t::is_meb_db

Data directory has been recognized as data directory from MEB.

◆ keys

Encryption_Keys* recv_sys_t::keys

Encryption Key information per tablespace ID.

◆ last_block_first_mtr_boundary

lsn_t recv_sys_t::last_block_first_mtr_boundary {}

Tracks what should be the proper value of first_rec_group field in the header of the block to which recovered_lsn belongs.

It might be also zero, in which case it means we do not know.

◆ len

ulint recv_sys_t::len

Amount of data in buf.

◆ MAX_SAVED_MLOG_RECS

constexpr size_t recv_sys_t::MAX_SAVED_MLOG_RECS = 8 * 1024
staticconstexpr

While scanning logs for multi-record mini-transaction (mtr), we have two passes.

In first pass, we check if all the logs of the mtr is present in current recovery buffer or not. If yes, then in second pass we go through the logs again the add to hash table for apply. To avoid parsing multiple times, we save the parsed records in first pass and reuse them in second pass.

Parsing of redo log takes significant amount of time and this optimization of avoiding second parse gave about 1.8x speed up on recovery scan time of 1G of redo log from sysbench rw test.

There is currently no limit for maximum number of logs in an mtr. Practically, from sysbench rw test recovery with 1G of redo log to recover from the record count were spread from 3 - 1235 with majority between 600 - 700. So, it is likely by saving 1k records we could avoid most of the re-parsing overhead. Considering possible bigger number of records in other load and future changes the limit for number of saved records is kept at 8k. The same value from the contribution patch. The memory requirement 32 x 8k = 256k seems fine as one time overhead for the entire instance.

◆ metadata_recover

MetadataRecover* recv_sys_t::metadata_recover

We store and merge all table persistent data here during scanning redo logs.

◆ missing_ids

Missing_Ids recv_sys_t::missing_ids

Tablespace IDs that were ignored during redo log apply.

◆ mutex

ib_mutex_t recv_sys_t::mutex

mutex protecting the fields apply_log_recs, decrements of n_pages_to_recover, and the state field in each recv_addr struct

◆ n_pages_to_recover

ut::Todo_counter recv_sys_t::n_pages_to_recover

Number of unique unprocessed page ids in the spaces nested hash table.

Increments are done only from the main recovery thread before apply starts. Decrements are done from multiple threads during batch apply phase, and are protected by the recv_sys_t::mutex.

◆ parse_start_lsn

lsn_t recv_sys_t::parse_start_lsn

This is the lsn from which we were able to start parsing log records and adding them to the hash table; zero if a suitable start point not found yet.

◆ previous_recovered_lsn

lsn_t recv_sys_t::previous_recovered_lsn

The previous value of recovered_lsn - before we parsed the last mtr.

It is equal to recovered_lsn before we parsed any mtr. This is used to find moments in which recovered_lsn moves to the next block in which case we should update the last_block_first_rec_group (described below).

◆ recovered_lsn

lsn_t recv_sys_t::recovered_lsn

The log records have been parsed up to this lsn.

◆ recovered_offset

ulint recv_sys_t::recovered_offset

Start offset of non-parsed log records in buf.

◆ saved_recs

Mlog_records recv_sys_t::saved_recs

◆ scanned_epoch_no

uint32_t recv_sys_t::scanned_epoch_no

The log data has been scanned up to this epoch_no.

◆ scanned_lsn

lsn_t recv_sys_t::scanned_lsn

The log data has been scanned up to this lsn.

◆ spaces

Spaces* recv_sys_t::spaces

Hash table of pages, indexed by SpaceID.

◆ writer_mutex

ib_mutex_t recv_sys_t::writer_mutex

mutex coordinating flushing between recv_writer_thread and the recovery thread.


The documentation for this struct was generated from the following file: