WL#13066: Improve predicate handling in loose index scan for group by.

Affects: Server-8.0   —   Status: Complete

Based on BUG#90951 Contribution by Facebook: Enhance group-by loose index scan

This worklog improves loose index scan(LIS) used for GROUP BY queries by
lifting the current limitation of its usage w.r.t the number of infix ranges
(range on non-grouping column) that can be present in the query.
This improvement would make optimizer pick LIS for more queries.

Currently LIS cannot be used when a query has more than one infix range. 

Consider the following test case:

CREATE TABLE t1 (f1 INT NOT NULL, f2 INT NOT NULL, f3 INT NOT NULL, PRIMARY
KEY(f1, f2, f3));
INSERT INTO t1 VALUES
(1,1,1), (1,2,2), (1,3,3), (1,4, 4), (1,5,5),
(2,1,1), (2,2,2), (2,3,3), (2,4, 4), (2,5,5),
(3,1,1), (3,2,2), (3,3,3), (3,4, 4), (3,5,5);
INSERT INTO t1 SELECT f1, f2 + 5, f3 + 5 FROM t1;
INSERT INTO t1 SELECT f1, f2 + 10, f3 + 10 FROM t1;
INSERT INTO t1 SELECT f1, f2 + 20, f3 + 20 FROM t1;
INSERT INTO t1 SELECT f1, f2 + 40, f3 + 40 FROM t1;
ANALYZE TABLE t1;

The following query uses LIS as there is only one infix range ("f2=2")

SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2) GROUP BY f1;

However this query does not use LIS if one more infix range is added (say "f2 =
4")like in the query below.

SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2 OR f2 = 4) GROUP BY f1;

When optimizer picks the key(f1,f2,f3) for doing loose index scan,
->It gets a distinct value for "f1"
->This distinct value followed by the next key columns's value
 ("f2"s value would be 2 for the query above) is used to form a key.
->It then uses this key to read the first/last (min/max) qualifying row.
However when it has two different values for "f2", the second step cannot be
performed with the current code. Hence LIS is not picked for the second query
mentioned above.

As a result, currently we have the following restriction mentioned in the rules
applied for picking LIS for group by:
"NGA3.If BA <> {}, there can only be one range. TODO: This is a code limitation
and is not strictly needed. See BUG#15947433 (Sakila)"

The objective of this task is to lift the limitation mentioned above to allow
dis-junction of equality predicates for the infix ranges.

The following example shows the advantage of using LIS. As can be seen, the
number of accessed rows are decreased if LIS is used.

Without LIS
---------------------
explain SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2 OR f2 = 4) GROUP
BY f1;
id	select_type	table	partitions	type	possible_keys	key	key_len	ref	rows
filtered	Extra
1	SIMPLE	t1	NULL	range	PRIMARY	PRIMARY	4	NULL	80	19.00	Using where; Using index
Warnings:
Note	1003	/* select#1 */ select `test`.`t1`.`f1` AS `f1`,max(`test`.`t1`.`f3`)
AS `MAX(f3)` from `test`.`t1` where ((`test`.`t1`.`f1` > 2) and
((`test`.`t1`.`f2` = 2) or (`test`.`t1`.`f2` = 4))) group by `test`.`t1`.`f1`
FLUSH STATUS;
SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2 OR f2 = 4) GROUP BY f1;
f1	MAX(f3)
3	4
SHOW STATUS LIKE 'handler_read%';
Variable_name	Value
Handler_read_first	0
Handler_read_key	1
Handler_read_last	0
Handler_read_next	80
Handler_read_prev	0
Handler_read_rnd	0
Handler_read_rnd_next	0
---------------------

With LIS
---------------------
explain SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2 OR f2 = 4) GROUP
BY f1;
id	select_type	table	partitions	type	possible_keys	key	key_len	ref	rows
filtered	Extra
1	SIMPLE	t1	NULL	range	PRIMARY	PRIMARY	8	NULL	3	100.00	Using where; Using index
for group-by
Warnings:
Note	1003	/* select#1 */ select `test`.`t1`.`f1` AS `f1`,max(`test`.`t1`.`f3`)
AS `MAX(f3)` from `test`.`t1` where ((`test`.`t1`.`f1` > 2) and
((`test`.`t1`.`f2` = 2) or (`test`.`t1`.`f2` = 4))) group by `test`.`t1`.`f1`
FLUSH STATUS;
SELECT f1, MAX(f3) FROM t1 WHERE (f1 > 2) AND (f2 = 2 OR f2 = 4) GROUP BY f1;
f1	MAX(f3)
3	4
SHOW STATUS LIKE 'handler_read%';
Variable_name	Value
Handler_read_first	0
Handler_read_key	5
Handler_read_last	1
Handler_read_next	0
Handler_read_prev	0
Handler_read_rnd	0
Handler_read_rnd_next	0
---------------------