Improve performance of order by index optimization #1843

andrii0lomakin · 2013-11-23T08:04:29Z

Current order by optimization of queries is simple.
If we have small enough records in index and have non composite index we iterate through them.
But there is much more elegant idea of optimization of order by.
When we fetch data using index range query we can iterate through index in both directions descending and ascending so just by passing the flag of direction of iteration in index range query method we can provide much broader "order by" optimizations, actually we will simply not have restrictions of order by optimization which we have now.
Also very interesting point is that index iterators, they ask many additional memory areas to iterate which kill CPU cache locality for data which we really need to fetch so such proposed approach will be faster too.

Actually I would remove iterators from index at all.
They may be not thread safe either, because implementation com.orientechnologies.common.concur.resource.OSharedResourceIterator does not deal with thread safety of state between iterations.

Much efficient approach in this case is to introduce firstKey/lastKey methods in idex and use between queries. In such case we will iterate only through data which we need and will not prefetch unneeded data and drop iterators from index.

lvca · 2013-11-25T10:11:41Z

So this query:

select from Customer order by name

When you've 10M of Customer records, how do you prefetch so many records?

andrii0lomakin · 2013-11-25T10:13:01Z

We do not need to.
Range query will do pagination in thread safe manner.

lvca · 2013-11-25T10:14:50Z

So without iterators how do you lazy browse results?

andrii0lomakin · 2013-11-25T10:17:56Z

Paginated range query is lazy browsing of results, for example com.orientechnologies.orient.core.index.sbtree.OSBTreeMapEntryIterator in reallity paginated query.

andrii0lomakin · 2013-11-25T10:23:14Z

I got what you mean how to do async queries on the fly, but it is already implemented using listeners - com.orientechnologies.orient.core.index.OIndex#getValuesBetween. All records loaded using com.orientechnologies.orient.core.index.OIndex.IndexValuesResultListener not collections.

lvca · 2013-11-25T10:23:20Z

So do you mean to add a direction in OTreeInternal.loadEntriesMajor() to move forward and backward and use the OSBTreeMapEntryIterator to browse item?

andrii0lomakin · 2013-11-25T10:24:43Z

In such way it is up to listener how to process them. But only needed records will be loaded in ordered manner.

andrii0lomakin · 2013-11-25T10:27:04Z

I meant to add direction in com.orientechnologies.orient.core.index.OIndexEngine#getEntriesMajor and listener will follow through records in needed order.

lvca · 2013-11-25T10:29:57Z

Gotcha, +1

…es were added.

…ally by index were added.

…ssed.

…e was fixed.

ghost assigned andrii0lomakin Dec 25, 2013

andrii0lomakin mentioned this issue Jan 10, 2014

OutOfMemoryError using ORDER BY clause #1943

Closed

andrii0lomakin added the In progress label Feb 7, 2014

andrii0lomakin added a commit that referenced this issue Feb 12, 2014

Implementation of Issue #1843 but some tests are still failed.

1b58f42

andrii0lomakin added a commit that referenced this issue Feb 12, 2014

Issue #1843 tests for "order by" full optimization by composite index…

52d08e6

…es were added.

andrii0lomakin added a commit that referenced this issue Feb 12, 2014

Issue #1843 tests for "order by" when collection is sorted only parti…

2b510a5

…ally by index were added.

andrii0lomakin added a commit that referenced this issue Feb 12, 2014

Issue #1843 tests for "order by" for local and remote storage were pa…

b4afb9a

…ssed.

andrii0lomakin added a commit that referenced this issue Feb 12, 2014

Issue #1843 tests for "order by" for memory storage were passed. Issu…

0d2f688

…e was fixed.

andrii0lomakin removed the In progress label Feb 12, 2014

andrii0lomakin closed this as completed Feb 12, 2014

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of order by index optimization #1843

Improve performance of order by index optimization #1843

andrii0lomakin commented Nov 23, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013

Improve performance of order by index optimization #1843

Improve performance of order by index optimization #1843

Comments

andrii0lomakin commented Nov 23, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

andrii0lomakin commented Nov 25, 2013

lvca commented Nov 25, 2013