partial eval: cover comprehensions that do not depend on unknowns #653

tsandall · 2018-03-08T22:53:56Z

Comprehensions that do not depend on unknown values should be partially evaluated. Today they are not, they are simply saved off. For example, given:

allow {
  # where ... does not depend on input
  rows = [[x, y] | ...]
  rows[i][0] == input.x
  rows[i][1] == input.y
}

This rule could be expanded out to:

allow {
  rows = <array literal>
  rows[0][0] == input.x
  rows[0][1] == input.y
}

allow {
  rows = <array literal>
  rows[1][0] == input.x
  rows[1][1] == input.y
}

...

allow {
  rows = <array literal>
  rows[...][0] == input.x
  rows[...][1] == input.y
}

For now, it should be fine to inline the comprehension result into the query. For comprehensions that don't depend on variables in the query, this is inefficient, but these comprehensions should be lifted out of the rule at compile time anyway.

The text was updated successfully, but these errors were encountered:

t83714 · 2019-05-07T05:41:14Z

@tsandall
We use partial eval to integrate with elastic search & dataset for data filtering.
As comprehensions will not be partially evaluated even not depend on unknown values, does that mean we should avoid using comprehensions for any policies for data filtering purpose?

Do we have any timelines on this feature?

t83714 · 2019-05-07T06:14:13Z

@tsandall
More info on this one. For sample rule (allowRead) below, is there an alternative way to achieve the same logic without comprehensions?

### if not listed on controlledItems, assume allowed
controlledItems[item] {
    item := {
        "uri": "xxxx/xxx/a",
         # put as boolean value to make it simple. The actual value may depend on another rule
        "allow": false
    }
}

controlledItems[item] {
    item := {
        "uri": "xxxx/xxx/b",
        "allow": true
    }
}

default allowRead = false

allowRead {
    
    ## It's not a pattern
    contains(input.operationUri, "*") != true

    disallowedUris := [uri | uri := controlledItems[i].uri; controlledItems[i].allow = false]

    disallowedUris[_] != input.operationUri
    
    input.object.content.id == input.operationUri
}

allowRead {
    
    ## It's a pattern
    contains(input.operationUri, "*") == true

    disallowedUris := [uri | uri := controlledItems[i].uri; 
                             glob.match(input.operationUri, ["/"], uri) == true;
                             controlledItems[i].allow = false]

    count(disallowedUris) == 0
    # No matched disallowedUris found, thus, allow
}

allowRead {
    
    ## It's a pattern
    contains(input.operationUri, "*") == true

    disallowedUris := [uri | uri := controlledItems[i].uri; 
                             glob.match(input.operationUri, ["/"], uri) == true;
                             controlledItems[i].allow = false]

    count(disallowedUris) > 0
    
    input.object.content.id != disallowedUris[_]
}

tsandall · 2019-05-09T17:39:09Z

@t83714 here's an example that I think implements your intended policy: https://play.openpolicyagent.org/p/aiipr4eQm3

Note the playground example has an important difference: allowRead is true if input.object.content.id is not equal to ALL matching disallowed URIs. In your example above, input.object.content.id != disallowedUris[_] doesn't mean "for all", it means "for any", i.e., your example says allowRead is true if input.object.content.id is not equal to ANY matching disallowed URIs. This means that if input.object.content.id was "foo" and disallowdUris contained "foo" and "bar", allowRead would be true because "foo" is not equal to "bar".

We have an open issue to improve the docs to explicitly call out how to express "FOR ALL" vs. "FOR ANY" (the default): #1307

t83714 · 2019-05-15T01:34:57Z

@tsandall

Thanks a lot for the example and your comments regarding "FOR ALL" vs. "FOR ANY"~

And thanks for the hint of using negation as an alternative way to implement the similar "For All" logic 👍

You example logic work very well, but I still have some issues around partial eval result when trying to implement this for our data filtering purpose.

When I test it for partial eval in the terminal, I got the empty result (Please let me know If I did it incorrectly ):

OPA 0.10.7 (commit 0f39c27e, built at 2019-04-09T00:29:06Z)

Run 'help' to see a list of commands.

> unknown input.object.content
> data.play.allowRead with input as {
|   "operationUri": "xxxx/**"
| }
| 
+---------+---------------------------------------------------------------+
| Query 1 | data.play.allowRead with input as {"operationUri": "xxxx/**"} |
+---------+---------------------------------------------------------------+
>

The partial eval result through compile API looks correct 🎉 . But not ideal as it left unnecessary local var in the rule body.
e.g. For the following request:

{
  "query": "data.play.allowRead == true",
  "input": {
  	"operationUri": "xxxx/**"
  },
  "unknowns": [
    "input.object.content"
  ]
}

I got the following rule body in support (the local var __local3__ & __local3__1 seems unnecessary):

{
  "body": [
    {
      "index": 0,
      "negated": true,
      "terms": [
        {
          "type": "ref",
          "value": [
            {
              "type": "var",
              "value": "eq"
            }
          ]
        },
        {
          "type": "var",
          "value": "__local3__1"
        },
        {
          "type": "string",
          "value": "xxxx/xxx/a"
        }
      ]
    },
    {
      "index": 1,
      "terms": [
        {
          "type": "ref",
          "value": [
            {
              "type": "var",
              "value": "eq"
            }
          ]
        },
        {
          "type": "var",
          "value": "__local3__"
        },
        {
          "type": "ref",
          "value": [
            {
              "type": "var",
              "value": "input"
            },
            {
              "type": "string",
              "value": "object"
            },
            {
              "type": "string",
              "value": "content"
            },
            {
              "type": "string",
              "value": "id"
            }
          ]
        }
      ]
    }
  ]
}

For our project, I think we can overcome it by making some further parsing & adjustment once received the partial eval result.

Not sure it's worth fixing and if so I can create a ticket for it 😄

Below is the full response (in case you need it):

{
    "result": {
        "queries": [
            [
                {
                    "index": 0,
                    "terms": [
                        {
                            "type": "ref",
                            "value": [
                                {
                                    "type": "var",
                                    "value": "equal"
                                }
                            ]
                        },
                        {
                            "type": "ref",
                            "value": [
                                {
                                    "type": "var",
                                    "value": "data"
                                },
                                {
                                    "type": "string",
                                    "value": "partial"
                                },
                                {
                                    "type": "string",
                                    "value": "play"
                                },
                                {
                                    "type": "string",
                                    "value": "allowRead"
                                }
                            ]
                        },
                        {
                            "type": "boolean",
                            "value": true
                        }
                    ]
                }
            ]
        ],
        "support": [
            {
                "package": {
                    "path": [
                        {
                            "type": "var",
                            "value": "data"
                        },
                        {
                            "type": "string",
                            "value": "partial"
                        },
                        {
                            "type": "string",
                            "value": "play"
                        }
                    ]
                },
                "rules": [
                    {
                        "head": {
                            "name": "allowRead",
                            "value": {
                                "type": "boolean",
                                "value": true
                            }
                        },
                        "body": [
                            {
                                "index": 0,
                                "negated": true,
                                "terms": [
                                    {
                                        "type": "ref",
                                        "value": [
                                            {
                                                "type": "var",
                                                "value": "eq"
                                            }
                                        ]
                                    },
                                    {
                                        "type": "var",
                                        "value": "__local3__1"
                                    },
                                    {
                                        "type": "string",
                                        "value": "xxxx/xxx/a"
                                    }
                                ]
                            },
                            {
                                "index": 1,
                                "terms": [
                                    {
                                        "type": "ref",
                                        "value": [
                                            {
                                                "type": "var",
                                                "value": "eq"
                                            }
                                        ]
                                    },
                                    {
                                        "type": "var",
                                        "value": "__local3__"
                                    },
                                    {
                                        "type": "ref",
                                        "value": [
                                            {
                                                "type": "var",
                                                "value": "input"
                                            },
                                            {
                                                "type": "string",
                                                "value": "object"
                                            },
                                            {
                                                "type": "string",
                                                "value": "content"
                                            },
                                            {
                                                "type": "string",
                                                "value": "id"
                                            }
                                        ]
                                    }
                                ]
                            }
                        ]
                    },
                    {
                        "default": true,
                        "head": {
                            "name": "allowRead",
                            "value": {
                                "type": "boolean",
                                "value": false
                            }
                        },
                        "body": [
                            {
                                "index": 0,
                                "terms": {
                                    "type": "boolean",
                                    "value": true
                                }
                            }
                        ]
                    }
                ]
            }
        ]
    }
}

Certain portions of Rego cannot be easily inlined when they depend on unknowns (e.g., comprehensions). In the past, if PE encountered statements that could not be inlined, PE would simply save the original statement regardless of whether that statement (or any of it's dependencies) depended on unknowns. This was the safest way to introduce PE initially and worked in many cases as long as authors were aware of the fragment of Rego that could be PE-ed. Of course, this places a burden on authors and also fails when constructs like comprehensions are required. This commit enhances the evaluator so that PE will evaluate all terms (e.g., comprehensions, references to full extent of partial sets/objects, etc.) as long as they do not depend on unknowns. This commit also enhances the evaluato to support PE on expressions that include with statements. Support for with statements is handled by disabling inlining on any references that are contained in the expression modified by the with statement. This approach was chosen over open-policy-agent#1956 because while it generates support rules it avoids the need to re-apply with statements to inlined expressions. This commit also addresses open-policy-agent#1417 because negated expressions that can be completely evaluated will be now. As part of this change, the evaluator has been refactored to maintain the inlining controls as a stack (this makes with statements easier to handle.) name old time/op new time/op delta InliningFullScan/1000-8 5.89ms ± 0% 5.92ms ± 3% ~ (p=0.548 n=5+5) InliningFullScan/10000-8 65.8ms ± 2% 65.1ms ± 0% ~ (p=0.095 n=5+5) InliningFullScan/300000-8 2.06s ± 2% 2.07s ± 5% ~ (p=0.841 n=5+5) name old alloc/op new alloc/op delta InliningFullScan/1000-8 2.68MB ± 0% 2.68MB ± 0% +0.01% (p=0.008 n=5+5) InliningFullScan/10000-8 27.4MB ± 0% 27.4MB ± 0% +0.00% (p=0.008 n=5+5) InliningFullScan/300000-8 825MB ± 0% 825MB ± 0% ~ (p=0.087 n=5+5) name old allocs/op new allocs/op delta InliningFullScan/1000-8 81.0k ± 0% 81.0k ± 0% +0.01% (p=0.008 n=5+5) InliningFullScan/10000-8 810k ± 0% 810k ± 0% +0.00% (p=0.008 n=5+5) InliningFullScan/300000-8 24.3M ± 0% 24.3M ± 0% +0.00% (p=0.008 n=5+5) Fixes open-policy-agent#1069 Fixes open-policy-agent#653 Fixes open-policy-agent#1421 Fixes open-policy-agent#1417 Signed-off-by: Torin Sandall <torinsandall@gmail.com>

tsandall added the optimization label Mar 8, 2018

tsandall added this to the v0.8 milestone Mar 9, 2018

tsandall removed this from the v0.8 milestone Apr 16, 2018

t83714 mentioned this issue May 9, 2019

partial eval: Shouldn't include never matched rule in support rules #1400

Closed

tsandall added the partial-eval label Jun 20, 2019

tsandall self-assigned this Jan 16, 2020

tsandall closed this as completed in 863e46e Jan 29, 2020

This was referenced Sep 11, 2020

[Snyk] Security upgrade node-fetch from 2.6.0 to 2.6.1 qsays/opa#5

Open

[Snyk] Security upgrade node-fetch from 2.6.0 to 2.6.1 verygoodsecurity/opa#6

Closed

snyk-bot mentioned this issue Feb 4, 2022

[Snyk] Upgrade node-fetch from 2.6.0 to 2.6.6 airadier/opa#2

Open

ashutosh-narkar added this to Open Policy Agent Aug 5, 2024

ashutosh-narkar moved this to Done in Open Policy Agent Aug 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

partial eval: cover comprehensions that do not depend on unknowns #653

partial eval: cover comprehensions that do not depend on unknowns #653

tsandall commented Mar 8, 2018 •

edited

Loading

t83714 commented May 7, 2019

t83714 commented May 7, 2019

tsandall commented May 9, 2019

t83714 commented May 15, 2019

partial eval: cover comprehensions that do not depend on unknowns #653

partial eval: cover comprehensions that do not depend on unknowns #653

Comments

tsandall commented Mar 8, 2018 • edited Loading

t83714 commented May 7, 2019

t83714 commented May 7, 2019

tsandall commented May 9, 2019

t83714 commented May 15, 2019

tsandall commented Mar 8, 2018 •

edited

Loading