✨ decode: add `raise_on_limit_exceeded` option #11

techouse · 2024-11-23T15:19:07Z

Description

This pull request introduces several changes to the qs_codec package, focusing on enhancing the handling of list and parameter limits, improving error handling, and updating test cases accordingly.

Enhancements to list and parameter limits:

src/qs_codec/decode.py: Modified _parse_array_value and _parse_query_string_values to include checks for list and parameter limits, raising errors when limits are exceeded. [1] [2]
src/qs_codec/models/decode_options.py: Added raise_on_limit_exceeded option to DecodeOptions to control error raising when limits are exceeded.

Error handling improvements:

src/qs_codec/decode.py: Updated _parse_object to handle current list length when parsing values, ensuring limits are respected.
src/qs_codec/utils/utils.py: Enhanced merge and compact functions to handle Undefined values more robustly. [1] [2]

Test case updates:

tests/unit/decode_test.py: Added new test cases in TestParameterList and TestListLimit to validate the new limit checks and error handling.
tests/unit/example_test.py: Adjusted existing tests to reflect changes in list handling, ensuring accurate results.

Type of change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Added new tests

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
Any dependent changes have been merged and published in downstream modules

Summary by CodeRabbit

Release Notes

New Features
- Enhanced error handling for parameter and list limits in the decoding process.
- Introduced a new option to raise errors when limits are exceeded.
Bug Fixes
- Improved handling of Undefined values during merging and compacting operations.
Tests
- Expanded test coverage for parameter and list limit functionalities.
- Updated existing tests to reflect changes in output format for list indices.

These updates aim to provide a more robust and user-friendly experience when working with data decoding.

coderabbitai · 2024-11-23T15:19:14Z

Walkthrough

The changes in this pull request enhance the functionality and error handling of the qs_codec library, specifically in the decode.py, decode_options.py, and utils.py files. Key modifications include the introduction of new parameters and attributes for managing list and parameter limits, refinements to existing functions, and the addition of extensive unit tests to validate these changes. The overall structure of the affected classes remains intact, with a focus on improving robustness and clarity in handling various decoding scenarios.

Changes

File Path	Change Summary
`src/qs_codec/decode.py`	Updated `_parse_array_value` to include `current_list_length`. Enhanced error handling for list limits. Refined `_parse_query_string_values` for parameter limits.
`src/qs_codec/models/decode_options.py`	Added `raise_on_limit_exceeded` attribute to `DecodeOptions`. Updated docstrings for `strict_depth` and `strict_null_handling` for clarity.
`src/qs_codec/utils/utils.py`	Modified `merge` and `compact` methods to improve handling of `Undefined` values. Removed `_compact_queue` method.
`tests/unit/decode_test.py`	Added `TestParameterList` and `TestListLimit` classes with multiple tests for validating parameter and list limit behaviors.
`tests/unit/example_test.py`	Updated assertions in `TestDecoding` and `TestLists` to reflect changes in how list indices are processed, changing output from lists to dictionaries.

Poem

In the land of code, where rabbits play,
New limits and checks brighten the day.
With lists and parameters now in control,
Our decoder hops on, making all whole! 🐇✨
Errors raised when limits are near,
A robust solution, let’s give a cheer!

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

codacy-production · 2024-11-23T15:21:16Z

Coverage summary from Codacy

See diff coverage on Codacy

Coverage variation	Diff coverage
✅ -0.22% (target: -1.00%)	✅ 89.66%

Coverage variation details

	Coverable lines	Covered lines	Coverage
Common ancestor commit (`fb6c6a4`)	713	690	96.77%
Head commit (`4dc4a35`)	726 (+13)	701 (+11)	96.56% (-0.22%)

Coverage variation is the difference between the coverage for the head and common ancestor commits of the pull request branch: <coverage of head commit> - <coverage of common ancestor commit>

Diff coverage details

	Coverable lines	Covered lines	Diff coverage
Pull request (#11)	29	26	89.66%

Diff coverage is the percentage of lines that are covered by tests out of the coverable lines that the pull request added or modified: <covered lines added or modified>/<coverable lines added or modified> * 100%

See your quality gate settings Change summary preferences

_{Codacy stopped sending the deprecated coverage status on June 5th, 2024. Learn more}

codecov · 2024-11-23T15:21:21Z

Codecov Report

Attention: Patch coverage is 89.65517% with 3 lines in your changes missing coverage. Please review.

Project coverage is 96.55%. Comparing base (fb6c6a4) to head (4dc4a35).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
src/qs_codec/decode.py	86.36%	3 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #11      +/-   ##
==========================================
- Coverage   96.77%   96.55%   -0.22%     
==========================================
  Files          16       16              
  Lines         713      726      +13     
==========================================
+ Hits          690      701      +11     
- Misses         23       25       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚨 Try these New Features:

Flaky Tests Detection - Detect and resolve failed and flaky tests
JS Bundle Analysis - Avoid shipping oversized bundles

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (9)

src/qs_codec/models/decode_options.py (1)
86-87: Enhance the docstring to clarify limit relationships.

While the docstring explains the basic functionality, it could be more specific about which limits trigger the error and how it interacts with both list_limit and parameter_limit.

Consider updating the docstring to:
-    """Set to ``True`` to raise an error when the input contains more parameters than the ``list_limit``."""
+    """Set to ``True`` to raise an error when parsing limits are exceeded. This affects:
+    
+    - The ``list_limit`` option: Raises when array indices exceed the limit
+    - The ``parameter_limit`` option: Raises when the number of parameters exceeds the limit"""
src/qs_codec/utils/utils.py (1)
Line range hint 17-107: Consider improving maintainability of the merge function

The merge function has grown quite complex with many nested conditions and type checks. Consider:

Breaking down the logic into smaller, well-named helper methods

Adding detailed documentation about the transformation rules

Including examples in the docstring to illustrate different merge scenarios

Example structure:
@staticmethod
def merge(
    target: t.Optional[t.Union[t.Mapping[str, t.Any], t.List[t.Any], t.Tuple]],
    source: t.Optional[t.Union[t.Mapping[str, t.Any], t.List[t.Any], t.Tuple, t.Any]],
    options: DecodeOptions = DecodeOptions(),
) -> t.Union[t.Dict[str, t.Any], t.List, t.Tuple, t.Any]:
    """Merge two objects together.
    
    Examples:
        >>> Utils.merge([1, Undefined, 3], [4, 5])
        [1, 5, 3, 4]
        
        >>> Utils.merge({"a": 1}, [1, 2])
        {"a": 1, "0": 1, "1": 2}
    """
    if source is None:
        return target
        
    if not isinstance(source, t.Mapping):
        return Utils._merge_non_mapping_source(target, source, options)
        
    return Utils._merge_mapping_source(target, source, options)

@staticmethod
def _merge_non_mapping_source(target, source, options):
    # Extract the list/tuple handling logic here
    ...

@staticmethod
def _merge_mapping_source(target, source, options):
    # Extract the mapping handling logic here
    ...
🧰 Tools

🪛 Ruff (0.7.0)

44-44: Multiple isinstance calls for el, merge into a single call

Merge isinstance calls for el

(SIM101)
src/qs_codec/decode.py (5)
54-57: Include actual list length in exception message for clarity

Including the actual number of elements in the exception message provides better context for the error and aids in debugging.

Apply this diff:
-            raise ValueError(
-                f"List limit exceeded: Only {options.list_limit} element{'' if options.list_limit == 1 else 's'} allowed in a list."
-            )
+            raise ValueError(
+                f"List limit exceeded: Attempted to process {len(split_val)} elements, but only {options.list_limit} allowed."
+            )
60-63: Enhance exception message with current list length

Including current_list_length in the exception message helps users understand at which point the limit was exceeded.

Apply this diff:
-    if options.raise_on_limit_exceeded and current_list_length >= options.list_limit:
-        raise ValueError(
-            f"List limit exceeded: Only {options.list_limit} element{'' if options.list_limit == 1 else 's'} allowed in a list."
-        )
+    if options.raise_on_limit_exceeded and current_list_length >= options.list_limit:
+        raise ValueError(
+            f"List limit exceeded: Current list length is {current_list_length}, but limit is {options.list_limit}."
+        )
82-82: Simplify condition by removing redundant check

Since limit cannot be zero (validated earlier), the or not limit part is unnecessary.

Apply this diff:
-            if (limit is None) or not limit
+            if limit is None
88-88: Simplify condition by removing redundant check

Similarly, simplify the condition for clarity and consistency.

Apply this diff:
-            if (limit is None) or not limit
+            if limit is None
155-168: Refactor index checking for efficiency

Replace the dictionary conversion with direct index checking to improve performance and readability.

Apply this diff:
-        if parent_key is not None and isinstance(val, (list, tuple)) and parent_key in dict(enumerate(val)):
+        if parent_key is not None and isinstance(val, (list, tuple)) and 0 <= parent_key < len(val):
tests/unit/decode_test.py (2)
703-706: Inconsistent Use of 'throwOnLimitExceeded' vs 'raise_on_limit_exceeded'

The test method names refer to throwOnLimitExceeded, but the actual option in DecodeOptions is raise_on_limit_exceeded. For consistency and clarity, consider updating the method names to match the option name.

Apply this diff to rename the methods:
-def test_silently_truncates_when_throwOnLimitExceeded_is_not_given(self) -> None:
+def test_silently_truncates_when_raise_on_limit_exceeded_is_not_given(self) -> None:

-def test_silently_truncates_when_parameter_limit_exceeded_without_error(self) -> None:
+def test_silently_truncates_when_parameter_limit_exceeded_without_raising(self) -> None:
Also applies to: 707-712

743-745: Ambiguous Handling of Negative 'list_limit' Values

In test_handles_negative_list_limit_correctly, setting list_limit=-1 and expecting a ValueError may not be the desired behavior. Negative list limits could lead to unexpected results or might need explicit handling in the decode function. Please ensure that negative list_limit values are appropriately handled, and update the test case if necessary.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between fb6c6a4 and d29f0ac.

📒 Files selected for processing (5)

src/qs_codec/decode.py (4 hunks)
src/qs_codec/models/decode_options.py (1 hunks)
src/qs_codec/utils/utils.py (1 hunks)
tests/unit/decode_test.py (3 hunks)
tests/unit/example_test.py (1 hunks)

🔇 Additional comments (13)

src/qs_codec/models/decode_options.py (1)

81-81: LGTM! Improved terminology consistency.

The docstring updates improve consistency with Python conventions by using "raise an error" instead of "throw an error" and using "True" instead of "true".

Also applies to: 84-84

src/qs_codec/utils/utils.py (2)

38-41: Consider optimizing the undefined check and preserving indices.

The code has a few potential improvements:

The condition any(isinstance(value, Undefined) for value in target_.values()) repeats the check that was already done in the outer if condition, which could impact performance with large arrays.
When converting to a list in the else branch, the original indices are lost which might be important for preserving array positions.

Consider this optimization:

-                    if any(isinstance(value, Undefined) for value in target_.values()):
-                        target = {str(i): target_[i] for i in target_ if not isinstance(target_[i], Undefined)}
-                    else:
-                        target = list(filter(lambda el: not isinstance(el, Undefined), target_.values()))
+                    # Reuse the result of the first undefined check
+                    filtered_items = [(i, v) for i, v in target_.items() if not isinstance(v, Undefined)]
+                    if any(isinstance(el, Undefined) for el in target):
+                        target = {str(i): v for i, v in filtered_items}
+                    else:
+                        # Preserve original indices by sorting
+                        target = [v for _, v in sorted(filtered_items)]

Additionally, let's verify the impact of converting integer keys to strings:

Line range hint 17-24: Verify integration with throwOnLimitExceeded option

The merge function accepts a DecodeOptions parameter but doesn't seem to utilize the new raise_on_limit_exceeded option. Consider whether the merge operation should respect list limits and potentially throw exceptions when enabled.

Let's check how the limit exceeded feature is implemented:

🧰 Tools

🪛 Ruff (0.7.0)

44-44: Multiple isinstance calls for el, merge into a single call

Merge isinstance calls for el

(SIM101)

tests/unit/example_test.py (1)

123-123: LGTM! Consider documenting this behavior change.

The updated test case correctly reflects the new behavior where sparse array indices are converted to dictionary keys instead of creating a sparse array. This change aligns with the PR objectives for better list limit handling.

Let's check if this behavior change is documented:

✅ Verification successful

The behavior change is already well documented

The documentation in both README.rst and docs/README.rst clearly explains that:

When indices are too large, they become dictionary keys instead of list indices
This behavior can be controlled via the list_limit option
The documentation includes examples that match the updated test case behavior

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Search for documentation about list limit handling in README or docs
fd -e md -e rst . | xargs rg -l "sparse.*list|list.*limit|array.*index"

Length of output: 100

Script:

#!/bin/bash
# Check content of README.rst and docs/README.rst for documentation about this behavior
rg -A 5 -B 5 "sparse.*list|list.*limit|array.*index" README.rst docs/README.rst

Length of output: 2949

src/qs_codec/decode.py (5)

51-51: Function signature updated appropriately

The addition of current_list_length to _parse_array_value enhances the function's ability to enforce list limits effectively.

75-77: Parameter limit validation is correctly implemented

The check ensures that the parameter_limit is a positive integer, preventing invalid configurations.

92-93: Parameter limit exceeded error handling is appropriate

The code correctly raises a ValueError when the number of parameters exceeds the set limit.

126-130: Update to _parse_array_value call ensures proper limit tracking

Passing current_list_length maintains accurate tracking of the list length during parsing.

168-168: Properly updating leaf with current_list_length

The modification ensures that the list limits are correctly enforced in recursive parsing.

tests/unit/decode_test.py (4)

181-181: Possible Logical Error in Test Assertion

In the test test_parses_a_mix_of_simple_and_explicit_lists, the assertion:

assert decode("a[1]=b&a=c", DecodeOptions(list_limit=20)) == {"a": {"1": "b", "2": "c"}}

may not produce the expected result. Given the input "a[1]=b&a=c" and a list_limit of 20, the decoded output should likely be {"a": ["b", "c"]} to maintain consistency with list parsing behavior. Please verify if the expected output is correct.

739-741: Unexpected Behavior with 'list_limit=0'

In the test test_handles_list_limit_of_zero_correctly, with list_limit=0, the decoded output is {"a": ["1", "2"]}. This seems counterintuitive because a list_limit of zero might suggest that lists should not be parsed or should be treated differently. Please verify whether the decode function behaves as intended when list_limit=0.

691-722: Comprehensive Test Coverage for Parameter Limits

The new tests in TestParameterList effectively cover various scenarios for the parameter_limit option, including cases when the limit is exceeded and how the raise_on_limit_exceeded flag affects behavior. Excellent work ensuring robust test coverage.

724-748: Thorough Testing of List Limits and Nested Structures

The tests in TestListLimit provide detailed coverage of the list_limit functionality, including edge cases like zero and negative limits, and the application of limits to nested lists. This enhances the reliability of list parsing in the decode function.

tests/unit/decode_test.py

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)

tests/unit/decode_test.py (1)
739-741: Consider enhancing zero limit test coverage.

The test for zero limit could be more comprehensive by also verifying behavior when raise_on_limit_exceeded=True.

Consider adding:
 def test_handles_list_limit_of_zero_correctly(self) -> None:
     assert decode("a[]=1&a[]=2", DecodeOptions(list_limit=0)) == {"a": ["1", "2"]}
+    # Verify behavior with raise_on_limit_exceeded=True
+    assert decode("a[]=1&a[]=2", DecodeOptions(list_limit=0, raise_on_limit_exceeded=True)) == {"a": ["1", "2"]}

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between d29f0ac and 4dc4a35.

📒 Files selected for processing (1)

tests/unit/decode_test.py (3 hunks)

🔇 Additional comments (3)

tests/unit/decode_test.py (3)

181-183: LGTM: List limit assertions are well-structured.

The assertions correctly verify the behavior of the list_limit parameter with different values and scenarios.

691-722: LGTM: Comprehensive test coverage for parameter limits.

The test class provides excellent coverage of parameter limit functionality:

Success and error cases
Silent truncation behavior
Edge cases like infinity limit
Clear and descriptive test names

724-748: LGTM: Comprehensive test coverage for list limits.

The test class provides excellent coverage of list limit functionality:

Success and error cases
List to map conversion
Edge cases (zero, negative)
Nested list handling

## 1.0.5 * [FEAT] add `DecodeOptions.raise_on_limit_exceeded` option ([#11](#11)) * [CHORE] remove dead code in `Utils` * [CHORE] add more tests * [CHORE] update dependencies

✨ decode: add throwOnLimitExceeded option

d29f0ac

techouse added the enhancement New feature or request label Nov 23, 2024

techouse self-assigned this Nov 23, 2024

techouse changed the title ~~✨ decode: add throwOnLimitExceeded option~~ ✨ decode: add raise_on_limit_exceeded option Nov 23, 2024

coderabbitai bot reviewed Nov 23, 2024

View reviewed changes

tests/unit/decode_test.py Outdated Show resolved Hide resolved

✅ fix flaky test

4dc4a35

coderabbitai bot reviewed Nov 23, 2024

View reviewed changes

techouse merged commit 465bd93 into main Nov 23, 2024
14 of 16 checks passed

techouse deleted the feat/qs-571 branch November 23, 2024 17:20

techouse added a commit that referenced this pull request Nov 24, 2024

🔖 release v1.0.5

2d4bd2f

## 1.0.5 * [FEAT] add `DecodeOptions.raise_on_limit_exceeded` option ([#11](#11)) * [CHORE] remove dead code in `Utils` * [CHORE] add more tests * [CHORE] update dependencies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ decode: add `raise_on_limit_exceeded` option #11

✨ decode: add `raise_on_limit_exceeded` option #11

techouse commented Nov 23, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 23, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

codacy-production bot commented Nov 23, 2024 •

edited

Loading

codecov bot commented Nov 23, 2024 •

edited

Loading

coderabbitai bot left a comment

coderabbitai bot left a comment

✨ decode: add raise_on_limit_exceeded option #11

✨ decode: add raise_on_limit_exceeded option #11

Conversation

techouse commented Nov 23, 2024 • edited by coderabbitai bot Loading

Description

Enhancements to list and parameter limits:

Error handling improvements:

Test case updates:

Type of change

How Has This Been Tested?

Checklist:

Summary by CodeRabbit

Release Notes

coderabbitai bot commented Nov 23, 2024 • edited Loading

Walkthrough

Changes

Poem

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

codacy-production bot commented Nov 23, 2024 • edited Loading

Coverage summary from Codacy

See diff coverage on Codacy

See your quality gate settings Change summary preferences

codecov bot commented Nov 23, 2024 • edited Loading

Codecov Report

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

✨ decode: add `raise_on_limit_exceeded` option #11

✨ decode: add `raise_on_limit_exceeded` option #11

techouse commented Nov 23, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 23, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

codacy-production bot commented Nov 23, 2024 •

edited

Loading

codecov bot commented Nov 23, 2024 •

edited

Loading