Proper support for allow_partial #147

EduardoGoulart1 · 2023-05-08T08:20:07Z

@eliotwrobson @caleb531 Please do not review this yet because the code is not ready. This is just a draft to track progress on the allow_partial support.

If you go with the definition that DFAs are sets of words, then all operations are well-defined and relatively easy to implement. Most of the time, the only required change is to replace loops like:

for symbol in input_symbols:
    tgt_a = transitions_a[symbol]

with:

for symbol in input_symbols:
    tgt_a = transitions_a.get(symbol, None)

I wrote the code so that it adds no overhead for complete DFAs compared to the current implementation. For partial DFAs, sparse automata get a large performance boost, while dense ones pay some penalty (negligible in most cases).
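To make the lookup change concrete, here is a minimal sketch of reading a word on a partial DFA stored as nested dicts. The function name `accepts` and the example automaton are illustrative assumptions, not the library's code, and the sketch assumes None is never used as a real state name.

```python
from typing import Dict, Hashable, Iterable, Optional, Set

State = Hashable
Transitions = Dict[State, Dict[str, State]]


def accepts(
    transitions: Transitions,
    initial_state: State,
    final_states: Set[State],
    word: Iterable[str],
) -> bool:
    """Return True if the (possibly partial) DFA accepts `word`."""
    current: Optional[State] = initial_state
    for symbol in word:
        # A missing entry stands for a transition into the implicit trap state.
        current = transitions[current].get(symbol)
        if current is None:
            return False  # the trap state is never accepting
    return current in final_states


# Hypothetical partial DFA over {"a", "b"} accepting exactly "a" and "ab".
partial_transitions: Transitions = {
    "q0": {"a": "q1"},
    "q1": {"b": "q2"},
    "q2": {},
}
```

For a complete DFA the `.get` call always finds an entry, so the early return never triggers and the loop behaves like the original indexed version.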

I feel like we still need to discuss a few things:

Reserve something (probably None) to represent the trap state. Basically, we add the invariant that if None is part of the set of states, then it must be a trap state. This is already implicitly assumed in _get_next_current_state and greatly simplifies the implementation logic. Because networkx does not allow nodes labeled None, we replaced it with automatically generated integers.
We could make allow_partial invisible to users. Instead of passing allow_partial as a parameter, we compute an "is_partial" flag internally. I go by the principle that we should not restrict our API unless it would lead to misuse.
Now that support for partial DFAs is there, it is straightforward to extend operations to support DFAs with different alphabets.
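As an illustration of the second point, the flag could be derived from the transition table instead of being passed in. This is only a sketch: the function name `is_partial` and the nested-dict layout are assumptions made for the example, not the merged implementation.

```python
from typing import Dict, Hashable, Set

State = Hashable


def is_partial(
    transitions: Dict[State, Dict[str, State]],
    input_symbols: Set[str],
) -> bool:
    """Return True if some state lacks a transition for some input symbol."""
    return any(
        # issubset accepts any iterable; iterating a dict yields its keys.
        not input_symbols.issubset(state_transitions)
        for state_transitions in transitions.values()
    )
```

The check is linear in the size of the transition table, so it could run once at construction time and be cached.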

Missing:

Adjust/add tests
Add benchmarks

automata/fa/dfa.py

EduardoGoulart1 · 2023-05-29T22:22:48Z

I'm currently working on finalizing this PR and I would like to hear your opinions before proceeding.

When converting from partial to complete DFAs, it is necessary to add a trap state. So far I have been using None for this purpose, but it doesn't work well with networkx.DiGraph because that library doesn't allow nodes labeled None in the graph. It is not too difficult to work around this problem for the DFA class, but it becomes problematic when calling NFA.to_dfa (or other automata types), as this invariant would need to propagate throughout the code. So I need another approach to generate new trap states.

The only solution I can think of that does not require significant refactoring is to assign the highest negative integer that isn't already a state. This solution works well, but sometimes states are strings or frozensets, which leads to a typing mismatch that isn't directly caused by the user. To some extent, this can be mitigated on our side, but I don't think that we can completely avoid it.
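A sketch of that naming rule, assuming states are arbitrary hashable values. The helper name `fresh_trap_state` is hypothetical and only illustrates the idea.

```python
from typing import Hashable, Iterable


def fresh_trap_state(states: Iterable[Hashable]) -> int:
    """Return the highest negative integer that is not already a state."""
    taken = set(states)
    candidate = -1
    while candidate in taken:
        candidate -= 1  # -1 is taken, so try -2, then -3, and so on
    return candidate
```

The result is always an int, which is where the mixed state types come from when the existing states are strings or frozensets.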

The more radical solution would be to make all DFAs partial. That would simplify a lot of the code, but would cost some performance for "dense" DFAs.

caleb531 · 2023-05-29T22:26:36Z

@EduardoGoulart1 Regarding mixed state types, that's a fair question. @eliotwrobson I think we've updated the library to handle, for example, states of mixed types, correct? I believe some existing methods (maybe DFA.minify?) produce machines with such mixed state types.

If this is the case, then I'm fine with the highest negative integer number for the trap state name.

eliotwrobson · 2023-05-29T22:31:07Z

@caleb531 yes, the library can handle mixed state types, and there are a couple of places this is used I think (see from_finite_language). The type annotations are also compatible with mixed-type states. @EduardoGoulart1 I think that's a fine convention, as there are a few places where we start assigning integer state names starting from 0 (in the regex code I believe).

coveralls · 2023-06-04T21:13:41Z

coverage: 99.812% (-0.07%) from 99.883% when pulling 125d244 on EduardoGoulart1:proper_support_allow_partial into f1c3674 on caleb531:develop.

EduardoGoulart1 · 2023-06-04T21:49:59Z

Also, adapting the tests is fairly easy, but some tests hard-code the expected result, which does not make sense to me and complicates reusing them for testing partial DFAs. For instance, the test below (test_union) explicitly tests the union operator by constructing the DFA representing the result.

Is there a specific reason for that? I would have expected the test to verify the operator's algebraic properties, for example something like A \subseteq (A \cup B) && B \subseteq (A \cup B) && (A \cup B) - A - B == \emptyset. If there is no such reason, then I will proceed to update such tests (this will also remove many lines of code).
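A property-style check along those lines can be sketched without the library. Everything below (the tuple layout, `union`, `union_matches_definition`) is a hypothetical stand-in written for this example, and it only compares languages on words up to a bounded length, so it is a sanity check and not a proof of equivalence.

```python
from itertools import product
from typing import Any, Dict, Iterator, Set, Tuple

# (transitions, initial state, final states); missing entries mean "trap".
DFA = Tuple[Dict[Any, Dict[str, Any]], Any, Set[Any]]


def accepts(dfa: DFA, word: str) -> bool:
    transitions, state, finals = dfa
    for symbol in word:
        state = transitions.get(state, {}).get(symbol)
        if state is None:
            return False
    return state in finals


def union(a: DFA, b: DFA, alphabet: str) -> DFA:
    """Product construction; None marks a side that has fallen into the trap."""
    (ta, ia, fa), (tb, ib, fb) = a, b
    initial = (ia, ib)
    transitions: Dict[Any, Dict[str, Any]] = {}
    finals: Set[Any] = set()
    stack = [initial]
    while stack:
        pair = stack.pop()
        if pair in transitions:
            continue
        sa, sb = pair
        transitions[pair] = {}
        if sa in fa or sb in fb:
            finals.add(pair)
        for symbol in alphabet:
            na = ta.get(sa, {}).get(symbol)
            nb = tb.get(sb, {}).get(symbol)
            if na is None and nb is None:
                continue  # both sides trapped: leave the transition undefined
            transitions[pair][symbol] = (na, nb)
            stack.append((na, nb))
    return transitions, initial, finals


def words(alphabet: str, max_len: int) -> Iterator[str]:
    for length in range(max_len + 1):
        for letters in product(alphabet, repeat=length):
            yield "".join(letters)


def union_matches_definition(a: DFA, b: DFA, alphabet: str, max_len: int = 6) -> bool:
    """Check w in (A | B) iff w in A or w in B, for all words up to max_len."""
    u = union(a, b, alphabet)
    return all(
        accepts(u, w) == (accepts(a, w) or accepts(b, w))
        for w in words(alphabet, max_len)
    )


# Two partial DFAs: A accepts {"a", "ab"}, B accepts a*b.
dfa_a: DFA = ({"q0": {"a": "q1"}, "q1": {"b": "q2"}, "q2": {}}, "q0", {"q1", "q2"})
dfa_b: DFA = ({"p0": {"a": "p0", "b": "p1"}, "p1": {}}, "p0", {"p1"})
```

The single equivalence covers all three stated properties at once: both inclusions into the union, and the union containing nothing outside A and B.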

eliotwrobson · 2023-06-04T22:19:44Z

@EduardoGoulart1 the reason for these is to verify the state names and types of the output (which when names are retained, is part of the API). These test cases should be kept with their hard-coded output. There actually is a test case testing for algebraic properties separately.

I'm not sure about the best way to test with allow partial for these. Is there a way to use parameterized test cases (like in pytest) with nose? The DFAs in those test cases are not partial anyway, so I don't think they can really be meaningfully adapted to test for the new behavior in this PR.

caleb531 · 2023-06-04T22:46:41Z

@eliotwrobson @EduardoGoulart1 It does look like nose2 supports test parameterization! Although since DFAs can be verbose to construct, I wonder if the decorator signature would get pretty large. But please feel free to play around to find something practical and maintainable:

https://docs.nose2.io/en/latest/params.html
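For comparison, the same "run one test over several DFA variants" idea can be sketched with the standard library's unittest.subTest, swapped in here for nose2's params decorator so that the example runs without extra dependencies. The helper names and the small automaton are assumptions for illustration only.

```python
import unittest


def strip_trap(transitions, trap_state):
    """Return a partial copy of `transitions` with the trap state removed."""
    return {
        state: {s: t for s, t in outgoing.items() if t != trap_state}
        for state, outgoing in transitions.items()
        if state != trap_state
    }


def accepts(transitions, initial_state, final_states, word):
    current = initial_state
    for symbol in word:
        current = transitions[current].get(symbol)
        if current is None:
            return False
    return current in final_states


# Complete DFA over {"a", "b"} accepting only the word "a".
COMPLETE = {
    "q0": {"a": "q1", "b": "trap"},
    "q1": {"a": "trap", "b": "trap"},
    "trap": {"a": "trap", "b": "trap"},
}


class TestAcceptance(unittest.TestCase):
    def test_complete_and_partial_agree(self):
        variants = {"complete": COMPLETE, "partial": strip_trap(COMPLETE, "trap")}
        cases = [("a", True), ("", False), ("ab", False), ("b", False)]
        for name, transitions in variants.items():
            for word, expected in cases:
                # Each (variant, word) pair is reported as its own sub-test.
                with self.subTest(variant=name, word=word):
                    self.assertEqual(
                        accepts(transitions, "q0", {"q1"}, word), expected
                    )
```

Building the variants inside the test keeps the verbose DFA definitions out of any decorator signature, which was the concern raised above.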

EduardoGoulart1 · 2023-06-05T08:03:55Z

@caleb531 @eliotwrobson Sure, my initial idea was to extend the tests to use dfa.as_partial(). The only place where this does not work is in such hard-coded tests. The problem is that we do not expand all the states. But I will try my best with these constraints in mind :D

eliotwrobson · 2023-06-06T01:04:21Z

@EduardoGoulart1 just a heads up that, because of the size of the refactor and some weirdness that was uncovered, I want to wait until #129 is merged before merging this. I fixed the last major blocker there, so hopefully that will happen soon.

Even if a state in a partial DFA has no outgoing transitions, it still needs to have a state function (i.e. an empty dict). Also added f-strings to validation messages.
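The rule in that commit message can be sketched as a validation pass. The function name is hypothetical, and ValueError stands in for whatever exception classes the library actually raises.

```python
from typing import Dict, Hashable, Set

State = Hashable


def validate_partial_transitions(
    states: Set[State],
    input_symbols: Set[str],
    transitions: Dict[State, Dict[str, State]],
) -> None:
    """Raise ValueError if a partial DFA's transition table is malformed."""
    for state in states:
        # Every state needs an entry, even if it is just an empty dict.
        if state not in transitions:
            raise ValueError(f"state {state!r} is missing a transition dict")
    for state, outgoing in transitions.items():
        for symbol, target in outgoing.items():
            if symbol not in input_symbols:
                raise ValueError(
                    f"state {state!r} has invalid transition symbol {symbol!r}"
                )
            if target not in states:
                raise ValueError(
                    f"transition on {symbol!r} from {state!r} "
                    f"leads to invalid state {target!r}"
                )
```

Requiring the empty dict means code such as `transitions[state].get(symbol)` never has to guard the outer lookup.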

caleb531 · 2023-06-14T19:49:16Z

@EduardoGoulart1 @eliotwrobson Just merged #129! I'm working on resolving the merge conflicts now.

The semantics is defined such that if no transition is defined for a certain symbol on a given state, we act as if there would be a transition there leading to a trap state. Reserve None to denote trap states. The code is implemented such that there is little or no extra overhead for complete DFAs. For partial DFAs, the code is implemented to perform well with sparse DFAs. For most functions, the code will also perform well with dense partial DFAs.

automata/fa/dfa.py

eliotwrobson · 2023-06-17T23:20:47Z

@EduardoGoulart1 Some merge conflicts came up when #152 was merged. I've resolved those and gotten the lint check passing in the latest push (with some minor style changes and optimizations). Please pull when you get the chance.

eliotwrobson · 2023-07-16T21:54:08Z

Would it be possible for you guys to work through the outstanding comment threads? I would personally prefer to review this PR only once these comment threads have been resolved, and I don't have anything more to say on the ones that are still open.

@caleb531 sounds good, I'm resolving some now and the remaining threads are fairly short.

eliotwrobson · 2023-07-16T23:06:18Z

@caleb531 All items are resolved and all TODOs have been removed. The only thing left is docs changes, but I'll leave that until after your review. Thanks!

automata/fa/dfa.py

caleb531

@EduardoGoulart1 @eliotwrobson Left a few requested changes. Nothing major (at least from my perspective 😅).

eliotwrobson

@caleb531 Resolved all threads from your comments! If those were all the items you had questions about, feel free to merge when ready!

caleb531 · 2023-07-18T02:59:12Z

@EduardoGoulart1 @eliotwrobson This looks good to me! Will merge now.

Thank you both for all the work on this PR.

eliotwrobson reviewed May 8, 2023


eliotwrobson mentioned this pull request May 11, 2023

v8 Caching Behavior #148

Closed

caleb531 mentioned this pull request Jun 14, 2023

Added jupyter notebook integration and new visualization #129

Merged

caleb531 added this to the v9 milestone Jun 14, 2023

Require partial dfas to have all state transitions

43f377c

Even if a state in a partial DFA has no outgoing transitions, it still needs to have a state function (i.e. an empty dict). Also added f-strings to validation messages.

EduardoGoulart1 added 4 commits June 14, 2023 12:51

Account for unreachable nodes when building the graph

9e6b6a0

Changed trap states from None to integers

952b1d3

Fix failing test

a48075d

caleb531 force-pushed the proper_support_allow_partial branch from 3d92e07 to a48075d Compare June 14, 2023 19:51

Indicate that FA.show_diagram() supports partial DFA

146416a

eliotwrobson reviewed Jun 14, 2023


eliotwrobson added 7 commits June 17, 2023 17:51

Merge branch 'develop' into proper_support_allow_partial

43c9bc8

Update dfa.py

7123f7e

Update dfa.py

38df765

Update dfa.py

9ed1822

Update dfa.py

a606cd5

Update dfa.py

bf7f8e8

Update dfa.py

344ac14

eliotwrobson added 13 commits July 16, 2023 16:56

Started responding to comments

85d8be7

Update dfa.py

b7fa44d

Simplify from_nfa return type

54006bf

Update test_dfa.py

6183f5a

Update test_dfa.py

7319802

Removed comments

2a72416

Added test

c373e70

Simplified test case

626889f

Added more test cases

ef8978f

Switch default and simplify function call

87c9bfd

Updated test cases and edge case failures

d856b68

Change parameter name to be consistent

e2e6ca9

Change a default

6a00cb6

caleb531 reviewed Jul 17, 2023


caleb531 reviewed Jul 18, 2023


caleb531 reviewed Jul 18, 2023


caleb531 reviewed Jul 18, 2023


caleb531 requested changes Jul 18, 2023


eliotwrobson added 5 commits July 17, 2023 20:55

Minor changes

d598202

Update dfa.py

6cf3af2

Simplified type and added keyword arguments

070ff32

Updated documentation

4700831

Update class-dfa.md

125d244

caleb531 approved these changes Jul 18, 2023


eliotwrobson approved these changes Jul 18, 2023


caleb531 merged commit 832bd5d into caleb531:develop Jul 18, 2023
5 checks passed

eliotwrobson mentioned this pull request Jul 18, 2023

Fix the allow_partial behavior, or deprecate it #126

Closed
