# Day 23: Amphipod

Haskell: Part 1 (06:05:52, rank 4527), Part 2 (15:25:39, rank 5111)

A new record in exasperation and desperation.

## Part 1

I was really stumped by this for a little while, and decided to try tackling it as a game-tree search problem using iterative deepening search (IDS). It took a really, really long time just to express the constraints correctly, so for several hours I was getting either too few neighbouring nodes or too many.

Painful, but eventually it got the job done, just about.

## Part 2

This is where the panic and confusion started to set in, and as I got more and more tired it became even harder to see the wood for the trees.

At first I thought it might be possible to use the trick of mapping the state to a fixed-size integer, which could then be used as an index into an array or table for running Dijkstra's algorithm (which I'd already used on day 15). However, this method would require a lot of storage if placed in a flat array: there are 27 slots that can hold an amphipod, and each slot can be in one of 5 states (A, B, C, D or empty). At 3 bits per slot, that means 27 * 3 == 81 bits to represent the game state, which equates to 2^81 == 2417851639229258349412352 different "slots". You could store them with a modulo index, like a hashtable, but then you might get collisions with unexpected results; or if you add them to a proper hashtable with linear chaining, you'd still have the problem of potentially storing zillions of nodes ...or so I thought (insert echoey diminished arpeggio or other suitable foreshadowing music).

I abandoned the idea early on, but after hours -- and I mean many hours -- of slogging with IDS and trying to come up with clever heuristics for pruning the state space, it was clearly not going to work that way. At some point I killed the program while it was searching depth 22 on the sample input; it had been running for just over an hour. My gut feeling was that something like IDS was needed because of its very low space complexity, but it wasn't working out.
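For scale, here's that arithmetic as a snippet (the cell count comes straight from the part 2 layout: 11 hallway cells plus four rooms of depth 4):

```haskell
-- Rough size of the flat-array idea, matching the estimate above.
slots, bitsPerSlot, totalBits :: Integer
slots       = 27          -- 11 hallway cells + 4 rooms * 4 cells
bitsPerSlot = 3           -- smallest bit width covering 5 states (A-D or empty)
totalBits   = slots * bitsPerSlot   -- 81

distinctKeys :: Integer
distinctKeys = 2 ^ totalBits        -- 2417851639229258349412352
```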

Eventually I checked the Reddit solutions thread (for the second time this year) and saw that people were actually succeeding with Dijkstra. Thinking about it again, I figured that Dijkstra visits "useful" parts of the game tree first, so it won't get stuck exploring crappy dead ends the way DFS will. So I tried re-implementing Dijkstra with a mutable distance array, indexed by a bit-shifted encoding of each legal cell (you can't stop in the spaces outside the rooms, so I skipped those in the encoding), taken modulo a fixed value (I tried 2^24 up to 2^28 or so) to keep it within fixed bounds.
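A minimal sketch of that kind of encoding, assuming the grid has already been flattened into a fixed-order list of cell codes; all names here are illustrative, not my actual solution code:

```haskell
import Data.Bits (shiftL, (.|.))
import Data.List (foldl')

-- One 3-bit code per legal cell: 0 = empty, 1..4 = A..D. The input list
-- would be the stoppable cells read from the grid in a fixed order
-- (the cells outside the rooms are skipped, as described above).
encodeState :: [Int] -> Integer
encodeState = foldl' (\acc c -> (acc `shiftL` 3) .|. fromIntegral c) 0

-- Fold the huge key into a fixed-size table index, hashtable-style.
-- Distinct states can collide here, which is exactly the risk noted earlier.
tableIndex :: Integer -> Int
tableIndex key = fromIntegral (key `mod` (2 ^ 26))  -- within the 2^24..2^28 range tried
```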

The ST approach with a flat array and state bit-hashing was a total failure for some reason, so eventually I tried again with a plain old immutable Data.Map Grid Int, which allows querying the distance table directly with each discovered game state. I had almost no hope at this point, but was amazed when it completed on the full input in about 10 seconds.
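In type terms, the version that worked boils down to something like this (Grid here is a stand-in for the real game-state type, which just needs an Ord instance to serve as a map key):

```haskell
import qualified Data.Map.Strict as Map

-- Stand-in for the real game-state type; any Ord instance will do.
newtype Grid = Grid String deriving (Eq, Ord, Show)

-- Immutable distance table: no pre-sizing, no bit-hashing, no collisions.
type DistMap = Map.Map Grid Int

-- Unvisited states simply aren't in the map yet.
distTo :: Grid -> DistMap -> Int
distTo g = Map.findWithDefault maxBound g
```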

## Reflections

Not sure what to say, really. I felt a mixture of elation, relief and annoyance at having overlooked the right solution. You could say I mentally did a depth-first search when I should have stepped back sooner and thought more about the bigger picture. After several earlier puzzles taught me surprising lessons about the performance benefits of mutability, I developed a blind spot: I forgot how much the branching factor matters for search algorithms like IDS/DFS, and failed to appreciate the benefits of a smarter search strategy. All I could think was "but I can't store an unbounded number of distance values", and it took many hours before I realised what should have been obvious:

1. Due to the nature of Dijkstra's algorithm, I can expect it to find a solution for a problem like this in a reasonable number of steps.
2. Since it looks at a fixed number of "neighbouring nodes" -- game states reachable from the current state -- and completes in a reasonable number of steps, it stands to reason that the storage requirements are bounded if you store values lazily (distances in the lookup table, and neighbour nodes in the priority queue). There's a sketch of this after the list.
   - Most learning resources I consulted focus on the static, finite graph case, and describe an initialisation step where you pre-allocate a complete table of distances for all nodes, then insert every node into the priority queue with an initial (very high) distance value. Here, though, you don't even know how many nodes there are, since you discover them while evaluating other nodes starting from the initial game state. The answer is to start with both structures holding a single entry for the initial game state with a cost of zero, then add to or update both structures whenever new nodes are discovered, or previously-seen nodes are found at a lower distance. Eventually the priority queue runs dry, because no new reachable node can be reached for less than the cost of the optimal solution you've already discovered (unless you never discover any solved states, in which case you will run out of memory). If a solved state was ever reached, the minimum cost to arrive there will be its value in the distance table.
   - What does this mean for games with an infinite number of solutions? That's probably a good question, and I don't know the answer. Maybe you could accumulate a list of solved states seen so far, or just keep one and replace it whenever a cheaper one is encountered.
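Here's that lazy, discover-as-you-go Dijkstra as a self-contained sketch. It assumes an Ord-able state type, a puzzle-specific neighbours function yielding (state, cost) moves, and a goal test; Data.Set stands in for a real priority queue, and none of these names come from my actual solution:

```haskell
import Data.List (foldl')
import qualified Data.Map.Strict as Map
import qualified Data.Set as Set

-- Lazily-grown Dijkstra over an implicit graph: the distance table and
-- the frontier both start with just the initial state, and new states
-- enter the structures only when a move first discovers them.
dijkstra :: Ord s => (s -> [(s, Int)]) -> (s -> Bool) -> s -> Maybe Int
dijkstra neighbours solved start =
  go (Map.singleton start 0) (Set.singleton (0, start))
  where
    go dist frontier = case Set.minView frontier of
      Nothing -> Nothing                 -- frontier exhausted, no goal found
      Just ((d, s), rest)
        | solved s  -> Just d            -- first goal popped is optimal
        | d > Map.findWithDefault maxBound s dist ->
            go dist rest                 -- stale queue entry, skip it
        | otherwise ->
            let relax (dm, fr) (s', w)
                  | d + w < Map.findWithDefault maxBound s' dm =
                      (Map.insert s' (d + w) dm, Set.insert (d + w, s') fr)
                  | otherwise = (dm, fr)
            in uncurry go (foldl' relax (dist, rest) (neighbours s))
```

The stale-entry check replaces the decrease-key operation a textbook priority queue would provide: rather than updating an entry in place, you insert a cheaper duplicate and discard the stale one when it surfaces.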

Overall, today served up some hard but valuable lessons, but how much will I remember for next time? There's only one way to find out...