Skip to content

Diff, patch, merge, and synchronize JSON documents with an Automerge-compatible interface

Notifications You must be signed in to change notification settings

frameable/pigeon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pigeon

Diff, patch, merge, and synchronize JSON documents with an Automerge-compatible interface

const Pigeon = require('@frameable/pigeon')

// initialize our document from an object literal
let doc1 = Pigeon.from({
  cards: [
    { id: 1, title: 'Rewrite everything in Clojure', done: false },
    { id: 2, title: 'Rewrite everything in Haskell', done: false },
  ]
})

// make a clone of our document
let doc2 = Pigeon.from(doc1)

// one user deletes the clojure card
doc1 = Pigeon.change(doc1, doc => doc.cards.splice(0, 1))

// meanwhile another user sets the haskell card to done
doc2 = Pigeon.change(doc2, doc => doc.cards[1].done = true)

// we merge the documents together in any order
const merged = Pigeon.merge(doc1, doc2)

// all is well with updates merged together
assert.deepEqual(merged, {
  cards: [
    { id: 2, title: 'Rewrite everything in Haskell', done: true },
  ]
})

Differences from Automerge

Pigeon keeps a near fully-compatible interface to Automerge, but the underlying implementation is optimized for a different use case, and makes different trade-offs. While Automerge optimizes for working offline and merging changes periodically, Pigeon is optimized for online real-time collaboration.

  • By default, history will grow only to 1000 items in length, after which oldest entries will be jettisoned
  • Because of the above, performance is much improved for larger docs with more changes
  • Changes are computed across entire data structures, rather than tracing via proxies
  • Documents need not have a direct common ancestor for patches from one to apply to another
  • Unix timestamps and client ids are used instead of vector clocks to ensure order and determinism
  • Change sets use JSON-Patch-esque paths, and so are more easily introspectable using existing tools
  • Objects should have unique identifiers in order to preserve semantic integrity
  • Changes may be made in-place for situations where performance is critical

Installation

npm install @frameable/pigeon

API

newDoc = Pigeon.from(data, cid=_cid)

Create a document from an array or object.

newDoc = Pigeon.clone(doc)

Clone a document.

aliasDoc = Pigeon.alias(doc)

Make an alias to an existing doc; analogous to a hardlink.

changes = Pigeon.getChanges(left, right)

Get the set of changes that would transform left into right.

newDoc = Pigeon.rewindChanges(doc, ts, cid)

Roll back the document state back to the given timestamp.

newDoc = Pigeon.fastForwardChanges(doc)

Roll forward the document state up to the head.

newDoc = Pigeon.applyChanges(doc, changes)

Clone the given document to a new document and apply changes to the new document.

Pigeon.applyChangesInPlace(doc, changes)

Apply given changes to the document in-place.

newDoc = Pigeon.change(doc, fn)

Change the document according to the given function, which receivs the document as a parameter.

doc = Pigeon.from({ message: 'hello' })
newDoc = Pigeon.change(doc, d => d.message = 'hey there')
changes = Pigeon.getChanges(doc, newDoc)

changes = Pigeon.getHistory(doc)

Get all of the changes to recreate the document from scratch.

newDoc = Pigeon.load(str)

Load the document from its serialized form.

str = Pigeon.save(doc)

Serialize the document to be loaded later.

Pigeon.configuire(options)

Set configuration options.

Pigeon.configure({
  strict: true,
  getObjectId: x => x.id || x._id || x.uuid || x.slug,
  getTimestamp: Date.now,
})
strict

In order to preserve semantic integrity, any objects which are items in arrays should contain identifier properties named id, _id, uuid, or slug, as in the example above. When objects have identifier properties, change sets will be keyed by those identifiers, and all will be well. When strict is truthy, an error will be thrown if we try to compare objects with no identifier properties. When strict is falsy however, changes will be keyed by array indexes as a best effort only, and so property changes may or may not be robust to changes in array item order. Defaults to true.

getObjectId

Callback to return an identifier value, given an object. By default object identifiers will be sought as shown above, but if your data uses different properties for unique identifiers, you may supply an alternate function for retrieving them.

getTimestamp

Callback to return a unix timestamp. This defaults to Date.now, which may be good enough in many cases. If the client's clock is set by ntpd for example, all will be probably well enough. However, for more precise ordering of operations, you may wish to provide your own function which would periodically sync the server time to the client and take network latency into account to provide a more accurate timestamp.

More on timestamps

Regardless of the accuracy of clients' clocks, clients will always end up with the same state as each other, given the same documents and changes to be applied, even if the changes arrive out of order.

Each time a client changes a document, internally, the change gets decorated with a client timestamp. When we merge documents, or apply change sets, the document is rewound to just before the earliest change to be applied, and changes are played forward in order.

So, for example, if one client's clock is a few seconds slower than another, if both clients change a value at about the same time, when the changes get merged, the first client's timestamp will be later, and so both clients will apply the first client's change last, and both clients will end up with exactly the same state.

Operating directly on JSON objects

Pigeon also exposes methods to diff and patch JSON objects:

const { diff, patch } = require('@frameable/pigeon')

const a1 = [
  { id: 3920, name: 'Chicago', population: 5239412 },
  { id: 3977, name: 'Boston', population: 1032943 },
]

const a2 = [
  { id: 3920, name: 'Chicago', population: 5239412 },
  { id: 3977, name: 'Boston', population: 1032997 },
]

const [ changes ] = diff(a1, a2);

assert.deepEqual(
  changes,
  { op: 'replace', path: '/[3977]/population', value: 1032997, _prev: 1032943 },
)

patch(a1, changes)
assert.deepEqual(a1, a2)

changes = Pigeon.diff(left, right)

Compares data structures and returns changes required to make left's content equal to right's. The format of the returned changes is based on RFC 6902, with the modification that path components which are array indexes, if they refer to an object, may take the form [<id>] where <id> is the value of a property meant to uniquely identify that object, with a property named id, _id, uuid, or slug.

left = Pigeon.patch(left, changes)

Applies changes to the given data structure, making modifications in-place.

About

Diff, patch, merge, and synchronize JSON documents with an Automerge-compatible interface

Resources

Stars

Watchers

Forks

Packages

No packages published