Lessons learned from building a sync-engine and reactivity system with SQLite
FWIW, I think you might be better off with immutable rows and lamport clocks.
Everything is a full new row because it is “a message” including read receipts. Some messages like read receipts just don’t render in the chat.
Edits can work the same way by rendering over a previous message, even though the local and remote DB have multiple rows for the original and edited messages.
Nice article!
An interesting tool that matches the requirements mentioned in the article is Evolu[0]
It's a sync engine with e2e encryption based on SQLite.
The local-first landscape is quite wide now, and there is probably a solution ready for all kind of needs[1]
Building a sync engine can be a nice learning experience, but for production software it's better to pick something that has already faced and resolved all the weird edge cases you get when building a sync engine and persistent storage on the browser.
We've spend the last couple of years building a local-first end-to-end encrypted multiplayer notes app (Thymer). We also explored running sqlite in the browser but ultimately concluded that we didn't really sqlite or a real database at all in the browser.
What does a document system need? A few simple indexes that track links and backlinks, mentions, and hashtags. You’re broadcasting change notifications anyway (for reactive DOM updates), so updating the indexes for those manually isn’t much extra work. You apply updates eagerly and then notify other users so they can apply the same updates, with some rules to guarantee that all users end up with the same state regardless of the order in which they receive the updates. But a relational database doesn’t help with any of this. Document systems tend to be versioned, so every user action turns into another entry in a transaction log. Even queries like “last Monday’s version of my document” don’t map naturally to SQL. You can query for transactions in a given time period, but unless you have snapshots, you’re still forced to re-apply transactions all the way from t=0 if you want to re-create the document state at a given date.
I love the timing of how, over the past week or so I've been digging into the ecosystem and options around this, and now here's another article!
Great write-up! You should check out SQLite-Sync (https://github.com/sqliteai/sqlite-sync) - it might be exactly what you're looking for. There's also a WASM build (https://www.npmjs.com/package/@sqliteai/sqlite-sync-wasm)
This is great to see and I like the simplicity of the approach. You can also look at PowerSync (which I work on). It's in a similar space as ElectricSQL. It syncs to SQLite on the client-side and provides built-in reactivity. On the web, it uses wa-sqlite with either OPFS or IndexedDB. It also takes care of things like multi-tab on web, and queueing and uploading client-side mutations to the backend.
> And because Electric syncs every change granularly, you are certain that the state of your local database is exactly the same as the server's.
I don't see how this certainty follows from "granularity" (whatever that means in this context). I believe to have such a certainty one would need the synchronization to happen within a single transaction that spans both client and server databases.
> Selecting a sync engine is a lot about knowing the constraints of your application.
Not so sure about this. These seem more like fundamentals than sliding scales.
> How many people will concurrently edit the same resources?
More than 1.
> How write-heavy is it?
Write-heavy enough that you'll encounter an unexpected write between two reads.
> Can you expect unreliable connections or offline usage?
Yes.
CAP with Electron and SQLite is no different from CAP with Tauri and MySQL.
It feels like everyone is solving the sync problem from a slightly different angle. Has anyone here used a few of these in production?
I must be a bit dense on this little Sunday morning but I don't quite get the problem solved.
Is it an issue with reactivity between browser tabs? Hence the use of the Broadcast Channel API?
The reactivity being implemented in svelte, what is different from any normal app? could have been react with mobx, zustand, recoil, etc but that's not too relevant. This is just how frontend frameworks are nowadays.
Especially since it is single-user (player? sic)?
Syncing between the backend server (remote state) and the local sqlite instance (local state) shouldn't really require much work? A simple websocket connection could do the trick to push from the remote server to the local device and then push the changes to sqlite again. But that's not too surprising to me.
I must not have understood something...
Nice write-up! Would love to read more about practical experience on sync engines. I am also currently building one for a niche use case: a CRDT based smart home system w/o central component: syncing config and state across several embedded/MCU + mobile via CAN bus and BLE, no Internet. It features a simple ORM and persistence and for dev/debug it runs on macos/Linux. Written in C++ and Dart. Due to memory and other platform constraints I chose to build most of it from scratch. Thought about open sourcing or writing about it, but wondered whether anybody cares given the relative obscure tech stack.
Amazing write-up! I’m thrilled to see your exploration of a reactive SQLite sync engine; it resonates deeply with my own journey toward Local‑First, Offline‑Resilient tools aimed at relieving pressure, not just impressing. Keeping all user data local in SQLite while enabling seamless, encrypted syncing and reactivity. A nice segue in pragmatic autonomy.
It mirrors the moment I embraced tools like Obsidian; designing for real-world constraints; valuing simplicity, privacy, and functionality; especially in low‑connectivity environments.
PS: I was hooked on ...SQLite
Bravo!
Do you cache data on the client and write buffer it on the way back to the server? I made a key value store as a hobby project, that also kind of streamed the data and it was convenient to do that .
Out of curiosity, would CouchDB not be a contender for this type of application?
very cool! have you tried Tanstack DB and ElectricSQL?
performance degradation observed using the first approach at high concurrency recently discussed here: https://news.ycombinator.com/item?id=44490510
Pretty cool! SQLite fits so well for local-first apps that feels like cheating.
Out of curiosity, apart from the lack of LISTEN support what more did you miss in SQLite itself and its ecosystem?
very cool stuff!
[dead]
Ok is this about a reactive app with with a local database automatically synced to a remote db? All fully encrypted (at rest and in transit)? I thought this is what everyone does nowadays! We built an app like this in 2019 - yes - it was a bit of a challenge with the encryption but the „syncing data“-part is what every litte multiplayer-game has to deal with like forever now. Seems i‘m out of touch with the current state of affairs. Nice article though!
This is a really great write up!
I work at Electric and started the PGlite and now Tanstack DB projects. The issues mentioned with PGlite are one of the major motivating factors behind Tanstack DB. We are taking those learnings and building, what we believe, is the missing client side datastore that is "sync native" and completely backend agnostic. Also being JS, rather than WASM, solves many of the slower than ideal query semantics, and has enabled us to build an incremental query engine for it.
It's also important to note that Electric doesn't require PGlite on the client, far from it - it's essentially a "protocol first" sync engine, you can use it to write into and maintain any client side store.
This solution by the OP, diffing based of modified data is ideal for a huge number of apps, and something that we intend to built into Tanstack DB so you can easily sync with no additional infrastructure.
SQLite (or PGlite) in the browser is awesome, and has the advantage over Tanstack DB at the moment of having persistence (it's on our roadmap), but they are also somewhat chunky downloads. For many local-first apps that's not a problem though.