From c7c5502c9d48596a19ce95a3a307eff1d4ad3d23 Mon Sep 17 00:00:00 2001 From: Kevin Hahn Date: Mon, 10 Jun 2024 09:34:46 +0700 Subject: [PATCH 1/4] Update README.md --- README.md | 132 +++++++++++++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 131 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index e7a20e6..e9d27dd 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,133 @@ # harmony -A CRDT library for C# \ No newline at end of file +A CRDT application library for C#, use it to build offline first applications. + +## Install + +```sh +dotnet add package SIL.Harmony +``` + +It's expected that you use Harmony with the .Net IoC container ([IoC intro](https://learn.microsoft.com/en-us/dotnet/core/extensions/dependency-injection)) and with EF Core. If you're not familier with that you can take a look at the [Host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) docs. If you're using ASP.NET Core you already have this setup for you. + +Prerequsits: +* Setup EF Core in your application ([Getting started docs](https://learn.microsoft.com/en-us/ef/core/get-started/overview/first-app?tabs=netcore-cli)) +* Setup a Host, the default host setup for ASP.NET Core will work, or a [generic host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) for desktop apps, depending on your app. Alterativly you could create a [`ServiceCollection`](https://learn.microsoft.com/en-us/dotnet/api/microsoft.extensions.dependencyinjection.servicecollection?view=net-8.0). + +### Configure Db context +EF Core needs to be told about the entities used by Harmony, for now these are just [`Commit`](src/Crdt/Commit.cs), [`Snapshot`](src/Crdt/Db/ObjectSnapshot.cs), and [`ChangeEntitiy`](Crdt.Core/ChangeEntity.cs) +```C# +public class AppDbContext: DbContext { + protected override void OnModelCreating(ModelBuilder modelBuilder) + { + modelBuilder.UseCrdt(crdtConfig.Value); + } +} +``` + + +> [!TIP] +> [`SampleDbContext`](src/Crdt.Sample/SampleDbContext.cs) has a full example of how to setup the DbContext. + + +### Register CRDT services +Harmony provides the [`DataModel`](src/Crdt/DataModel.cs) class as the main way the application will interact with the CRDT model. You first need to register it with the IoC container. +```C# +var builder = Host.CreateApplicationBuilder(args); +builder.Service.AddCrdtData(config => {}); +``` +> [!NOTE] +> the config callback passed into `AddCrdtData` is currently empty, we'll come back to that later. + +> [!TIP] +> Pay attention to the generic type when calling `AddCrdtData`, this will be the type of your applications DB Context. + +### Define CRDT objects +now that you have the services setup, you need to define a CRDT object. Take a look the following examples +* [`Word`](src/Crdt.Sample/Models/Word.cs) notice this contains a reference to an Antonym Word. +* [`Definition`](src/Crdt.Sample/Models/Definition.cs) references the Word it belongs to, notice that if the Word Reference is removed, the Definition deletes itself. +* [`Example`](src/Crdt.Sample/Models/Example.cs) this one is special because it uses a YDoc to store the example text in a [Yjs](https://github.com/yjs/yjs) compatible format. This allows the example sentence to be edited by multiple users and have those changes merged using the yjs CRDT algorithm. + +Once you have created your CRDT object, you need to tell Harmony about it. Update the config callback passed into `AddCrdtData` +```C# +services.AddCrdtData(config => +{ +// add the following lines + config.ObjectTypeListBuilder + .Add() + .Add() + .Add(); +}); +``` + +### Define CRDT Changes +Now that we've defined our objects, we need to define our changes which will record user intent when making changes to objects. How detailed and specific you make your changes will directly impact how changes get merged between clients and how often users 'lose' changes that they made. + +Example [`SetWordTextChange`](src/Crdt.Sample/Changes/SetWordTextChange.cs) +```C# +public class SetWordTextChange(Guid entityId, string text) : Change(entityId), ISelfNamedType +{ + public string Text { get; } = text; + + public override ValueTask NewEntity(Commit commit, ChangeContext context) + { + return new(new Word() + { + Id = EntityId, + Text = Text + }); + } + + + public override ValueTask ApplyChange(Word entity, ChangeContext context) + { + entity.Text = Text; + return ValueTask.CompletedTask; + } +} +``` +This is a fairly simple change, it can either create a new Word entry, or if the `entityId` passed in matches an object that has previously been created, then it will just set the `Text` field on the Word entry matching the Id. + +> [!NOTE] +> Changes will be serilized and stored forever. Try to keep the amount of data stored as small as possible. +> +> This change can either create, or update an object. Most changes will probably be either an update, or a create. In those cases you should inherit from `EditChange` or `CreateChange`. + +> [!TIP] +> [Sample changes](src/Crdt.Sample/Changes) contain a number of reference changes which are good examples for a couple different change types. There are also a built in [`DeleteChange`](https://github.com/hahn-kev/harmony/blob/external-db-context/src/Crdt/Changes/DeleteChange.cs) + +Once you have created your change types, you need to tell Harmony about them. Again update the config callback passed into `AddCrdtData` +```C# +services.AddCrdtData(config => +{ +// add the following line + config.ChangeTypeListBuilder.Add(); + config.ObjectTypeListBuilder + .Add() + .Add() + .Add(); +}); +``` + +### Use changes to author changes to CRDT objects + +either via DI, or directly from the IoC container get an instance of [`DataModel`](src/Crdt/DataModel.cs) and call `AddChange` +```C# +Guid clientId = ... get a stable Guid representing the application instance +Guid objectId = Guid.NewGuid(); +await dataModel.AddChange( + clientId, + new SetWordTextChange(objectId, "Hello World") +); +var word = await dataModel.GetLatest(objectId); +Console.WriteLine(word.Text); +``` +> [!IMPORTANT] +> the `ClientId` should be consistent for a project per computer/device. It is used to determine what changes should be synced between clients with the assumption each client produces changes sequentially. So if the app is on 2 different computers both with the same project, each one should have a unique client Id, if they had the same Id, then they would not sync changes properly. +> +> How the `ClientId` is stored is left up to the application, in FW Lite we created a table to store the ClientId, it's generated automatically when the project is downloaded or created the first time and it should never change after that. +> +> In case of a WebApp there could be one ClientId to just represent the server, however if users can author apps offline and sync them later, then each browser should get it's own ClientId + +> [!WARNING] +> If you were to regenerate the `ClientId` for each change or on application start, then you will eventually have poor sync performance as the sync process checks if there's new changes to sync per `ClientId` From e9d8c7ca5be5b96034b6ecd6900f25c9c30296a2 Mon Sep 17 00:00:00 2001 From: Kevin Hahn Date: Tue, 11 Jun 2024 06:11:40 +0700 Subject: [PATCH 2/4] Update README.md --- README.md | 48 ++++++++++++++++++++++++++++++++++++++++++++++-- 1 file changed, 46 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index e9d27dd..383493b 100644 --- a/README.md +++ b/README.md @@ -10,7 +10,7 @@ dotnet add package SIL.Harmony It's expected that you use Harmony with the .Net IoC container ([IoC intro](https://learn.microsoft.com/en-us/dotnet/core/extensions/dependency-injection)) and with EF Core. If you're not familier with that you can take a look at the [Host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) docs. If you're using ASP.NET Core you already have this setup for you. -Prerequsits: +#### Prerequsits: * Setup EF Core in your application ([Getting started docs](https://learn.microsoft.com/en-us/ef/core/get-started/overview/first-app?tabs=netcore-cli)) * Setup a Host, the default host setup for ASP.NET Core will work, or a [generic host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) for desktop apps, depending on your app. Alterativly you could create a [`ServiceCollection`](https://learn.microsoft.com/en-us/dotnet/api/microsoft.extensions.dependencyinjection.servicecollection?view=net-8.0). @@ -94,7 +94,7 @@ This is a fairly simple change, it can either create a new Word entry, or if the > This change can either create, or update an object. Most changes will probably be either an update, or a create. In those cases you should inherit from `EditChange` or `CreateChange`. > [!TIP] -> [Sample changes](src/Crdt.Sample/Changes) contain a number of reference changes which are good examples for a couple different change types. There are also a built in [`DeleteChange`](https://github.com/hahn-kev/harmony/blob/external-db-context/src/Crdt/Changes/DeleteChange.cs) +> The [Sample](src/Crdt.Sample/Changes) project contain a number of reference changes which are good examples for a couple different change types. There are also a built in [`DeleteChange`](https://github.com/hahn-kev/harmony/blob/external-db-context/src/Crdt/Changes/DeleteChange.cs) Once you have created your change types, you need to tell Harmony about them. Again update the config callback passed into `AddCrdtData` ```C# @@ -131,3 +131,47 @@ Console.WriteLine(word.Text); > [!WARNING] > If you were to regenerate the `ClientId` for each change or on application start, then you will eventually have poor sync performance as the sync process checks if there's new changes to sync per `ClientId` + +## Usage + +### Queries +`DataModel` is the primary class for both making changes and getting data. We just showed an example of making changes, so we'll start with querying data. + +Query Word objects starting with the letter "A" +```C# +DataModel dataModel; //get from IoC, probably via DI +var wordsStartingWithA = await dataModel.GetLatestObjects() + .Where(w => w.Text.StartsWith("a")) + .ToArrayAsync(); +``` +Harmony uses EF Core queries under the covers, you can read more about them [here](https://learn.microsoft.com/en-us/ef/core/querying/). + +### Submitting Changes +Changes are the only way to modify CRDT data. Here's another example of a change +```C# +DataModel dataModel; +Guid clientId; //get a stable Guid representing the application instance +var definitionId = Guid.NewGuid(); +Guid wordId; //get the word Id this definition is related to. +await dataModel.AddChange(clientId, new NewDefinitionChange(definitionId) + { + WordId = wordId, + Text = "Hello", + PartOfSpeech = partOfSpeech, + Order = order + }); +``` + +> [!WARNING] +> You can modify data returned by EF Core, and issue updates and inserts yourself, however that data will be lost, and will not sync properly. Do not directly modify the tables produced by Harmony otherwise you risk losing data. + +### Syncing data +Syncing is primarily done using the `DataModel` class, however the implementation of the server side is left up to you. You can find the Lexbox implemtnation [here](https://github.com/sillsdev/languageforge-lexbox/blob/develop/backend/LexBoxApi/Services/CrdtSyncRoutes.cs). The sync works by having 2 instances of the ISyncable interface. The local one is implemented by `DataModel` however the remote implementation depends on your server side, the FW Liate implementation can be found [here](https://github.com/sillsdev/languageforge-lexbox/blob/eefe404ab90593a2a36185f705babe0bdbcfd0d6/backend/LocalWebApp/CrdtHttpSyncService.cs#L63), you will need to scope the instance to the project and it will need to deal with authentication also. + +once you have a remote representation of the `ISyncable` interface you just call it like this +```C# +DataModel dataModel; +ISyncable remoteModel; +await dataModel.SyncWith(remoteModel); +``` +it's pretty simple, all the heavy lifting is done by the interface which is fairly simple to implement. From 847c011d2d5ca110e6490da2066c05b852b7a49b Mon Sep 17 00:00:00 2001 From: Tim Haasdyk Date: Wed, 12 Jun 2024 16:56:00 +0200 Subject: [PATCH 3/4] readme touchups --- README.md | 38 +++++++++++++++++++------------------- 1 file changed, 19 insertions(+), 19 deletions(-) diff --git a/README.md b/README.md index 383493b..328e0ce 100644 --- a/README.md +++ b/README.md @@ -14,7 +14,7 @@ It's expected that you use Harmony with the .Net IoC container ([IoC intro](http * Setup EF Core in your application ([Getting started docs](https://learn.microsoft.com/en-us/ef/core/get-started/overview/first-app?tabs=netcore-cli)) * Setup a Host, the default host setup for ASP.NET Core will work, or a [generic host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) for desktop apps, depending on your app. Alterativly you could create a [`ServiceCollection`](https://learn.microsoft.com/en-us/dotnet/api/microsoft.extensions.dependencyinjection.servicecollection?view=net-8.0). -### Configure Db context +### Configure DbContext EF Core needs to be told about the entities used by Harmony, for now these are just [`Commit`](src/Crdt/Commit.cs), [`Snapshot`](src/Crdt/Db/ObjectSnapshot.cs), and [`ChangeEntitiy`](Crdt.Core/ChangeEntity.cs) ```C# public class AppDbContext: DbContext { @@ -40,15 +40,15 @@ builder.Service.AddCrdtData(config => {}); > the config callback passed into `AddCrdtData` is currently empty, we'll come back to that later. > [!TIP] -> Pay attention to the generic type when calling `AddCrdtData`, this will be the type of your applications DB Context. +> Pay attention to the generic type when calling `AddCrdtData`, this will be the type of your application's DbContext. ### Define CRDT objects -now that you have the services setup, you need to define a CRDT object. Take a look the following examples -* [`Word`](src/Crdt.Sample/Models/Word.cs) notice this contains a reference to an Antonym Word. -* [`Definition`](src/Crdt.Sample/Models/Definition.cs) references the Word it belongs to, notice that if the Word Reference is removed, the Definition deletes itself. +Now that you have the services setup, you need to define a CRDT object. Take a look the following examples +* [`Word`](src/Crdt.Sample/Models/Word.cs) contains a reference to an Antonym Word. +* [`Definition`](src/Crdt.Sample/Models/Definition.cs) references the Word it belongs to. Notice that if the Word Reference is removed, the Definition deletes itself. * [`Example`](src/Crdt.Sample/Models/Example.cs) this one is special because it uses a YDoc to store the example text in a [Yjs](https://github.com/yjs/yjs) compatible format. This allows the example sentence to be edited by multiple users and have those changes merged using the yjs CRDT algorithm. -Once you have created your CRDT object, you need to tell Harmony about it. Update the config callback passed into `AddCrdtData` +Once you have created your CRDT objects, you need to tell Harmony about them. Update the config callback passed into `AddCrdtData` ```C# services.AddCrdtData(config => { @@ -61,7 +61,7 @@ services.AddCrdtData(config => ``` ### Define CRDT Changes -Now that we've defined our objects, we need to define our changes which will record user intent when making changes to objects. How detailed and specific you make your changes will directly impact how changes get merged between clients and how often users 'lose' changes that they made. +Now that you've defined your objects, you need to define your changes. These record user intent when making changes to objects. How detailed and specific you make your changes will directly impact how changes get merged between clients and how often users 'lose' changes that they made. Example [`SetWordTextChange`](src/Crdt.Sample/Changes/SetWordTextChange.cs) ```C# @@ -89,7 +89,7 @@ public class SetWordTextChange(Guid entityId, string text) : Change(entity This is a fairly simple change, it can either create a new Word entry, or if the `entityId` passed in matches an object that has previously been created, then it will just set the `Text` field on the Word entry matching the Id. > [!NOTE] -> Changes will be serilized and stored forever. Try to keep the amount of data stored as small as possible. +> Changes will be serialized and stored forever. Try to keep the amount of data stored as small as possible. > > This change can either create, or update an object. Most changes will probably be either an update, or a create. In those cases you should inherit from `EditChange` or `CreateChange`. @@ -109,9 +109,9 @@ services.AddCrdtData(config => }); ``` -### Use changes to author changes to CRDT objects +### Use change objects to author changes to CRDT objects -either via DI, or directly from the IoC container get an instance of [`DataModel`](src/Crdt/DataModel.cs) and call `AddChange` +Either via DI, or directly from the IoC container get an instance of [`DataModel`](src/Crdt/DataModel.cs) and call `AddChange` ```C# Guid clientId = ... get a stable Guid representing the application instance Guid objectId = Guid.NewGuid(); @@ -123,19 +123,19 @@ var word = await dataModel.GetLatest(objectId); Console.WriteLine(word.Text); ``` > [!IMPORTANT] -> the `ClientId` should be consistent for a project per computer/device. It is used to determine what changes should be synced between clients with the assumption each client produces changes sequentially. So if the app is on 2 different computers both with the same project, each one should have a unique client Id, if they had the same Id, then they would not sync changes properly. +> The `ClientId` should be consistent for a project per computer/device. It is used to determine what changes should be synced between clients with the assumption that each client produces changes sequentially. So if a project is on 2 different computers, each copy should have a unique client Id. If they had the same Id, then they would not sync changes properly. > -> How the `ClientId` is stored is left up to the application, in FW Lite we created a table to store the ClientId, it's generated automatically when the project is downloaded or created the first time and it should never change after that. +> How the `ClientId` is stored is left up to the application. In FW Lite we created a table to store the ClientId. It's generated automatically when the project is downloaded or created the first time and it should never change after that. > -> In case of a WebApp there could be one ClientId to just represent the server, however if users can author apps offline and sync them later, then each browser should get it's own ClientId +> In case of an online web app there could be one ClientId to represent the server. However, if users can author changes offline and sync them later, then each browser would need it's own ClientId. > [!WARNING] -> If you were to regenerate the `ClientId` for each change or on application start, then you will eventually have poor sync performance as the sync process checks if there's new changes to sync per `ClientId` +> If you were to regenerate the `ClientId` for each change or on application start, that would eventually result in poor sync performance, as the sync process checks for new changes to sync per `ClientId`. ## Usage ### Queries -`DataModel` is the primary class for both making changes and getting data. We just showed an example of making changes, so we'll start with querying data. +`DataModel` is the primary class for both making changes and getting data. Above you saw an example of making changes, now we'll start querying data. Query Word objects starting with the letter "A" ```C# @@ -163,15 +163,15 @@ await dataModel.AddChange(clientId, new NewDefinitionChange(definitionId) ``` > [!WARNING] -> You can modify data returned by EF Core, and issue updates and inserts yourself, however that data will be lost, and will not sync properly. Do not directly modify the tables produced by Harmony otherwise you risk losing data. +> You can modify data returned by EF Core, and issue updates and inserts yourself, but that data will be lost, and will not sync properly. Do not directly modify the tables produced by Harmony otherwise you risk losing data. ### Syncing data -Syncing is primarily done using the `DataModel` class, however the implementation of the server side is left up to you. You can find the Lexbox implemtnation [here](https://github.com/sillsdev/languageforge-lexbox/blob/develop/backend/LexBoxApi/Services/CrdtSyncRoutes.cs). The sync works by having 2 instances of the ISyncable interface. The local one is implemented by `DataModel` however the remote implementation depends on your server side, the FW Liate implementation can be found [here](https://github.com/sillsdev/languageforge-lexbox/blob/eefe404ab90593a2a36185f705babe0bdbcfd0d6/backend/LocalWebApp/CrdtHttpSyncService.cs#L63), you will need to scope the instance to the project and it will need to deal with authentication also. +Syncing is primarily done using the `DataModel` class, however the implementation of the server side is left up to you. You can find the Lexbox implemtnation [here](https://github.com/sillsdev/languageforge-lexbox/blob/develop/backend/LexBoxApi/Services/CrdtSyncRoutes.cs). The sync works by having 2 instances of the ISyncable interface. The local one is implemented by `DataModel` and the remote implementation depends on your server side. The FW Lite implementation can be found [here](https://github.com/sillsdev/languageforge-lexbox/blob/eefe404ab90593a2a36185f705babe0bdbcfd0d6/backend/LocalWebApp/CrdtHttpSyncService.cs#L63). You will need to scope the instance to the project as well as deal with authentication. -once you have a remote representation of the `ISyncable` interface you just call it like this +Once you have a remote representation of the `ISyncable` interface you just call it like this ```C# DataModel dataModel; ISyncable remoteModel; await dataModel.SyncWith(remoteModel); ``` -it's pretty simple, all the heavy lifting is done by the interface which is fairly simple to implement. +It's that easy. All the heavy lifting is done by the interface which is fairly simple to implement. From 46a85fa015e0699029030f017f31dc1a6d5432e9 Mon Sep 17 00:00:00 2001 From: Kevin Hahn Date: Wed, 19 Jun 2024 11:21:37 -0500 Subject: [PATCH 4/4] fix spelling --- README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 328e0ce..2b0590d 100644 --- a/README.md +++ b/README.md @@ -10,9 +10,9 @@ dotnet add package SIL.Harmony It's expected that you use Harmony with the .Net IoC container ([IoC intro](https://learn.microsoft.com/en-us/dotnet/core/extensions/dependency-injection)) and with EF Core. If you're not familier with that you can take a look at the [Host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) docs. If you're using ASP.NET Core you already have this setup for you. -#### Prerequsits: +#### Prerequisites: * Setup EF Core in your application ([Getting started docs](https://learn.microsoft.com/en-us/ef/core/get-started/overview/first-app?tabs=netcore-cli)) -* Setup a Host, the default host setup for ASP.NET Core will work, or a [generic host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) for desktop apps, depending on your app. Alterativly you could create a [`ServiceCollection`](https://learn.microsoft.com/en-us/dotnet/api/microsoft.extensions.dependencyinjection.servicecollection?view=net-8.0). +* Setup a Host, the default host setup for ASP.NET Core will work, or a [generic host](https://learn.microsoft.com/en-us/dotnet/core/extensions/generic-host?tabs=hostbuilder) for desktop apps, depending on your app. Alternatively you could create a [`ServiceCollection`](https://learn.microsoft.com/en-us/dotnet/api/microsoft.extensions.dependencyinjection.servicecollection?view=net-8.0). ### Configure DbContext EF Core needs to be told about the entities used by Harmony, for now these are just [`Commit`](src/Crdt/Commit.cs), [`Snapshot`](src/Crdt/Db/ObjectSnapshot.cs), and [`ChangeEntitiy`](Crdt.Core/ChangeEntity.cs) @@ -166,7 +166,7 @@ await dataModel.AddChange(clientId, new NewDefinitionChange(definitionId) > You can modify data returned by EF Core, and issue updates and inserts yourself, but that data will be lost, and will not sync properly. Do not directly modify the tables produced by Harmony otherwise you risk losing data. ### Syncing data -Syncing is primarily done using the `DataModel` class, however the implementation of the server side is left up to you. You can find the Lexbox implemtnation [here](https://github.com/sillsdev/languageforge-lexbox/blob/develop/backend/LexBoxApi/Services/CrdtSyncRoutes.cs). The sync works by having 2 instances of the ISyncable interface. The local one is implemented by `DataModel` and the remote implementation depends on your server side. The FW Lite implementation can be found [here](https://github.com/sillsdev/languageforge-lexbox/blob/eefe404ab90593a2a36185f705babe0bdbcfd0d6/backend/LocalWebApp/CrdtHttpSyncService.cs#L63). You will need to scope the instance to the project as well as deal with authentication. +Syncing is primarily done using the `DataModel` class, however the implementation of the server side is left up to you. You can find the Lexbox implementation [here](https://github.com/sillsdev/languageforge-lexbox/blob/develop/backend/LexBoxApi/Services/CrdtSyncRoutes.cs). The sync works by having 2 instances of the ISyncable interface. The local one is implemented by `DataModel` and the remote implementation depends on your server side. The FW Lite implementation can be found [here](https://github.com/sillsdev/languageforge-lexbox/blob/eefe404ab90593a2a36185f705babe0bdbcfd0d6/backend/LocalWebApp/CrdtHttpSyncService.cs#L63). You will need to scope the instance to the project as well as deal with authentication. Once you have a remote representation of the `ISyncable` interface you just call it like this ```C#