Wk source zips #6

wkelly17 · 2024-03-26T19:38:50Z

@PurpleGuitar, @danparisd - This is branched off dev before the merge, so it looks like 5 commits, but it is really just one:
9d38750.
This PR adds a view for the source_zips Mark requested, if they are actually what he wants. Replicated here with some notes:

Not sure if this is exactly what you want, Craig. I can pretty easily adjust it to add the full language name or something similar if Orature or another consumer needs that.
Some notes:

This assumes that the branch in wacs is master. The archive/master.zip path is a wacs convention.
This assumes, as is true right now, that all these projects are in wacs (these zips are wacs api things).
This filters content by domain (scripture only), as well as by the meta properties of biel/primary.
I was under the impression that we only wanted to expose repos for which an entire project rendered successfully, hence the count of unique rendered book slugs must be > 26 (i.e. 27 = NT; more than that and we assume it's probably an OT + NT).
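
To make the notes above concrete, here is a minimal sketch of the same filtering logic in plain TypeScript rather than as a database view. The row shape, the field names, and the example.org host are assumptions for illustration only, not the actual schema or the real wacs host, and the biel/primary filter is left out.

```typescript
// Hypothetical row shape: one row per rendered book of a repo.
interface RenderedBook {
  repoUrl: string; // repo URL without a trailing "/archive/..." part
  domain: string; // e.g. "scripture"
  bookSlug: string | null;
}

// 27 unique books = a full NT; anything above is assumed to be OT + NT.
const MIN_UNIQUE_BOOKS = 27;

// The archive/<branch>.zip convention, assuming the branch is master.
function sourceZipUrl(repoUrl: string, branch = "master"): string {
  return `${repoUrl.replace(/\/+$/, "")}/archive/${branch}.zip`;
}

// Returns zip URLs only for scripture repos with more than 26 unique rendered books.
function listSourceZips(rows: RenderedBook[]): string[] {
  const slugsByRepo = new Map<string, Set<string>>();
  for (const row of rows) {
    if (row.domain !== "scripture" || row.bookSlug === null) continue;
    const slugs = slugsByRepo.get(row.repoUrl) ?? new Set<string>();
    slugs.add(row.bookSlug);
    slugsByRepo.set(row.repoUrl, slugs);
  }
  const urls: string[] = [];
  slugsByRepo.forEach((slugs, repoUrl) => {
    if (slugs.size >= MIN_UNIQUE_BOOKS) urls.push(sourceZipUrl(repoUrl));
  });
  return urls;
}
```

A repo with only 26 unique rendered books is dropped entirely, so a partially rendered NT never shows up in the list.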

Here is an example result:

If this isn't what you had in mind, we can certainly pivot, but this is at least what I understand that Mark/Orature was asking for.

cloudflare-workers-and-pages · 2024-04-03T21:52:55Z

Deploying languageapi with Cloudflare Pages

Latest commit:	`1bb22ab`
Status:	✅ Deploy successful!
Preview URL:	https://61fe2e14.languageapi.pages.dev
Branch Preview URL:	https://wk-source-zips.languageapi.pages.dev

wkelly17

@PurpleGuitar @danparisd - I probably should have put this in a different PR rather than just attaching it to source zips, but since both will need reworking if we adjust the architecture, I figured it's fine to just tack it on here.

wkelly17 · 2024-04-03T22:02:11Z

controller/src/db/schema/schema.ts

@PurpleGuitar @danparisd - this is an initial proposal for tracking localization of Bible book names and of words like "tw" -> translation words. As seen, I figured the most logical unique index (and in this case I just made it the composite primary key) is the ietf code already in the database plus a "key" for the string.
The one question I have is whether we want to add something like a "domain" to this table as well. I.e. the domain of Bible books would be "bible_book", and the domain of "tn" would be something like resource types. There aren't currently conflicts between things like Gen and tn, and I'm not aware of any potential conflicts since the Bible book slugs are set in English. But thoughts?
Forgot to add the lines. Those are here: https://github.com/WycliffeAssociates/languageapi/pull/6/files/afea3a4b1556f3db8b0f4ccadd3d3547a845c8eb#diff-ea2dd11638ccb90cd55880ec76d010447b0b2737763ba9563fac7455956d9096R321-R338
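
For discussion, a rough in-memory sketch of what the composite key (ietf code + key) implies. This is not the schema from the linked lines; the class and field names are made up for illustration.

```typescript
// Hypothetical row shape for the localization table.
interface LocalizationRow {
  ietfCode: string; // e.g. "en"
  key: string; // e.g. "gen" or "tn"
  value: string; // the localized string
}

// Minimal stand-in for a table whose primary key is (ietfCode, key).
class LocalizationTable {
  private readonly rows = new Map<string, LocalizationRow>();

  // Encode the composite key so ("a", "b c") and ("a b", "c") cannot collide.
  private static pk(ietfCode: string, key: string): string {
    return JSON.stringify([ietfCode, key]);
  }

  // Insert, or overwrite the row that has the same (ietfCode, key).
  upsert(row: LocalizationRow): void {
    this.rows.set(LocalizationTable.pk(row.ietfCode, row.key), row);
  }

  get(ietfCode: string, key: string): string | undefined {
    return this.rows.get(LocalizationTable.pk(ietfCode, key))?.value;
  }

  get size(): number {
    return this.rows.size;
  }
}
```

With this key, a book slug and a resource type that happened to share the same string in the same language would overwrite each other, which is the case a "domain" column would guard against.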

wkelly17 · 2024-04-03T22:06:33Z

controller/src/functions/localization.ts

+  // const query = sql.raw(`SELECT book_name, book_slug, ietf_code, id
+  // FROM (SELECT book_name, book_slug, l.ietf_code, c.id,
+  // ROW_NUMBER() OVER (PARTITION BY l.ietf_code, book_slug ORDER BY book_slug) AS rn
+  //     FROM scriptural_rendering_metadata AS srm
+  //     JOIN rendering AS r ON r.id = srm.rendering_id
+  //     JOIN content AS c ON r.content_id = c.id
+  //     JOIN language AS l ON l.ietf_code = c.language_id
+  //     JOIN git_repo AS gr ON c.git_id = gr.id
+  //     WHERE gr.username ILIKE '%wa-catalog%'
+  //     AND c.domain = 'scripture'
+  //     AND book_slug IS NOT NULL
+  // ) AS subquery
+  // WHERE rn = 1
+  // ORDER BY ietf_code, book_slug;`);


@PurpleGuitar @danparisd - I've done it with the ORM instead of raw SQL, but I imagine the commented-out raw SQL might be more readable. In short, this query has a hardcoded dependency on the git repo username being wa-catalog. We only need one localized version for each slug (e.g. Gen, Exo), hence the select where rn = 1 for each partition.
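
In case it helps with reading the rn = 1 part, here is a small sketch of the same "one row per (ietf_code, book_slug)" idea over plain objects. It is not the ORM query from this PR, and the camelCase field names are assumptions. Note that the SQL orders each partition by book_slug, which is constant within a partition, so which duplicate wins there is up to the database; this sketch simply keeps the first row it sees.

```typescript
// Hypothetical shape of the joined rows (after the wa-catalog and scripture filters).
interface BookNameRow {
  bookName: string;
  bookSlug: string | null;
  ietfCode: string;
  id: string;
}

const compareStrings = (a: string, b: string): number => (a < b ? -1 : a > b ? 1 : 0);

// Keep one row per (ietfCode, bookSlug), then sort by ietfCode and bookSlug.
function firstPerLanguageAndSlug(rows: BookNameRow[]): BookNameRow[] {
  const firstSeen = new Map<string, BookNameRow>();
  for (const row of rows) {
    if (row.bookSlug === null) continue; // mirrors "book_slug IS NOT NULL"
    const partition = JSON.stringify([row.ietfCode, row.bookSlug]);
    if (!firstSeen.has(partition)) firstSeen.set(partition, row); // rn = 1
  }
  return Array.from(firstSeen.values()).sort(
    (a, b) =>
      compareStrings(a.ietfCode, b.ietfCode) ||
      compareStrings(a.bookSlug ?? "", b.bookSlug ?? "")
  );
}
```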

The result looks like this:

wkelly17 · 2024-04-03T22:07:32Z

controller/src/functions/localization.ts

+app.timer("manageLocalizationTable", {
+  schedule: "0 0 0 * * *",
+  handler: populateLocalization,
+  useMonitor: false,
+});


We could run this on something other than a cron, but for now, short of setting up something more complicated such as event-driven inserts based on the metadata table, this is a really straightforward way, and this data surely isn't going to be that time critical, I'd imagine.
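
A note on reading the schedule string, assuming this is the Azure Functions timer trigger (which the `app.timer` call suggests): those use six-field NCRONTAB expressions with a leading seconds field, so "0 0 0 * * *" would fire once a day at midnight. The helper below is a hypothetical sketch that only labels the fields; it is not part of the PR and does not evaluate the schedule.

```typescript
// Field order of a six-field NCRONTAB expression (assumption: Azure Functions timers).
const NCRONTAB_FIELDS = ["second", "minute", "hour", "dayOfMonth", "month", "dayOfWeek"] as const;
type NcrontabField = (typeof NCRONTAB_FIELDS)[number];

// Split a schedule string into named fields; throws if it does not have six fields.
function describeSchedule(expression: string): Record<NcrontabField, string> {
  const parts = expression.trim().split(/\s+/);
  if (parts.length !== NCRONTAB_FIELDS.length) {
    throw new Error(`expected ${NCRONTAB_FIELDS.length} fields, got ${parts.length}`);
  }
  const labelled = {} as Record<NcrontabField, string>;
  NCRONTAB_FIELDS.forEach((name, index) => {
    labelled[name] = parts[index];
  });
  return labelled;
}
```

Calling `describeSchedule("0 0 0 * * *")` labels second, minute, and hour as "0" and the remaining three fields as "*". A classic five-field cron string is rejected rather than silently shifted by one field.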

wkelly17 · 2024-04-03T22:10:51Z

controller/src/localizations/en.ts

+const en = {
+  tw: "Translations Words",
+  tn: "Translation Notes",
+};
+export type keysType = keyof typeof en;
+export default {dict: en, ietf: "en"};


I know that you mentioned doing this in Crowdin, and we could most likely do it via their API. For now, though, I've put these into TS files. We probably need to decide the scope of what to translate. Moreover, there's some junky data currently in the content resource types that needs cleaning up, where the resource type is clearly not something we would consider a resource type. Probably worth having a discussion on.
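
As a sketch of how per-language TS dictionaries like this might be consumed, here is a lookup that falls back to en when a language or key is missing. The `es` dictionary, the `dictionaries` registry, and the `localize` function are hypothetical and not in this PR; only the `en` object is copied from en.ts above.

```typescript
// Copied from en.ts in this PR.
const en = {
  tw: "Translations Words",
  tn: "Translation Notes",
};
type KeysType = keyof typeof en;

// Other languages may translate only a subset of the keys.
type Dict = Partial<Record<KeysType, string>>;

// Hypothetical second dictionary, for illustration only.
const es: Dict = { tn: "Notas de traducción" };

// Hypothetical registry keyed by ietf code.
const dictionaries: Record<string, Dict | undefined> = { en, es };

// Look up a string for a language, falling back to English when it is missing.
function localize(ietf: string, key: KeysType): string {
  return dictionaries[ietf]?.[key] ?? en[key];
}
```

Typing the other dictionaries against `keyof typeof en` means a key that does not exist in en.ts fails at compile time, which may help keep the scope of translatable strings in one place.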

PurpleGuitar · 2024-04-04T18:30:22Z

Re: the source.zip list, I think this is a great first pass. If I read it right, it should return a JSON document containing all the source zips with some metadata. Looks good to me. 👍

danparisd

LGTM

wkelly17 requested review from PurpleGuitar and danparisd March 26, 2024 19:38

wkelly17 and others added 10 commits April 24, 2024 12:36

add renderings table. Move gateway to walangmeta. Rename some properties

71ae97b

adjust makefile dump back to what it should be

5d5bef7

make runnable locally, add dev bus string

ff172cc

adjusted the logic for creating content row if not present while updating renderings table

992f628

add a view for getting source zips from wacs

07e1aa3

add a localization table and cron trigger

8550ca6

Update Crowdin configuration file

d5cda67

Update Crowdin configuration file

7e92a09

Update crowdin.yml

26e40fe

Revert to only en

rebase dev onto feature branch that has crowdin

e9f6f08

wkelly17 force-pushed the wk-source_zips branch from e72296e to e9f6f08 on April 24, 2024 17:54

PurpleGuitar approved these changes Jul 2, 2024

wkelly17 added 6 commits July 3, 2024 10:14

code for bus messages for audio

a73b610

merge prod to this feature branch

df7efe5

fix some hasura metadata for view src zips

60fbb7d

Merge branch 'prod' into wk-source_zips

19621c6

make subscription name an env var. Adjust open api docs

dd0ae2e

change name of FileBase to FileBasePath

1bb22ab

danparisd approved these changes Aug 6, 2024
