[Discussion] AnnotationTable.TokenizedAnnotationTable #39

HLWeil · 2023-10-20T14:43:37Z

I think we should reconsider the current design of this type as it's kind of an awkward state:

Currently it is split into a list of IO columns and a list of Term Columns. This has two-fold problems according to the current proposed state of the ARC specification 1.2:

What about non-term and non-IO columns like Protocol REF?
There MUST be at most 1 Input and 1 Output Column, so a list seems counterintuitive.

Alternatively to trying to design this in some specific way, we could also keep it more naive and just have a list of columns (including terms, IOs and whatever)?

#25

The text was updated successfully, but these errors were encountered:

kMutagene · 2023-10-24T19:01:43Z

i am thinking of a complete rework of the parsing. I think we should use ARCtrl's composite column model.

kMutagene · 2023-10-24T19:03:27Z

iirc ARCtrl parses annotation tables like this:

pattern match and assign grouping
everything not assignes is Freetext

if that is true, then it should be easy to use for tokenization as well, by filling these composite columns with CvParams in an additional step.

Sounds good? @HLWeil

HLWeil · 2023-10-25T07:30:21Z

Yup that's pretty much it.

It sounds fine with me, provided that it doesn't fail in some specific cases which should be checked. But as a starting point for getting your tokens for further use it should be good!

kMutagene · 2024-02-14T08:50:55Z

Closing this as we use ARCtr's ARCTable parser now, which we then tokenize. See #48

kMutagene closed this as completed Feb 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Discussion] AnnotationTable.TokenizedAnnotationTable #39

[Discussion] AnnotationTable.TokenizedAnnotationTable #39

HLWeil commented Oct 20, 2023

kMutagene commented Oct 24, 2023

kMutagene commented Oct 24, 2023 •

edited

Loading

HLWeil commented Oct 25, 2023

kMutagene commented Feb 14, 2024

[Discussion] AnnotationTable.TokenizedAnnotationTable #39

[Discussion] AnnotationTable.TokenizedAnnotationTable #39

Comments

HLWeil commented Oct 20, 2023

kMutagene commented Oct 24, 2023

kMutagene commented Oct 24, 2023 • edited Loading

HLWeil commented Oct 25, 2023

kMutagene commented Feb 14, 2024

kMutagene commented Oct 24, 2023 •

edited

Loading