-
Notifications
You must be signed in to change notification settings - Fork 1
/
eval.log
45 lines (45 loc) · 2.56 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_Swedish-PUD:
commit 4236c3fbbd938b00ef411ea773f9219cc0548959
Merge: be2a807 205f660
Author: Dan Zeman <[email protected]>
Size: counted 19076 of 19076 words (nodes).
Size: min(0, log((N/1000)**2)) = 5.89686200087196.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Did not find more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.5.
Universal POS tags: 16 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.1.
Features: 12708 out of 19076 total words have one or more features.
Features: source of annotation (from README) factor is 0.5.
Universal relations: 32 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 1686
Udapi: found 1686 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 19076 words.
Genres: found 2 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang sv --max-err=10 UD_Swedish-PUD/sv_pud-ud-test.conllu
[Line 11977 Sent w01053067 Node 15]: [L3 Warning fixed-gap] Gaps in fixed expression [15, 19] 'för * * * sedan'
[Line 14121 Sent w01091016 Node 24]: [L3 Warning fixed-gap] Gaps in fixed expression [24, 27] 'för * * sedan'
[Line 17165 Sent w01143015 Node 1]: [L3 Warning fixed-gap] Gaps in fixed expression [1, 5] 'För * * * sedan'
[Line 20035 Sent n05002004 Node 27]: [L3 Warning fixed-gap] Gaps in fixed expression [27, 30] 'för * * sedan'
Warnings: 4
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.5) = 0.0384615384615385
(weight=0.0769230769230769) * (score{genres}=0.117647058823529) = 0.00904977375565611
(weight=0.0769230769230769) * (score{lemmas}=0.5) = 0.0384615384615385
(weight=0.256410256410256) * (score{size}=0.426829104587276) = 0.109443360150584
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=0.0941176470588235) = 0.00723981900452489
(weight=0.307692307692308) * (score{udapi}=0.11616691130216) = 0.0357436650160491
(weight=0.0769230769230769) * (score{udeprels}=0.864864864864865) = 0.0665280665280665
(TOTAL score=0.322363658813855) * (availability=1) * (validity=1) = 0.322363658813855
STARS = 1.5
UD_Swedish-PUD 0.322363658813855 1.5