Update sum-rel.py to summarize-release.py#44

Merged
goodmami merged 2 commits into main from gh-19-summarize-release
Feb 11, 2025

@goodmami
Collaborator

  • Rename sum-rel.py to summarize-release.py for clarity
  • Use a newer Wn version
  • Load from the LMF file instead of adding to a DB (no need for a temporary DB)
  • Output Markdown instead of HTML (targeting GitHub)

Resolves #19

Use like this:

$ python scripts/summarize-release.py build/omw-1.5 --core-ili etc/wn-core-ili.tab
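
For readers unfamiliar with the Core column, here is a minimal sketch of how such a percentage could be computed. It is not the code in summarize-release.py. It assumes the `--core-ili` file lists one ILI identifier in the first tab-separated column of each line, and that Core means the share of those identifiers covered by the lexicon's synsets. The function names `read_core_ilis` and `core_coverage` are hypothetical.

```python
import io


def read_core_ilis(lines):
    """Collect ILI identifiers from the first tab-separated column.

    Assumed layout (not confirmed by this PR): one identifier per line,
    optional extra columns, blank lines and '#' comments ignored.
    """
    core = set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        core.add(line.split("\t")[0])
    return core


def core_coverage(synset_ilis, core_ilis):
    """Return the percentage of core ILIs that appear among synset_ilis."""
    if not core_ilis:
        return 0.0
    # Synsets without an ILI mapping (None or "") cannot count as covered.
    covered = core_ilis & {ili for ili in synset_ilis if ili}
    return 100.0 * len(covered) / len(core_ilis)


# Toy data standing in for etc/wn-core-ili.tab and one lexicon's synsets.
sample_file = io.StringIO("i1\ni2\ni3\ni4\n")
core = read_core_ilis(sample_file)
print(f"{core_coverage(['i1', 'i3', None, 'i9'], core):.1f}%")  # prints 50.0%
```
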

Here's the output for the 1.5 release (current state):

| ID:ver | Lang | Label | License | Synsets | Senses | Words | Core |
| --- | --- | --- | --- | ---: | ---: | ---: | ---: |
| omw-arb:1.5 | arb | Arabic WordNet (AWN v2) | CC-BY-SA 3.0 | 9916 | 37342 | 18003 | 47.5% |
| omw-bg:1.5 | bg | BulTreeBank Wordnet (BTB-WN) | CC-BY-3.0 | 4959 | 8936 | 6737 | 99.2% |
| omw-ca:1.5 | ca | Multilingual Central Repository (Catalan) | CC-BY-3.0 | 60462 | 100120 | 69301 | 88.7% |
| omw-cmn:1.5 | cmn-Hans | Chinese Open Wordnet | wordnet | 42312 | 79809 | 63347 | 99.7% |
| omw-da:1.5 | da | DanNet | wordnet | 4476 | 5859 | 4521 | 81.0% |
| omw-el:1.5 | el | Greek Wordnet | Apache-2.0 | 18049 | 24106 | 18264 | 56.9% |
| omw-en15:1.5 | en | OMW English Wordnet based on WordNet-1.5 | WordNet | 91591 | 168215 | 127139 | 97.0% |
| omw-en16:1.5 | en | OMW English Wordnet based on WordNet-1.6 | WordNet | 99642 | 174002 | 130235 | 97.9% |
| omw-en17:1.5 | en | OMW English Wordnet based on WordNet-1.7 | WordNet | 109377 | 192548 | 145772 | 98.2% |
| omw-en171:1.5 | en | OMW English Wordnet based on WordNet-1.7.1 | WordNet | 111223 | 195817 | 147417 | 98.5% |
| omw-en20:1.5 | en | OMW English Wordnet based on WordNet-2.0 | WordNet | 115424 | 203147 | 153236 | 98.7% |
| omw-en21:1.5 | en | OMW English Wordnet based on WordNet-2.1 | WordNet | 117597 | 207018 | 156588 | 98.3% |
| omw-en30:1.5 | en | OMW English Wordnet based on WordNet-3.0 | WordNet | 117659 | 206978 | 156584 | 100.0% |
| omw-en31:1.5 | en | OMW English Wordnet based on WordNet-3.1 | WordNet | 117791 | 207272 | 156762 | 99.9% |
| omw-es:1.5 | es | Multilingual Central Repository (Spanish) | CC-BY-3.0 | 78417 | 145641 | 93834 | 95.7% |
| omw-eu:1.5 | eu | Multilingual Central Repository (Basque) | CC-BY-3.0 | 29414 | 48933 | 26388 | 70.5% |
| omw-fi:1.5 | fi | FinnWordNet | CC-BY-3.0 | 116763 | 189227 | 130742 | 99.8% |
| omw-fr:1.5 | fr | WOLF (Wordnet Libre du Français) | CeCILL-1.0 | 59091 | 102651 | 59616 | 92.4% |
| omw-gl:1.5 | gl | Multilingual Central Repository (Galician) | CC-BY-3.0 | 34770 | 53121 | 40874 | 70.2% |
| omw-he:1.5 | he | Hebrew Wordnet | wordnet | 5448 | 6872 | 5379 | 27.2% |
| omw-hr:1.5 | hr | Croatian Wordnet | CC-BY-3.0 | 23120 | 47900 | 29089 | 100.0% |
| omw-id:1.5 | id | Wordnet Bahasa (Indonesian) | MIT | 38085 | 106688 | 41478 | 94.0% |
| omw-is:1.5 | is | IceWordNet | CC-BY-3.0 | 4951 | 16004 | 11655 | 99.2% |
| omw-it:1.5 | it | MultiWordNet (Italian) | CC-BY-3.0 | 35001 | 63133 | 43011 | 83.0% |
| omw-iwn:1.5 | it | ItalWordNet | ODC-BY | 15563 | 24135 | 19680 | 47.6% |
| omw-ja:1.5 | ja | Japanese Wordnet | wordnet | 57184 | 158069 | 94002 | 94.7% |
| omw-lt:1.5 | lt | Lithuanian WordNet | CC-BY-SA 3.0 | 9462 | 16032 | 11428 | 35.4% |
| omw-nb:1.5 | nb | Norwegian Wordnet (Bokmål) | wordnet | 4455 | 5586 | 4244 | 80.8% |
| omw-nl:1.5 | nl | Open Dutch WordNet | CC-BY-SA 4.0 | 30177 | 60259 | 43667 | 66.8% |
| omw-nn:1.5 | nn | Norwegian Wordnet (Nynorsk) | wordnet | 3671 | 4762 | 3436 | 65.6% |
| omw-pl:1.5 | pl | plWordNet | wordnet | 33826 | 52378 | 45458 | 54.1% |
| omw-pt:1.5 | pt | OpenWN-PT | CC-BY-SA | 43895 | 74012 | 54932 | 84.1% |
| omw-ro:1.5 | ro | Romanian Wordnet | CC-BY-SA | 56026 | 84638 | 52600 | 93.5% |
| omw-sk:1.5 | sk | Slovak WordNet | CC-BY-SA 3.0 | 18507 | 44029 | 29228 | 58.1% |
| omw-sl:1.5 | sl | sloWNet | CC-BY-SA 3.0 | 42583 | 70945 | 40340 | 86.1% |
| omw-sq:1.5 | sq | Albanet | CC-BY-3.0 | 4675 | 9599 | 6489 | 30.8% |
| omw-sv:1.5 | sv | WordNet-SALDO | CC-BY-3.0 | 6796 | 6904 | 5872 | 99.2% |
| omw-th:1.5 | th | Thai Wordnet | wordnet | 73350 | 95517 | 83481 | 80.9% |
| omw-zsm:1.5 | zsm | Wordnet Bahasa (Malaysian) | MIT | 36911 | 105028 | 38755 | 96.3% |
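
As an illustration of the Markdown output, summary rows like these could be rendered as a GitHub-flavored table roughly as follows. This is a sketch rather than the script's actual code, and the helper name `markdown_table` is hypothetical. The two sample rows are copied from the output above.

```python
def markdown_table(headers, rows):
    """Render headers and rows as a GitHub-flavored Markdown pipe table."""

    def fmt(cells):
        # Escape literal pipes so a cell cannot break the column layout.
        return "| " + " | ".join(str(c).replace("|", "\\|") for c in cells) + " |"

    lines = [fmt(headers), "| " + " | ".join("---" for _ in headers) + " |"]
    lines.extend(fmt(row) for row in rows)
    return "\n".join(lines)


headers = ["ID:ver", "Lang", "Label", "License", "Synsets", "Senses", "Words", "Core"]
rows = [
    ["omw-bg:1.5", "bg", "BulTreeBank Wordnet (BTB-WN)", "CC-BY-3.0", 4959, 8936, 6737, "99.2%"],
    ["omw-sv:1.5", "sv", "WordNet-SALDO", "CC-BY-3.0", 6796, 6904, 5872, "99.2%"],
]
print(markdown_table(headers, rows))
```

Because the result is plain text, it can be pasted or piped directly into a release description.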

@goodmami
Collaborator Author

The second commit should link those omw-en lexicons to this GitHub project.

Also, the idea is that this script will be run by the GitHub Actions CI and its output will be added to the release page.

@goodmami goodmami merged commit b16e981 into main Feb 11, 2025

Development

Successfully merging this pull request may close these issues.

Show a summary of each release
