Name | Last modified | Size | Description | |
---|---|---|---|---|
afrikaans.tokenize_c..> | 2022-04-02 09:28 | 359K | ||
afrikaans.zip | 2021-02-05 06:03 | 24M | ||
ancient-greek-perseu..> | 2022-04-02 09:28 | 3.6M | ||
ancient-greek-perseu..> | 2021-02-05 06:03 | 28M | ||
ancient-greek.tokeni..> | 2022-04-02 09:28 | 3.5M | ||
ancient-greek.zip | 2021-02-05 06:03 | 27M | ||
arabic.tokenize_cach..> | 2022-04-02 09:28 | 2.7M | ||
arabic.zip | 2021-02-05 06:03 | 37M | ||
armenian.tokenize_ca..> | 2022-04-02 09:28 | 1.2M | ||
armenian.zip | 2021-02-05 06:03 | 29M | ||
basque.tokenize_cach..> | 2022-04-02 09:28 | 1.3M | ||
basque.zip | 2021-02-05 06:03 | 26M | ||
belarusian.tokenize_..> | 2022-04-02 09:28 | 267K | ||
belarusian.zip | 2021-02-05 06:03 | 26M | ||
bulgarian.tokenize_c..> | 2022-04-02 09:28 | 2.4M | ||
bulgarian.zip | 2021-02-05 06:03 | 28M | ||
catalan.tokenize_cac..> | 2022-04-02 09:28 | 1.9M | ||
catalan.zip | 2021-02-05 06:03 | 26M | ||
chinese.tokenize_cac..> | 2022-04-02 09:28 | 606K | ||
chinese.zip | 2021-02-05 06:03 | 39M | ||
classical-chinese.to..> | 2022-04-02 09:28 | 147K | ||
classical-chinese.zip | 2021-02-05 06:03 | 37M | ||
croatian.tokenize_ca..> | 2022-04-02 09:28 | 2.1M | ||
croatian.zip | 2021-02-05 06:03 | 30M | ||
czech-cac.tokenize_c..> | 2022-04-02 09:28 | 4.9M | ||
czech-cac.zip | 2021-02-05 06:03 | 35M | ||
czech-cltt.tokenize_..> | 2022-04-02 09:28 | 313K | ||
czech-cltt.zip | 2021-02-05 06:03 | 27M | ||
czech-fictree.tokeni..> | 2022-04-02 09:28 | 1.8M | ||
czech-fictree.zip | 2021-02-05 06:03 | 34M | ||
czech.tokenize_cache..> | 2022-04-02 09:28 | 9.1M | ||
czech.zip | 2021-02-05 06:03 | 40M | ||
danish.tokenize_cach..> | 2022-04-02 09:28 | 1.1M | ||
danish.zip | 2021-02-05 06:03 | 25M | ||
dutch-lassysmall.tok..> | 2022-04-02 09:28 | 932K | ||
dutch-lassysmall.zip | 2021-02-05 06:03 | 25M | ||
dutch.tokenize_cache..> | 2022-04-02 09:28 | 1.8M | ||
dutch.zip | 2021-02-05 06:03 | 44M | ||
english-gum.tokenize..> | 2022-04-02 09:28 | 641K | ||
english-gum.zip | 2021-02-05 06:03 | 45M | ||
english-lines.tokeni..> | 2022-04-02 09:28 | 480K | ||
english-lines.zip | 2021-02-05 06:03 | 44M | ||
english-partut.token..> | 2022-04-02 09:28 | 412K | ||
english-partut.zip | 2021-02-05 06:03 | 45M | ||
english.tokenize_cac..> | 2022-04-02 09:28 | 1.2M | ||
english.zip | 2021-02-05 06:03 | 46M | ||
estonian-ewt.tokeniz..> | 2022-04-02 09:28 | 387K | ||
estonian-ewt.zip | 2021-02-05 06:03 | 26M | ||
estonian.tokenize_ca..> | 2022-04-02 09:28 | 5.4M | ||
estonian.zip | 2021-02-05 06:03 | 28M | ||
finnish-ftb.tokenize..> | 2022-04-02 09:28 | 2.9M | ||
finnish-ftb.zip | 2021-02-05 06:03 | 33M | ||
finnish.tokenize_cac..> | 2022-04-02 09:28 | 3.8M | ||
finnish.zip | 2021-02-05 06:03 | 32M | ||
french-partut.tokeni..> | 2022-04-02 09:28 | 238K | ||
french-partut.zip | 2021-02-05 06:03 | 36M | ||
french-sequoia.token..> | 2022-04-02 09:28 | 542K | ||
french-sequoia.zip | 2021-02-05 06:03 | 36M | ||
french-spoken.tokeni..> | 2022-04-02 09:28 | 150K | ||
french-spoken.zip | 2021-02-05 06:03 | 36M | ||
french.tokenize_cach..> | 2022-04-02 09:28 | 2.7M | ||
french.zip | 2021-02-05 06:03 | 38M | ||
galician-treegal.tok..> | 2022-04-02 09:28 | 252K | ||
galician-treegal.zip | 2021-02-05 06:03 | 26M | ||
galician.tokenize_ca..> | 2022-04-02 09:28 | 821K | ||
galician.zip | 2021-02-05 06:03 | 24M | ||
german-hdt.tokenize_..> | 2022-04-02 09:28 | 13M | ||
german-hdt.zip | 2021-02-05 06:03 | 49M | ||
german.tokenize_cach..> | 2022-04-02 09:28 | 3.6M | ||
german.zip | 2021-02-05 06:03 | 47M | ||
greek.tokenize_cache..> | 2022-04-02 09:28 | 922K | ||
greek.zip | 2021-02-05 06:03 | 26M | ||
hebrew.tokenize_cach..> | 2022-04-02 09:28 | 1.6M | ||
hebrew.zip | 2021-02-05 06:03 | 27M | ||
hindi.tokenize_cache..> | 2022-04-02 09:28 | 1.7M | ||
hindi.zip | 2021-02-05 06:03 | 26M | ||
hungarian.tokenize_c..> | 2022-04-02 09:28 | 566K | ||
hungarian.zip | 2021-02-05 06:03 | 29M | ||
indonesian.tokenize_..> | 2022-04-02 09:28 | 1.1M | ||
indonesian.zip | 2021-02-05 06:03 | 25M | ||
irish.tokenize_cache..> | 2022-04-02 09:28 | 364K | ||
irish.zip | 2021-02-05 06:03 | 26M | ||
italian-partut.token..> | 2022-04-02 09:28 | 515K | ||
italian-partut.zip | 2021-02-05 06:03 | 27M | ||
italian-postwita.tok..> | 2022-04-02 09:28 | 1.2M | ||
italian-postwita.zip | 2021-02-05 06:03 | 28M | ||
italian-twittiro.tok..> | 2022-04-02 09:28 | 362K | ||
italian-twittiro.zip | 2021-02-05 06:03 | 28M | ||
italian-vit.tokenize..> | 2022-04-02 09:28 | 1.4M | ||
italian-vit.zip | 2021-02-05 06:03 | 27M | ||
italian.tokenize_cac..> | 2022-04-02 09:28 | 1.8M | ||
italian.zip | 2021-02-05 06:03 | 27M | ||
japanese.tokenize_ca..> | 2022-04-02 09:28 | 885K | ||
japanese.zip | 2021-02-05 06:03 | 26M | ||
kazakh.tokenize_cach..> | 2022-04-02 09:28 | 28K | ||
kazakh.zip | 2021-02-05 06:03 | 24M | ||
korean-kaist.tokeniz..> | 2022-04-02 09:28 | 7.1M | ||
korean-kaist.zip | 2021-02-05 06:03 | 34M | ||
korean.tokenize_cach..> | 2022-04-02 09:28 | 2.2M | ||
korean.zip | 2021-02-05 06:03 | 29M | ||
kurmanji.tokenize_ca..> | 2022-04-02 09:28 | 10K | ||
kurmanji.zip | 2021-02-05 06:03 | 23M | ||
latin-perseus.tokeni..> | 2022-04-02 09:28 | 437K | ||
latin-perseus.zip | 2021-02-05 06:03 | 25M | ||
latin-proiel.tokeniz..> | 2022-04-02 09:28 | 1.8M | ||
latin-proiel.zip | 2021-02-05 06:03 | 27M | ||
latin.tokenize_cache..> | 2022-04-02 09:28 | 1.0M | ||
latin.zip | 2021-02-05 06:03 | 33M | ||
latvian.tokenize_cac..> | 2022-04-02 09:28 | 2.7M | ||
latvian.zip | 2021-02-05 06:03 | 33M | ||
lithuanian-hse.token..> | 2022-04-02 09:28 | 101K | ||
lithuanian-hse.zip | 2021-02-05 06:03 | 25M | ||
lithuanian.tokenize_..> | 2022-04-02 09:28 | 1.0M | ||
lithuanian.zip | 2021-02-05 06:03 | 29M | ||
marathi.tokenize_cac..> | 2022-04-02 09:28 | 85K | ||
marathi.zip | 2021-02-05 06:03 | 26M | ||
norwegian-bokmaal.to..> | 2022-04-02 09:28 | 2.1M | ||
norwegian-bokmaal.zip | 2021-02-05 06:03 | 26M | ||
norwegian-nynorsk.to..> | 2022-04-02 09:28 | 2.0M | ||
norwegian-nynorsk.zip | 2021-02-05 06:03 | 26M | ||
norwegian-nynorsklia..> | 2022-04-02 09:28 | 183K | ||
norwegian-nynorsklia..> | 2021-02-05 06:03 | 25M | ||
old-french.tokenize_..> | 2022-04-02 09:28 | 1.0M | ||
old-french.zip | 2021-02-05 06:03 | 24M | ||
old-russian.tokenize..> | 2022-04-02 09:28 | 2.6M | ||
old-russian.zip | 2021-02-05 06:03 | 27M | ||
persian.tokenize_cac..> | 2022-04-02 09:28 | 1.1M | ||
persian.zip | 2021-02-05 06:03 | 26M | ||
polish-lfg.tokenize_..> | 2022-04-02 09:28 | 2.0M | ||
polish-lfg.zip | 2021-02-05 06:03 | 30M | ||
polish.tokenize_cach..> | 2022-04-02 09:28 | 4.1M | ||
polish.zip | 2021-02-05 06:03 | 37M | ||
portuguese-gsd.token..> | 2022-04-02 09:28 | 1.8M | ||
portuguese-gsd.zip | 2021-02-05 06:03 | 26M | ||
portuguese.tokenize_..> | 2022-04-02 09:28 | 1.7M | ||
portuguese.zip | 2021-02-05 06:03 | 27M | ||
romanian-nonstandard..> | 2022-04-02 09:29 | 1.4M | ||
romanian-nonstandard..> | 2021-02-05 06:03 | 29M | ||
romanian.tokenize_ca..> | 2022-04-02 09:29 | 2.1M | ||
romanian.zip | 2021-02-05 06:03 | 29M | ||
russian-gsd.tokenize..> | 2022-04-02 09:29 | 2.5M | ||
russian-gsd.zip | 2021-02-05 06:03 | 37M | ||
russian-taiga.tokeni..> | 2022-04-02 09:29 | 681K | ||
russian-taiga.zip | 2021-02-05 06:03 | 36M | ||
russian.tokenize_cac..> | 2022-04-02 09:29 | 11M | ||
russian.zip | 2021-02-05 06:03 | 38M | ||
scottish-gaelic.toke..> | 2022-04-02 09:29 | 269K | ||
scottish-gaelic.zip | 2021-02-05 06:03 | 26M | ||
serbian.tokenize_cac..> | 2022-04-02 09:29 | 1.1M | ||
serbian.zip | 2021-02-05 06:03 | 28M | ||
slovak.tokenize_cach..> | 2022-04-02 09:29 | 1.4M | ||
slovak.zip | 2021-02-05 06:03 | 31M | ||
slovenian-sst.tokeni..> | 2022-04-02 09:29 | 264K | ||
slovenian-sst.zip | 2021-02-05 06:03 | 29M | ||
slovenian.tokenize_c..> | 2022-04-02 09:29 | 1.9M | ||
slovenian.zip | 2021-02-05 06:03 | 30M | ||
spanish-gsd.tokenize..> | 2022-04-02 09:29 | 3.0M | ||
spanish-gsd.zip | 2021-02-05 06:03 | 36M | ||
spanish.tokenize_cac..> | 2022-04-02 09:29 | 2.4M | ||
spanish.zip | 2021-02-05 06:03 | 36M | ||
swedish-lines.tokeni..> | 2022-04-02 09:29 | 719K | ||
swedish-lines.zip | 2021-02-05 06:03 | 27M | ||
swedish.tokenize_cac..> | 2022-04-02 09:29 | 926K | ||
swedish.zip | 2021-02-05 06:03 | 26M | ||
tamil.tokenize_cache..> | 2022-04-02 09:29 | 363K | ||
tamil.zip | 2021-02-05 06:03 | 25M | ||
telugu.tokenize_cach..> | 2022-04-02 09:29 | 207K | ||
telugu.zip | 2021-02-05 06:03 | 25M | ||
traditional-chinese...> | 2022-04-02 09:29 | 614K | ||
traditional-chinese.zip | 2021-02-05 06:03 | 39M | ||
turkish.tokenize_cac..> | 2022-04-02 09:29 | 963K | ||
turkish.zip | 2021-02-05 06:03 | 27M | ||
ukrainian.tokenize_c..> | 2022-04-02 09:29 | 2.6M | ||
ukrainian.zip | 2021-02-05 06:03 | 34M | ||
urdu.tokenize_cache...> | 2022-04-02 09:29 | 696K | ||
urdu.zip | 2021-02-05 06:03 | 25M | ||
uyghur.tokenize_cach..> | 2022-04-02 09:29 | 748K | ||
uyghur.zip | 2021-02-05 06:03 | 26M | ||
vietnamese-vtb.token..> | 2022-04-02 09:29 | 2.9M | ||
vietnamese-vtb.zip | 2021-02-05 06:03 | 22M | ||
vietnamese.tokenize_..> | 2022-04-02 09:29 | 2.9M | ||
vietnamese.zip | 2021-02-05 06:03 | 31M | ||