Package: unidic-mecab (3.1.1-1)

Dictionary for MeCab (Corpus of Contemporary Written Japanese)

unidic-mecab is a dictionary for MeCab (a Japanese morphological analyzer), based on the Corpus of Contemporary Written Japanese (upstream publishes it as unidic-cwj).

 * All entries are based on the definition of "SUW (short-unit word)"
   specified by NINJAL (The National Institute for Japanese Language and
   Linguistics), which provides word segmentation into units of uniform
   size, suited to linguistic research.
 * It has a three-layered structure:
    - lemma
    - form
    - spelling
   This allows a clear distinction between two types of word variant:
   spelling variants and form variants.
 * It is useful for speech-processing research, since entries can be
   annotated with accent and sound-change information.
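
The three-layered structure described above can be pictured as a small tree: a lemma groups several forms, and each form groups several spellings. The following is a minimal sketch in Python, not the dictionary's actual file format or any MeCab API. The class names, the helper classify_variants, and the sample entry are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Form:
    """Middle layer: one form (sound shape) of a lemma."""
    reading: str
    # Bottom layer: the orthographic spellings of this form.
    spellings: List[str] = field(default_factory=list)


@dataclass
class Lemma:
    """Top layer: the headword that groups all of its variants."""
    headword: str
    forms: List[Form] = field(default_factory=list)


def classify_variants(lemma: Lemma, a: str, b: str) -> str:
    """Say how two spellings of the same lemma relate to each other.

    Same form, different spelling  -> "spelling variant"
    Different forms                -> "form variant"
    """
    # Map every spelling to the reading of the form it belongs to.
    form_of = {s: f.reading for f in lemma.forms for s in f.spellings}
    for surface in (a, b):
        if surface not in form_of:
            raise KeyError(f"{surface!r} is not a spelling of {lemma.headword!r}")
    if a == b:
        return "identical"
    return "spelling variant" if form_of[a] == form_of[b] else "form variant"


# Illustrative entry only (an assumption, not copied from the dictionary data).
yahari = Lemma(
    headword="矢張り",
    forms=[
        Form(reading="ヤハリ", spellings=["やはり", "矢張り"]),
        Form(reading="ヤッパリ", spellings=["やっぱり"]),
    ],
)
```

With this sample entry, classify_variants(yahari, "やはり", "矢張り") reports a spelling variant, because both spellings belong to the same form, while classify_variants(yahari, "やはり", "やっぱり") reports a form variant.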

This package is huge: you need more than 10 GB of free space to download and install it.
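
Because of this space requirement, it may be worth checking free disk space before installing. The sketch below uses only the Python standard library. The threshold constant, the function names, and the choice of path are assumptions. In particular, "10 GB" is read here as decimal gigabytes, and the filesystem to check depends on where the package cache and the installed files live on your system.

```python
import shutil

# Assumption: "10 GB" interpreted as 10 * 10**9 bytes (decimal gigabytes).
REQUIRED_BYTES = 10 * 10**9


def has_enough_space(free_bytes: int, required_bytes: int = REQUIRED_BYTES) -> bool:
    """Return True if free_bytes strictly exceeds the required amount."""
    return free_bytes > required_bytes


def check_path(path: str = "/") -> bool:
    """Check the filesystem that contains `path` (hypothetical default)."""
    return has_enough_space(shutil.disk_usage(path).free)


if __name__ == "__main__":
    if check_path("/"):
        print("Enough free space for unidic-mecab.")
    else:
        print("Less than the stated 10 GB free; free up space first.")
```

If the package cache and the install location are on different filesystems, call check_path once per filesystem.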

Download unidic-mecab

Download for all available architectures

Architecture  Package size     Installed size   Files
all           1,017,114.4 kB   5,057,212.0 kB   [list of files]