Cute Gopher Mascot

The Best Go Libraries For Natural Language Processing - Tokenizers (9)

Discover the best Go libraries for Natural Language Processing in Tokenizers! Find the perfect tools to streamline your development and boost productivity. From MMSEGO to shamoji, we've got you covered. Let the coding begin!

MMSEGO

This is a GO implementation of [MMSEG](http://technology.chtsai.org/mmseg/) which a Chinese word splitting algorithm

Discover More! 🚀

shamoji

The shamoji is word filtering package written in Go

Discover More! 🚀

stemmer

Stemmer packages for Go programming language. Includes English and German stemmers

Discover More! 🚀

textcat

Go package for n-gram based text categorization, with support for utf-8 and raw text

Discover More! 🚀

gse

Go efficient text segmentation; support english, chinese, japanese and other

Discover More! 🚀

segment

Go library for performing Unicode Text Segmentation as described in [Unicode Standard Annex #29](https://www.unicode.org/reports/tr29/)

Discover More! 🚀

sentences

Sentence tokenizer: converts text into a list of sentences

Discover More! 🚀

gojieba

This is a Go implementation of [jieba](https://github.com/fxsjy/jieba) which a Chinese word splitting algorithm

Discover More! 🚀

gotokenizer

A tokenizer based on the dictionary and Bigram language models for Golang. (Now only support chinese segmentation)

Discover More! 🚀