sentences.v1gopkg.in/neurosnap/sentences.v1/english Index | Files

package english

import "gopkg.in/neurosnap/sentences.v1/english"

Index

Functions

func NewSentenceTokenizer

func NewSentenceTokenizer(s *sentences.Storage) (*sentences.DefaultSentenceTokenizer, error)

English customized sentence tokenizer.

Types

type MultiPunctWordAnnotation

type MultiPunctWordAnnotation struct {
	*sentences.Storage
	sentences.TokenParser
	sentences.TokenGrouper
	sentences.Ortho
}

Attempts to tease out custom Abbreviations, e.g. F.B.I.

func (*MultiPunctWordAnnotation) Annotate

func (a *MultiPunctWordAnnotation) Annotate(tokens []*sentences.Token) []*sentences.Token

type WordTokenizer

type WordTokenizer struct {
	sentences.DefaultWordTokenizer
}

func NewWordTokenizer

func NewWordTokenizer(p sentences.PunctStrings) *WordTokenizer

func (*WordTokenizer) HasSentEndChars

func (e *WordTokenizer) HasSentEndChars(t *sentences.Token) bool

Find any punctuation excluding the period final

Source Files

main.go

Version
v1.0.7 (latest)
Published
May 26, 2021
Platform
darwin/amd64
Imports
4 packages
Last checked
3 hours ago

Tools for package owners.