package chunk

import "github.com/jdkato/prose/chunk"

Package chunk implements functions for finding useful chunks in text that has already been tagged with parts of speech.

Example ¶

Code:

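{
	// Chunk and TreebankNamedEntities are unqualified because the example
	// is written within package chunk; tokenize and tag are the sibling
	// prose packages.
	txt := "Go is a open source programming language created at Google."

	// Split the text into words, then tag each word with its part of speech.
	words := tokenize.TextToWords(txt)
	tagger := tag.NewPerceptronTagger()

	// Extract the named entities from the tagged tokens.
	fmt.Println(Chunk(tagger.Tag(words), TreebankNamedEntities))
	// Output: [Go Google]
}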

Output:

[Go Google]

Index ¶

Variables
func Chunk(tagged []tag.Token, rx *regexp.Regexp) []string
func Locate(tagged []tag.Token, rx *regexp.Regexp) [][]int

Examples ¶

package

Variables ¶

var TreebankNamedEntities = regexp.MustCompile(
	`((CD__)*(NNP.)+(CD__|NNP.)*)+` +
		`((IN__)*(CD__)*(NNP.)+(CD__|NNP.)*)*`)

TreebankNamedEntities matches proper names, excluding any preceding adjectives. A match may include numbers and a link formed by a preposition or subordinating conjunction (for example, "Bank of England").
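The literals in the pattern (CD__, IN__, NNP.) suggest that it is meant to run over a string of part-of-speech tags, each occupying four characters, rather than over the text itself. The following is a minimal sketch under that assumption. The padTags helper is hypothetical and not part of this package, the pattern is copied from the declaration above so the sketch runs with the standard library alone, and the tags are assigned by hand rather than produced by a tagger.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// Copy of the pattern shown above, so the sketch runs without the prose module.
var namedEntities = regexp.MustCompile(
	`((CD__)*(NNP.)+(CD__|NNP.)*)+` +
		`((IN__)*(CD__)*(NNP.)+(CD__|NNP.)*)*`)

// padTags is a hypothetical helper: it pads each tag with "_" to four
// characters (truncating longer tags) and concatenates the result. The
// padding scheme is an assumption read off the pattern, not a documented API.
func padTags(tags []string) string {
	var b strings.Builder
	for _, t := range tags {
		if len(t) > 4 {
			t = t[:4]
		}
		b.WriteString(t + strings.Repeat("_", 4-len(t)))
	}
	return b.String()
}

func main() {
	// Hand-assigned Penn Treebank tags for "the Bank of England raised rates".
	tags := []string{"DT", "NNP", "IN", "NNP", "VBD", "NNS"}
	s := padTags(tags)

	fmt.Println(s)                                  // prints DT__NNP_IN__NNP_VBD_NNS_
	fmt.Println(namedEntities.FindAllString(s, -1)) // prints [NNP_IN__NNP_]
}
```

The single match covers the tags NNP, IN, NNP, which correspond to "Bank of England"; the leading determiner and the trailing verb and noun are left out.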

Functions ¶

func Chunk ¶

func Chunk(tagged []tag.Token, rx *regexp.Regexp) []string

Chunk returns a slice containing the chunks of interest according to the regexp.

This is a convenience wrapper around Locate. Use Locate directly if you need access to the in-text locations of each chunk.
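To illustrate what such a wrapper might do with locations, here is a rough sketch. It assumes each location is a [start, end) pair of indices into the tagged slice, which this page does not state. The token type is a local stand-in for tag.Token, and joinSpans is a hypothetical name, not a function of this package.

```go
package main

import (
	"fmt"
	"strings"
)

// token is a local stand-in for tag.Token so the sketch has no dependencies.
type token struct {
	Tag  string
	Text string
}

// joinSpans is hypothetical: it assumes each location is a [start, end) pair
// of token indices and joins the token texts in that range with spaces.
func joinSpans(tagged []token, locs [][]int) []string {
	chunks := make([]string, 0, len(locs))
	for _, loc := range locs {
		words := make([]string, 0, loc[1]-loc[0])
		for _, t := range tagged[loc[0]:loc[1]] {
			words = append(words, t.Text)
		}
		chunks = append(chunks, strings.Join(words, " "))
	}
	return chunks
}

func main() {
	// Tags assigned by hand for illustration.
	tagged := []token{
		{"NNP", "Bank"}, {"IN", "of"}, {"NNP", "England"},
		{"VBD", "raised"}, {"NNS", "rates"},
	}
	// One assumed location covering the first three tokens.
	fmt.Println(joinSpans(tagged, [][]int{{0, 3}})) // prints [Bank of England]
}
```

Working from the locations rather than the joined strings keeps the original token boundaries available, for example to highlight a chunk in the source text.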

func Locate ¶

func Locate(tagged []tag.Token, rx *regexp.Regexp) [][]int

Locate finds the chunks of interest according to the regexp.
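The page does not describe the shape of the result. One plausible reading, sketched below, is that each inner slice is a [start, end) pair of token indices. The sketch assumes the regexp is matched against a string in which every tag occupies exactly four bytes (as literals such as CD__ and NNP. in TreebankNamedEntities suggest), so byte offsets can be divided by four. Both toTokenSpans and the simplified properNouns pattern are hypothetical and exist only for this illustration.

```go
package main

import (
	"fmt"
	"regexp"
)

// Stand-in pattern over four-character tag fields: one or more proper nouns.
var properNouns = regexp.MustCompile(`(NNP.)+`)

// toTokenSpans is a hypothetical helper. It assumes every tag occupies exactly
// four bytes in the searched string, so byte offsets divide evenly by four.
func toTokenSpans(byteSpans [][]int) [][]int {
	spans := make([][]int, 0, len(byteSpans))
	for _, b := range byteSpans {
		spans = append(spans, []int{b[0] / 4, b[1] / 4})
	}
	return spans
}

func main() {
	// Hand-padded tags for "Rob Pike works at Google".
	padded := "NNP_NNP_VBZ_IN__NNP_"

	byteSpans := properNouns.FindAllStringIndex(padded, -1)
	fmt.Println(byteSpans)               // prints [[0 8] [16 20]]
	fmt.Println(toTokenSpans(byteSpans)) // prints [[0 2] [4 5]]
}
```

Under these assumptions the two spans select tokens 0 and 1 ("Rob Pike") and token 4 ("Google").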

Source Files ¶

chunk.go

Version: v1.2.0
Published: Jun 16, 2020
Platform: darwin/amd64
Imports: 2 packages
