package mining

import "git.sr.ht/~shulhan/pakakeh.go/lib/mining"

Package mining provides library for data mining and statistic related to machine learning.

Classifiers,

Resampling,

Miscellaneous,

Index

Variables

var BadWords = []string{
	"666", "da", "dont", "dosent", "whatever", "guy", "hi", "nazi", "sup",
	"guise", "loser", "thats", "ugly", "wanna", "whats", "wont", "gotta",
	"bloody", "fart", "pot", "prick", "stink", "smells", "smelly", "alot",
	"dunno", "gotcha",
}

BadWords contain list of colloquial words or bad writing words.

var BiasedWords = []string{
	"cutting-edge", "single-handedly", "well-established", "well-known",
	"world-class", "absolute", "acclaimed", "amazing", "astonishing",
	"authoritative", "beautiful", "best", "boreing", "boring",
	"brilliant", "canonical", "cares", "celebrated", "charismatic",
	"classic", "coolest", "defining", "definitive", "eminent", "enigma",
	"ever", "everyone", "exciting", "extraordinary", "fabulous", "famous",
	"fantastic", "fat", "fully", "genius", "global", "great", "greatest",
	"hate", "huge", "iconic", "idiotic", "immensely", "impactful",
	"incendiary", "indisputable", "infamous", "influential", "innovative",
	"inspired", "intriguing", "lame", "leader", "leading", "legendary",
	"like", "major", "masterly", "mature", "memorable", "most", "notable",
	"outstanding", "pioneer", "popular", "prestigious", "probably",
	"really", "remarkable", "renowned", "respected", "seminal",
	"significant", "skillful", "solution", "staunch", "strange", "super",
	"talented", "top", "total", "totally", "transcendent", "ugly",
	"undoubtedly", "unique", "virtually", "virtuoso", "visionary",
	"weird", "worst",
}

BiasedWords contain list of colloquial words with high bias.

var PronounWords = []string{
	"i", "me", "mine", "my", "myself", "our", "ours", "ourself",
	"ourselves", "selves", "thee", "thine", "thou", "thy", "thyself",
	"us", "we", "y'all", "y'all", "y'all's", "yis", "you", "you-uns",
	"your", "yours", "yourself", "yourselves", "yourselves", "yous",
	"yous's", "youse", "youse",
}

PronounWords contains list of first and second person pronouns including slangs.

var SexWords = []string{
	"anal", "breast", "breasts", "buttocks", "dildo", "dildos", "erect",
	"nipple", "nipples", "penis", "sex", "sodomized", "sodomy", "vagina",
	"vibrator", "vibrators",
}

SexWords contain list of non-vulgar sex-related words.

var VulgarWords = []string{ /* 779 elements not displayed */

}

VulgarWords contain list of vulgar and offensive words in informal and slangs.

Source Files

dict.go mining.go

Directories

PathSynopsis
lib/mining/classifierPackage classifier provides machine learning classifier library, including CART, Random Forest, Cascaded Random Forest, and KNN.
lib/mining/classifier/cartPackage cart implement the Classification and Regression Tree by Breiman, et al.
lib/mining/classifier/crfPackage crf implement the cascaded random forest algorithm, proposed by Baumann et.al in their paper:
lib/mining/classifier/rfPackage rf implement ensemble of classifiers using random forest algorithm by Breiman and Cutler.
lib/mining/gain
lib/mining/gain/giniPackage gini contain function to calculating Gini gain.
lib/mining/knnPackage knn implement the K Nearest Neighbor using Euclidean to compute the distance between samples.
lib/mining/resamplingPackage resampling provide common interface, constants, and methods for resampling modules.
lib/mining/resampling/lnsmotePackage lnsmote implement the Local-Neighborhood algorithm from the paper,
lib/mining/resampling/smotePackage smote resamples a dataset by applying the Synthetic Minority Oversampling TEchnique (SMOTE).
lib/mining/tree
lib/mining/tree/binaryPackage binary contain implementation of binary tree.
Version
v0.60.0 (latest)
Published
Feb 1, 2025
Platform
linux/amd64
Last checked
10 hours ago

Tools for package owners.