package mining
import "git.sr.ht/~shulhan/pakakeh.go/lib/mining"
Package mining provides library for data mining and statistic related to machine learning.
Classifiers,
- CART
- Random Forest
- Cascaded Random Forest
- K-Nearest Neighbourhood
Resampling,
- SMOTE
- LN-SMOTE (Local Neigbourhood SMOTE)
Miscellaneous,
- Gini index
Index ¶
Variables ¶
var BadWords = []string{ "666", "da", "dont", "dosent", "whatever", "guy", "hi", "nazi", "sup", "guise", "loser", "thats", "ugly", "wanna", "whats", "wont", "gotta", "bloody", "fart", "pot", "prick", "stink", "smells", "smelly", "alot", "dunno", "gotcha", }
BadWords contain list of colloquial words or bad writing words.
var BiasedWords = []string{ "cutting-edge", "single-handedly", "well-established", "well-known", "world-class", "absolute", "acclaimed", "amazing", "astonishing", "authoritative", "beautiful", "best", "boreing", "boring", "brilliant", "canonical", "cares", "celebrated", "charismatic", "classic", "coolest", "defining", "definitive", "eminent", "enigma", "ever", "everyone", "exciting", "extraordinary", "fabulous", "famous", "fantastic", "fat", "fully", "genius", "global", "great", "greatest", "hate", "huge", "iconic", "idiotic", "immensely", "impactful", "incendiary", "indisputable", "infamous", "influential", "innovative", "inspired", "intriguing", "lame", "leader", "leading", "legendary", "like", "major", "masterly", "mature", "memorable", "most", "notable", "outstanding", "pioneer", "popular", "prestigious", "probably", "really", "remarkable", "renowned", "respected", "seminal", "significant", "skillful", "solution", "staunch", "strange", "super", "talented", "top", "total", "totally", "transcendent", "ugly", "undoubtedly", "unique", "virtually", "virtuoso", "visionary", "weird", "worst", }
BiasedWords contain list of colloquial words with high bias.
var PronounWords = []string{ "i", "me", "mine", "my", "myself", "our", "ours", "ourself", "ourselves", "selves", "thee", "thine", "thou", "thy", "thyself", "us", "we", "y'all", "y'all", "y'all's", "yis", "you", "you-uns", "your", "yours", "yourself", "yourselves", "yourselves", "yous", "yous's", "youse", "youse", }
PronounWords contains list of first and second person pronouns including slangs.
var SexWords = []string{ "anal", "breast", "breasts", "buttocks", "dildo", "dildos", "erect", "nipple", "nipples", "penis", "sex", "sodomized", "sodomy", "vagina", "vibrator", "vibrators", }
SexWords contain list of non-vulgar sex-related words.
var VulgarWords = []string{ /* 779 elements not displayed */ }
VulgarWords contain list of vulgar and offensive words in informal and slangs.
Source Files ¶
Directories ¶
Path | Synopsis |
---|---|
lib/mining/classifier | Package classifier provides machine learning classifier library, including CART, Random Forest, Cascaded Random Forest, and KNN. |
lib/mining/classifier/cart | Package cart implement the Classification and Regression Tree by Breiman, et al. |
lib/mining/classifier/crf | Package crf implement the cascaded random forest algorithm, proposed by Baumann et.al in their paper: |
lib/mining/classifier/rf | Package rf implement ensemble of classifiers using random forest algorithm by Breiman and Cutler. |
lib/mining/gain | |
lib/mining/gain/gini | Package gini contain function to calculating Gini gain. |
lib/mining/knn | Package knn implement the K Nearest Neighbor using Euclidean to compute the distance between samples. |
lib/mining/resampling | Package resampling provide common interface, constants, and methods for resampling modules. |
lib/mining/resampling/lnsmote | Package lnsmote implement the Local-Neighborhood algorithm from the paper, |
lib/mining/resampling/smote | Package smote resamples a dataset by applying the Synthetic Minority Oversampling TEchnique (SMOTE). |
lib/mining/tree | |
lib/mining/tree/binary | Package binary contain implementation of binary tree. |
- Version
- v0.60.0 (latest)
- Published
- Feb 1, 2025
- Platform
- linux/amd64
- Last checked
- 10 hours ago –
Tools for package owners.