Compute Natural Breaks in Python (Fisher-Jenks algorithm)
Updated Feb 14, 2025 (Python)
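As a rough illustration of what a natural-breaks classifier computes, here is a minimal sketch of the dynamic-programming idea behind Fisher-Jenks: sort the values, then choose class boundaries that minimize the total within-class sum of squared deviations. The function name `natural_breaks` and its return convention (the upper bound of each class) are assumptions made for this example, not the API of the repository listed above, and this cubic-time version is only meant for small inputs.

```python
from typing import List, Sequence


def natural_breaks(values: Sequence[float], n_classes: int) -> List[float]:
    """Return the upper bound of each class (hypothetical helper, exact DP)."""
    data = sorted(values)
    n = len(data)
    if not 1 <= n_classes <= n:
        raise ValueError("n_classes must be between 1 and len(values)")

    # Prefix sums let us compute the squared deviation of any slice in O(1).
    prefix = [0.0] * (n + 1)
    prefix_sq = [0.0] * (n + 1)
    for i, x in enumerate(data):
        prefix[i + 1] = prefix[i] + x
        prefix_sq[i + 1] = prefix_sq[i] + x * x

    def ssd(i: int, j: int) -> float:
        """Sum of squared deviations from the mean for data[i:j]."""
        s = prefix[j] - prefix[i]
        sq = prefix_sq[j] - prefix_sq[i]
        return sq - s * s / (j - i)

    inf = float("inf")
    # cost[k][j]: minimal total SSD when data[:j] is split into k classes.
    cost = [[inf] * (n + 1) for _ in range(n_classes + 1)]
    # split[k][j]: start index of the last class in that optimal split.
    split = [[0] * (n + 1) for _ in range(n_classes + 1)]
    cost[0][0] = 0.0
    for k in range(1, n_classes + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                c = cost[k - 1][i] + ssd(i, j)
                if c < cost[k][j]:
                    cost[k][j] = c
                    split[k][j] = i

    # Walk back through the split table to recover each class's upper bound.
    bounds = []
    j = n
    for k in range(n_classes, 0, -1):
        bounds.append(data[j - 1])
        j = split[k][j]
    return bounds[::-1]
```

For example, `natural_breaks([1, 2, 3, 10, 11, 12, 50, 51, 52], 3)` groups the three obvious clusters and returns their upper bounds. Production libraries use faster implementations; this sketch favors readability over speed.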
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
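📊 Common data mining algorithms: the Apriori algorithm for association analysis, the decision tree algorithm for data classification, and the K-means algorithm for data clustering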
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)
BinGuru is an open-source TypeScript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
Visual Knowledge Discovery demo tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.
Two different approaches to predicting customer churn and identifying the important variables that drive churn
Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.
Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.
Neural Network Deep learning specialization course offered via Coursera
ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB
Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of a high case-to-parameter ratio.
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
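To make the task in the entry above concrete, the sketch below guesses a data type from a property name using a few keyword rules. It is a naive stand-in written for illustration, not the method used by the listed repository; the function name `guess_type`, the keyword sets, and the `Decimal` label are all assumptions.

```python
import re

# Hypothetical keyword rules; a real project would likely learn these from data.
BOOLEAN_PREFIXES = {"is", "has", "can"}
INTEGER_WORDS = {"count", "quantity", "age", "index"}
DECIMAL_WORDS = {"amount", "price", "total"}


def split_words(name: str) -> list:
    """Split a CamelCase or camelCase identifier into lowercase words."""
    parts = re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", name)
    return [p.lower() for p in parts]


def guess_type(name: str) -> str:
    """Return a guessed type label for a property name (illustrative only)."""
    words = split_words(name)
    if not words:
        return "String"
    if words[0] in BOOLEAN_PREFIXES:
        return "Boolean"
    if any(w in INTEGER_WORDS for w in words):
        return "Integer"
    if any(w in DECIMAL_WORDS for w in words):
        return "Decimal"
    # Fall back to the most common type when no rule matches.
    return "String"
```

With these rules, `guess_type("BrandName")` falls through to `"String"`, while `guess_type("AmountReceived")` matches the assumed `amount` keyword and yields `"Decimal"`.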
This project classifies images from the CIFAR-10 dataset. The dataset consists of images of airplanes, dogs, cats, and other objects.
This data analysis notebook demonstrates lossless and lossy visualization techniques and classification methods. We demonstrate the analysis of scientific data on hot-swappable datasets.
Machine learning classifier comparison GUI application. Choose from 21 classifiers, evaluation data (optional, for evaluating synthetic data), hyperparameters, cross-validation splits, and an RNG seed; the application tabulates the best, worst, average, and standard deviation of Accuracy/F1/Recall and visualizes them in Parallel Coordinates.