pyclustering icon indicating copy to clipboard operation
pyclustering copied to clipboard

pyclustering is a Python, C++ data mining library.

Results 75 pyclustering issues
Sort by recently updated
recently updated
newest added

**Introduction** There is a discussion regarding to purity estimation - see #620 . In order to cover this proposal, F-Measure should be implemented. **References** Article: The truth of the F-measure....

Enhancement
Good First Issue

**Introduction** There is a discussion regarding to purity estimation - see #620 . In order to cover this proposal, V-Measure should be implemented. **References** Article: V-Measure: A conditional entropy-based external...

Enhancement
Good First Issue

**Introduction** There are two logo for pyclustering and both are done via none vector graphic. **Description** - [ ] Create new logo using vector graphic, for example, SVG.

Documentaion

**Introduction** Documentation should contains information about changes that have been introduced. It can be automatically generated from CHANGELOG file. **Description** Script should be implemented in order to generate doxygen page...

Documentaion

**Description** Coverity Scan should be introduced for pyclustering library. In line policy, it shouldn't be run every commit therefore there are several variants how to do that:: 1. Nightly cron...

Continuous Integration

**Introduction** Distinguish UPGMA and WPGMA for Python and C++ parts of the library. **Description** Currently, UPGMA is implemented and called as average linkage. - [ ] Documentation revise; - [...

Enhancement
Documentaion
Good First Issue

**Introduction** 'random_state' parameter should be introduced in order to provide control to random seed value that is used during the processing. **Description** - [ ] `random_state` parameter should be passed...

Enhancement
Good First Issue

**Introduction** As a tester or developer, I want to have correct answer for Hepta FCPS sample so that I can use it in test scenarios to improve quality of the...

Testing
Good First Issue

**Introduction** Current python implementation `pyclustering.cluster.clique` is not optimal in case of high dimensional data. See complaint: #634 **Description** Algorithm should work for high-dimensional data. No need to build blocks physically,...

Optimization

**Introduction** The current problem is following: in case of incorrect input data or due to some other reason (even unexpected) - C++ code throws exception that is not captured by...

Enhancement
Good First Issue