-
-
Notifications
You must be signed in to change notification settings - Fork 1k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
fc6b492
commit 5dcaa8a
Showing
4 changed files
with
48 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
46 changes: 46 additions & 0 deletions
46
doc/visual-programming/source/widgets/unsupervised/DBSCAN.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,46 @@ | ||
DBSCAN | ||
====== | ||
|
||
Groups items using the DBSCAN clustering algorithm. | ||
|
||
**Inputs** | ||
|
||
- Data: input dataset | ||
|
||
**Outputs** | ||
|
||
- Data: dataset with cluster index as a class attribute | ||
|
||
The widget applies the | ||
[DBSCAN clustering](https://en.wikipedia.org/wiki/DBSCAN) algorithm to | ||
the data and outputs a new dataset with cluster indices as a meta | ||
attribute. The widget also shows the sorted graph with distances to | ||
k-th nearest neighbors. With k values set to **Core point neighbors** | ||
as suggested in the | ||
[methods article](https://www.aaai.org/Papers/KDD/1996/KDD96-037.pdf). | ||
This gives the user the idea of an | ||
ideal selection for **Neighborhood distance** setting. As suggested by | ||
authors this parameter should be set to the first value in the first | ||
"valley" in the graph. | ||
|
||
![](images/dbscan-stamped.png) | ||
|
||
1. Set *minimal number of core neighbors* for a cluster and *maximal | ||
neighborhood distance. | ||
2. Set the distance metric that is used in grouping the items. | ||
3. If *Apply Automatically* is ticked, the widget will commit changes | ||
automatically. Alternatively, click *Apply*. | ||
4. The graph shows the distance to the k-th nearest neighbor. *k* is | ||
set by the **Core point neighbor** option. With moving the black slider | ||
left and right you can select the right **Neighbourhood distance**. | ||
|
||
Example | ||
------- | ||
|
||
In the following example, we connected the File widget with selected | ||
Iris dataset to the DBSCAN widget. In the DBSCAN widget, we set | ||
**Core points neighbors** parameter to 5. And select the | ||
**Neighbourhood distance** to the value in the first "valley" in the | ||
graph. We show clusters in the Scatter Plot widget. | ||
|
||
![](images/dbscan-example.png) |
Binary file added
BIN
+74.3 KB
doc/visual-programming/source/widgets/unsupervised/images/dbscan-example.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+33.7 KB
doc/visual-programming/source/widgets/unsupervised/images/dbscan-stamped.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.