You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`slopeMean` | Mean slope of the lines on which clusters are based. Line slopes are drawn from the normal distribution.
25
-
`slopeStd` | Standard deviation of line slopes.
26
-
`numClusts` | Number of clusters (and therefore of lines) to generate.
27
-
`xClustAvgSep` | Average separation of line centers along the X axis.
28
-
`yClustAvgSep` | Average separation of line centers along the Y axis.
29
-
`lengthMean` | Mean length of the lines on which clusters are based. Line lengths are drawn from the folded normal distribution.
30
-
`lengthStd` | Standard deviation of line lengths.
31
-
`lateralStd` | Cluster "fatness", i.e., the standard deviation of the distance from each point to the respective line, in both *x* and *y* directions. This distance is obtained from the normal distribution with zero mean.
32
-
`totalPoints` | Total points in generated data. These will be randomly divided between clusters using the half-normal distribution with unit standard deviation.
33
-
`linePtsDist` | Optional parameter which specifies the distribution of points along lines. Possible values are `'unif'` (default) and `'norm'`. The former will distribute points uniformly along lines, while the latter will use a normal distribution (mean equal to the line center, standard deviation equal to one sixth of the line length). In the latter case, the line includes three standard deviations of the normal distribution, meaning that there is a small chance that some points are projected outside line limits.
22
+
### Required parameters
23
+
24
+
Parameter | Description
25
+
-------------- | -----------
26
+
`slopeMean` | Mean slope of the lines on which clusters are based. Line slopes are drawn from the normal distribution.
27
+
`slopeStd` | Standard deviation of line slopes.
28
+
`numClusts` | Number of clusters (and therefore of lines) to generate.
29
+
`xClustAvgSep` | Average separation of line centers along the X axis.
30
+
`yClustAvgSep` | Average separation of line centers along the Y axis.
31
+
`lengthMean` | Mean length of the lines on which clusters are based. Line lengths are drawn from the folded normal distribution.
32
+
`lengthStd` | Standard deviation of line lengths.
33
+
`lateralStd` | Cluster "fatness", i.e., the standard deviation of the distance from each point to its projection on the line. The way this distance is obtained is controlled by the optional `'pointOffset'` parameter.
34
+
`totalPoints` | Total points in generated data. These will be randomly divided between clusters using the half-normal distribution with unit standard deviation.
35
+
36
+
### Optional named parameters
37
+
38
+
Parameter name | Parameter values | Default value | Description
`pointDist` | `'unif'`, `'norm'` | `unif` | Specifies the distribution of points along lines, with two possible values: 1) `'unif'` distributes points uniformly along lines; or, 2) `'norm'` distribute points along lines using a normal distribution (line center is the mean and the line length is equal to 3 standard deviations).
42
+
`pointOffset` | `1D`, `2D` | `2D` | Controls how points are created from their projections on the lines, with two possible values: 1) `'1D'` places points on a second line perpendicular to the cluster line using a normal distribution centered at their intersection; or, 2) `'2D'` places point using a bivariate normal distribution centered at the point projection.
34
43
35
44
## Return values
36
45
@@ -43,7 +52,9 @@ depending on the selected input parameters.
43
52
`slopes` | Vector (`numClusts` x *1*) containing the effective slopes of the lines used to generate clusters.
44
53
`lengths` | Vector (`numClusts` x *1*) containing the effective lengths of the lines used to generate clusters.
0 commit comments