Having a numeric variable in a dataset we usually do not operate with its values directly, instead, we organize values into groups (segments) we need for our purposes. It is not always possible to foresee all that is needed for the analysis segments, so we found it useful to enable the feature for the fast creation of segments directly in Data Explorer.
How to create segments for numeric variables in Data Explorer (DeX)
Choose any numeric variable from a list of variables in the left panel in DeX to activate the secondary panel, which may be empty or display previously created segments.
To create new intervals, click on the input field at the top of the secondary panel.
A hint displaying common expressions will appear. You can either type an expression or choose and edit any suggested in the hint.
Press Enter to apply the expression and generate the specified intervals. Remember that you can edit each interval's label by double-clicking on the created segment individually.
Once you have created your new segments, you can rename them to make more meaningful labels. eg. weight 10-99 g for ‘Small’ fruits, weight 300-399 g for ‘Heavy’ fruits. Afterward, you can operate with them just like you would with regular categories.
To remove existing segments
Select the segments
Click the 'x' icon next to them or press the Delete key on your keyboard.
Note! Deleting intervals from the list will not impact existing reports.
Syntax of Segmenting Expression
Note! All the expressions described in this section are case insensitive.
To define the segment you need to enter an expression which can be:
Single value as it is (18, 44, 1947 etc.) defines options strictly matched to the number.
Expressions 'more or less' (>=17, >99, <=87 etc.) accordingly selects the options that match the condition.
An interval defined by formula FROM:TO (18:77, 4:15 etc.) includes borders.
Intervals with explicitly defined closures: square brackets define included borders, round brackets define excluded borders. Examples: [18:25] - from 18 to 25 borders included ie.18-25; (18:25) - from 18 to 25 borders excluded ie. 19-24; [18:25) - from 18 to 25 excluding 25 ie. 18-24.
By using * asterisk we define the full span from min to max value.
The expression ‘from [N1] by [N2]*[N3]’ creates N3 non-intersecting intervals of span N2, starting from N1. Numbers in brackets are variable and optional. Example: 'from 38 by 5*6' creates six non-intersecting intervals, each includes five values: 38-42, 43-47, 48-52, 53-57, 58-62, 63-67.
The expression ‘from [N1] by [N2]*' creates intervals of span N2 for the full range starting from N1. Similar to the previous one without a specified number of intervals.