QQ - How do I get detailed statistics for continuous data fields?
  • 1 Minute to read
  • Dark
    Light
  • PDF

QQ - How do I get detailed statistics for continuous data fields?

  • Dark
    Light
  • PDF

Article summary

There are multiple ways of accessing statistical data in DataJet:

  • Analytics | Statistics
  • Analytics | Multi-field Statistics
  • Database Tree | Right-Click | Statistics
  • Analytics | Data Audit, select field (note: reduced basic stats for continuous fields)


Basic Statistics are available for all numeric fields:


Detailed statistics are available for all discrete numeric fields:


To obtain detailed statistics for continuous numeric fields, the following options are available:

  1. If Data Size/Cardinality allows, enhance the field to change it from Continuous to Discrete
  2. Use the Engineering | Functions | String | CTOD function to create a discrete sample field
  3. TODO: Sample Option for continuous fields

Enhance Field

  • Engineering | Enhance Field
  • TODO: What are the rules for enhancing a field?

CTOD (Continuous to Discrete), A (LMath), A (DMath)

  1. Drag the table that contains the field into the HotBar
  2. Right-Click the DataSet object in the HotBar and choose Function | Random (Note - any sampling function can be used - see HotBar - Functions for a detailed overview of sampling options) 
  3. Enter a value less than the discrete threshold (Between 250,000 and 1,000,000 depending on Data Type) and choose OK - a sample dataset will be created.
  4. In the Database tree, select the field, Right-Click and choose one of the following:
    1. Engineering | Function | String | CTOD for STRING fields (filter of up to 1 million rows)
    2. Engineering | Function | LMath | A for INTEGER fields (filter of up to 250,000 rows)
    3. Engineering | Function | DMath | A for DOUBLE fields (filter of up to 250,000 rows)
  5. Drag the Filter from the HotBar into the Filter section of the Engineering Dialog
  6. Give the new field a suitable name (e.g., smp_Field1).   Be sure that the name clearly marks the field as a sampled field.
  7. The sampled field can now be used in the Statistics, Multi-Field Statistics and Data Audit reports to access detailed statistical information

Sample Continuous

TODO: Coming Soon


Was this article helpful?