Datasets

A Dataset is the primary container of statistical data in Crunch. Each dataset contains a collection of variables, from which analyses can be composed and then saved and exported. These analyses may include filters, which a user can define and save. A user can also share a dataset with another user.

Data is added to a dataset in batches. A new dataset may be created empty (with zero data batches), or certain import operations may create the dataset and add its first data batch in a single step. In either case, additional batches can be appended to the dataset afterwards. Similarly, variables from other datasets can be joined onto a dataset.
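To make the batch model concrete, here is a minimal in-memory sketch. It is an illustration only, not the Crunch implementation or its API: the `ToyDataset` class and its `append_batch` and `row_count` methods are hypothetical names introduced for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List

Row = Dict[str, object]


@dataclass
class ToyDataset:
    """Hypothetical stand-in for a dataset: a name plus an ordered list of batches."""
    name: str
    batches: List[List[Row]] = field(default_factory=list)

    def append_batch(self, rows: List[Row]) -> int:
        """Append one batch of rows and return the new number of batches."""
        self.batches.append(list(rows))
        return len(self.batches)

    def row_count(self) -> int:
        """Total rows across all batches."""
        return sum(len(batch) for batch in self.batches)


# A dataset created empty has zero batches.
ds = ToyDataset("survey")

# Batches are appended one at a time; each adds its rows to the dataset.
ds.append_batch([{"age": 34}, {"age": 51}])
ds.append_batch([{"age": 27}])
```

After the two appends, `ds` holds two batches; the rows of each batch remain grouped in the order the batches were added.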

As with other Crunch objects, references to the dataset entities available to a user are provided in a catalog. Several of the endpoint methods described below return a dataset catalog, which may be filtered and/or organized in a hierarchy, depending on the method.
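As a rough sketch of working with such a catalog on the client side, the snippet below reads dataset names out of a catalog-shaped payload. The payload layout (an `index` object keyed by entity URL, each entry carrying a `name`), the `example.invalid` URLs, and the helper names `dataset_names` and `find_by_name` are assumptions made for illustration; consult the API Reference for the actual response shape.

```python
from typing import Dict, List

# Hypothetical catalog payload; the structure and URLs are assumptions,
# not a recorded API response.
sample_catalog = {
    "element": "shoji:catalog",
    "self": "https://example.invalid/api/datasets/",
    "index": {
        "https://example.invalid/api/datasets/abc123/": {"name": "Survey wave 1"},
        "https://example.invalid/api/datasets/def456/": {"name": "Survey wave 2"},
    },
}


def dataset_names(catalog: dict) -> Dict[str, str]:
    """Map each dataset entity URL in the catalog to its name."""
    index = catalog.get("index", {})
    return {url: entry.get("name", "") for url, entry in index.items()}


def find_by_name(catalog: dict, name: str) -> List[str]:
    """Return the URLs of all catalog entries whose name matches exactly."""
    return [url for url, n in dataset_names(catalog).items() if n == name]


matches = find_by_name(sample_catalog, "Survey wave 2")
```

The catalog holds references rather than full entities, so a client would typically follow a matching URL to fetch the dataset's details.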

See API Reference - Create dataset.

Other catalogs

In addition to /datasets/, there are a few other catalogs of datasets in the API:

Project datasets

See API Reference - List datasets in project.

Filter datasets by name

See API Reference - Filter datasets by name.

Entity

GET

See API Reference - Dataset details.

PATCH

See API Reference - Update dataset.

DELETE

See API Reference - Delete dataset.

Views

Applied filters

Cube

See API Reference - Calculate data cube.

Export

See API Reference - List available export formats.

See API Reference - Export dataset.

Match

See API Reference - Request matching variable analysis.

See API Reference - View matching variable analysis.

Summary

See API Reference - Row and column count.

Fragments

Table

See API Reference - List variable definitions.

State

See API Reference - List dataset’s current state.

Exclusion

See API Reference - View the current row exclusion filter.

See API Reference - Update row exclusions.

Publishing

See API Reference - Is dataset published.

See API Reference - Publish/unpublish dataset.

Stream

See API Reference - Streaming status.

See API Reference - Insert streamed data.

Settings

See API Reference - List settings.

See API Reference - Update settings.

Preferences

See API Reference - List user preferences.

See API Reference - Update settings.

Primary key

See API Reference - List primary key.

See API Reference - Set primary key.

See API Reference - Unset primary key.

Catalogs

Users

See API Reference - List dataset users.

See API Reference - Add/remove dataset users.

Teams

See API Reference - List dataset teams.