Skip to content

Commit

Permalink
missing values: add Datasheet Q, notes in TableAPI, bump version (#49)
Browse files Browse the repository at this point in the history
  • Loading branch information
bennn authored Jun 14, 2024
1 parent df0a60e commit fd227ef
Show file tree
Hide file tree
Showing 3 changed files with 5 additions and 3 deletions.
2 changes: 1 addition & 1 deletion Datasheet.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
> Q. Do tables express heterogeneous data, or must data be homogenized?

> Q. Do tables capture missing data and, if so, how?
> Q. Do tables capture missing data and, if so, how? Do missing values affect the output constraints of any operations, for example `groupBy`?

> Q. Are mutable tables supported? Are there any limitations?
Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

The Brown Benchmark for Table Types

Version 1.1
Version 1.2

## Context

Expand Down
4 changes: 3 additions & 1 deletion TableAPI.md
Original file line number Diff line number Diff line change
Expand Up @@ -1032,7 +1032,7 @@ Sorts the rows of a `Table` in ascending order by using a sequence of specified
- `header(t2)` is equal to `["value", "count"]`
- `schema(t2)["value"]` is equal to `schema(t1)[c]`
- `schema(t2)["count"]` is equal to `Number`
- `nrows(t2)` is equal to `length(removeDuplicates(getColumn(t1, c)))`
- `nrows(t2)` is equal to `length(removeDuplicates(getColumn(t1, c)))` Note that if there are missing values in the input, this constraint requires one row for missing values in the output.

#### Description

Expand Down Expand Up @@ -1150,6 +1150,8 @@ Partitions rows into groups and summarize each group with the functions in `agg`
- `schema(t2)` is equal to `schema(r3)`
- `nrows(t2)` is equal to `length(removeDuplicates(ks))`, where `ks` is the results of applying `key` to each row of `t1`. `ks` can be defined with `select` and `getColumn`.

Note that these constraints assume a first class representation for missing values.

#### Description

Groups the rows of a table according to a specified key selector function and creates a result value from each group and its key. The rows of each group are projected by using a specified function.
Expand Down

0 comments on commit fd227ef

Please sign in to comment.