collect(): Array[T]
Dataset API — Actions
Actions are part of the Dataset API for…FIXME
|
Note
|
Actions are the methods in the Dataset Scala class that are grouped in action group name, i.e. @group action.
|
| Action | Description | ||
|---|---|---|---|
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
Computes specified statistics for numeric and string columns. The default statistics are:
|
|||
|
|||
|
head Action
head(): T (1)
head(n: Int): Array[T]
-
Calls the other
headwithnas1and takes the first element
head…FIXME
show Action
show(): Unit
show(truncate: Boolean): Unit
show(numRows: Int): Unit
show(numRows: Int, truncate: Boolean): Unit
show(numRows: Int, truncate: Int): Unit
show(numRows: Int, truncate: Int, vertical: Boolean): Unit
show…FIXME
Calculating Statistics — summary Action
summary(statistics: String*): DataFrame
summary calculates specified statistics for numeric and string columns.
The default statistics are: count, mean, stddev, min, max and 25%, 50%, 75% percentiles.
|
Note
|
summary accepts arbitrary approximate percentiles specified as a percentage (e.g. 10%).
|
Internally, summary uses the StatFunctions to calculate the requested summaries for the Dataset.