splitType property
The method to use to split the transform job's data files into smaller
batches. Splitting is necessary when the total size of each object is too
large to fit in a single request. You can also use data splitting to improve
performance by processing multiple concurrent mini-batches. The default
value for SplitType
is None
, which indicates that
input data files are not split, and request payloads contain the entire
contents of an input object. Set the value of this parameter to
Line
to split records on a newline character boundary.
SplitType
also supports a number of record-oriented binary data
formats. Currently, the supported record formats are:
- RecordIO
- TFRecord
BatchStrategy
and MaxPayloadInMB
parameters.
When the value of BatchStrategy
is MultiRecord
,
Amazon SageMaker sends the maximum number of records in each request, up to
the MaxPayloadInMB
limit. If the value of
BatchStrategy
is SingleRecord
, Amazon SageMaker
sends individual records in each request.
Implementation
final SplitType? splitType;