ProductionVariant class
Identifies a model that you want to host and the resources to deploy for hosting it. If you are deploying multiple models, tell Amazon SageMaker how to distribute traffic among the models by specifying variant weights.
Constructors
- ProductionVariant({required int initialInstanceCount, required ProductionVariantInstanceType instanceType, required String modelName, required String variantName, ProductionVariantAcceleratorType? acceleratorType, double? initialVariantWeight})
-
ProductionVariant.fromJson(Map<
String, dynamic> json) -
factory
Properties
- acceleratorType → ProductionVariantAcceleratorType?
-
The size of the Elastic Inference (EI) instance to use for the production
variant. EI instances provide on-demand GPU computing for inference. For
more information, see Using Elastic
Inference in Amazon SageMaker.
final
- hashCode → int
-
The hash code for this object.
no setterinherited
- initialInstanceCount → int
-
Number of instances to launch initially.
final
- initialVariantWeight → double?
-
Determines initial traffic distribution among all of the models that you
specify in the endpoint configuration. The traffic to a production variant
is determined by the ratio of the
VariantWeight
to the sum of allVariantWeight
values across all ProductionVariants. If unspecified, it defaults to 1.0.final - instanceType → ProductionVariantInstanceType
-
The ML compute instance type.
final
- modelName → String
-
The name of the model that you want to host. This is the name that you
specified when creating the model.
final
- runtimeType → Type
-
A representation of the runtime type of the object.
no setterinherited
- variantName → String
-
The name of the production variant.
final
Methods
-
noSuchMethod(
Invocation invocation) → dynamic -
Invoked when a nonexistent method or property is accessed.
inherited
-
toJson(
) → Map< String, dynamic> -
toString(
) → String -
A string representation of this object.
inherited
Operators
-
operator ==(
Object other) → bool -
The equality operator.
inherited