ProductionVariant class

Identifies a model that you want to host and the resources to deploy for hosting it. If you are deploying multiple models, tell Amazon SageMaker how to distribute traffic among the models by specifying variant weights.

Constructors

ProductionVariant({required int initialInstanceCount, required ProductionVariantInstanceType instanceType, required String modelName, required String variantName, ProductionVariantAcceleratorType? acceleratorType, double? initialVariantWeight})
ProductionVariant.fromJson(Map<String, dynamic> json)
factory

Properties

acceleratorType ProductionVariantAcceleratorType?
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker.
final
hashCode int
The hash code for this object.
no setterinherited
initialInstanceCount int
Number of instances to launch initially.
final
initialVariantWeight double?
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration. The traffic to a production variant is determined by the ratio of the VariantWeight to the sum of all VariantWeight values across all ProductionVariants. If unspecified, it defaults to 1.0.
final
instanceType ProductionVariantInstanceType
The ML compute instance type.
final
modelName String
The name of the model that you want to host. This is the name that you specified when creating the model.
final
runtimeType Type
A representation of the runtime type of the object.
no setterinherited
variantName String
The name of the production variant.
final

Methods

noSuchMethod(Invocation invocation) → dynamic
Invoked when a nonexistent method or property is accessed.
inherited
toJson() Map<String, dynamic>
toString() String
A string representation of this object.
inherited

Operators

operator ==(Object other) bool
The equality operator.
inherited