google_vision library
Integrates Google Vision features, including image labeling, face, logo, and landmark detection, optical character recognition (OCR), and detection of explicit content, into applications.
Classes
- AnnotatedResponses
- Response to a batch image annotation request.
- AnnotateImageResponse
- Individual responses to image annotation requests within the batch.
- AnnotationRequest
- Individual image annotation requests for this batch.
- AnnotationRequests
- Request for performing Google Cloud Vision API tasks over a user-provided image, with user-requested features, and with context information.
- Block
- Logical element on the page.
- BoundingPoly
- The bounding polygon around the face. The coordinates of the bounding box are in the original image's scale. The bounding box is computed to "frame" the face in accordance with human expectations. It is based on the landmarker results. Note that one or more x and/or y coordinates may not be generated in the BoundingPoly (the polygon will be unbounded) if only a partial face appears in the image to be annotated.
- Color
- Represents a color in the RGBA color space. This representation is designed for simplicity of conversion to/from color representations in various languages over compactness. For example, the fields of this representation can be trivially provided to the constructor of java.awt.Color in Java; it can also be trivially provided to UIColor's +colorWithRed:green:blue:alpha method in iOS; and, with just a little work, it can be easily formatted into a CSS rgba() string in JavaScript.
- ColorInfo
- Color information consists of RGB channels, score, and the fraction of the image that the color occupies.
- CropHint
- Single crop hint that is used to generate a new crop when serving an image.
- CropHintsAnnotation
- Set of crop hints that are used to generate new crops when serving images.
- CropHintsParams
- Parameters for crop hints annotation request.
- DetectedBreak
- Detected start or end of a structural component.
- DetectedLanguage
- Detected language for a structural component.
- DominantColorsAnnotation
- Set of dominant colors and their corresponding scores.
- EntityAnnotation
- Set of detected labels.
- FaceAnnotation
- A face annotation object contains the results of face detection.
- FaceRecognitionParams
- Individual image annotation requests for this batch.
- Feature
- The type of Google Cloud Vision API detection to perform, and the maximum number of results to return for that type. Multiple Feature objects can be specified in the features list.
- FullTextAnnotation
- TextAnnotation contains a structured representation of OCR-extracted text. The hierarchy of an OCR-extracted text structure is: TextAnnotation -> Page -> Block -> Paragraph -> Word -> Symbol. Each structural component,
- GoogleVision
- Integrates Google Vision features, including image labeling, face, logo, and landmark detection, optical character recognition (OCR), and detection of explicit content, into applications.
- Image
- ImageAnnotationContext
- If an image was produced from a file (e.g. a PDF), this message gives information about the source of that image.
- ImageContext
- Individual image annotation requests for this batch.
- ImagePropertiesAnnotation
- Stores image properties, such as dominant colors.
- JsonSettings
- Model of the JSON authentication file provided by Google.
- JwtCredentials
- JwtGenerator
- Landmark
- Face landmark (feature) type. Left and right are defined from the vantage of the viewer of the image without considering mirror projections typical of photos. So, LEFT_EYE, typically, is the person's right eye.
- LatLng
- An object that represents a latitude/longitude pair. This is expressed as a pair of doubles to represent degrees latitude and degrees longitude. Unless specified otherwise, this object must conform to the WGS84 standard. Values must be within normalized ranges.
- LocalizedObjectAnnotation
- Set of detected objects with bounding boxes.
- LocationInfo
- Detected entity location information.
- NormalizedVertex
- A vertex represents a 2D point in the image. NOTE: the normalized vertex coordinates are relative to the original image and range from 0 to 1.
- OAuthClient
- Page
- Detected page from OCR.
- Painter
- The Painter class represents the space by which a supplied image can be modified.
- Paragraph
- Structural unit of text representing a number of words in certain order.
- Position
- A 3D position in the image, used primarily for Face detection landmarks. A valid Position must have both x and y coordinates. The position coordinates are in the same scale as the original image.
- ProductSearchParams
- Parameters for product search.
- Property
- A Property consists of a user-supplied name/value pair.
- SafeSearchAnnotation
- Set of features pertaining to the image, computed by computer vision methods over safe-search verticals (for example, adult, spoof, medical, violence).
- SerializableImage
- Status
- The Status type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by gRPC. Each Status message contains three pieces of data: error code, error message, and error details.
- Symbol
- A single symbol representation.
- TextAnnotation
- Set of detected texts with bounding boxes.
- TextDetectionParams
- Parameters for text detection and document text detection.
- TextProperty
- Additional information detected on the structural component.
- Token
- TokenGenerator
- Util
- Utility methods.
- Vertex
- A vertex represents a 2D point in the image. NOTE: the vertex coordinates are in the same scale as the original image.
- VisionClient
- Integrates Google Vision features, including image labeling, face, logo, and landmark detection, optical character recognition (OCR), and detection of explicit content, into applications.
- VisionCropHintCommand
- Set of crop hints that are used to generate new crops when serving images.
- VisionDetectCommand
- Integrates Google Vision features, including image labeling, face, logo, and landmark detection, optical character recognition (OCR), and detection of explicit content, into applications.
- VisionHelper
- Helper methods used by most of the cli commands.
- VisionHighlightCommand
- Draw a box to highlight any objects detected.
- VisionSafeSearchCommand
- SafeSearch Detection detects explicit content such as adult content or violent content within an image.
- VisionScoreCommand
- For OBJECT_LOCALIZATION, get the score(s) for the object specified with "look-for".
- VisionVersionCommand
- WebDetectionParams
- Parameters for web detection.
- Word
- A word representation.
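The request-side classes above (AnnotationRequests, AnnotationRequest, Image, Feature) mirror the JSON body of the Cloud Vision REST method `images:annotate`. As a rough illustration of that shape, the sketch below builds the body by hand with `dart:convert` only. It does not use this library's classes, and `buildAnnotateBody` is a hypothetical helper name introduced for the example.

```dart
import 'dart:convert';

/// Hypothetical helper: builds the JSON body for a single-image
/// `images:annotate` call.
///
/// [imageBytes] is the raw image data; [features] maps a feature type
/// name (for example `LABEL_DETECTION`) to its `maxResults` limit.
String buildAnnotateBody(List<int> imageBytes, Map<String, int> features) {
  final request = {
    // The REST API expects the image as a base64-encoded string.
    'image': {'content': base64Encode(imageBytes)},
    // One entry per requested detection type.
    'features': [
      for (final entry in features.entries)
        {'type': entry.key, 'maxResults': entry.value},
    ],
  };
  // The endpoint takes a batch, so a single request is still wrapped in a list.
  return jsonEncode({
    'requests': [request],
  });
}

void main() {
  final body = buildAnnotateBody(
    [0x89, 0x50, 0x4E, 0x47], // placeholder bytes, not a complete image
    {'LABEL_DETECTION': 10, 'FACE_DETECTION': 5},
  );
  print(body);
}
```

Each entry in `features` corresponds to one Feature object: a detection type plus the maximum number of results to return for that type.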
Extensions
- UsageExtension
- Helper method that retrieves the error message string.
Enums
- BlockType
- Type of a block (text, image, etc.) as identified by OCR.
- BreakType
- Enum denoting the type of break found (new line, space, etc.).
- DetectionType
- LandmarkType
- Face landmark (feature) type. Left and right are defined from the vantage of the viewer of the image without considering mirror projections typical of photos. So, LEFT_EYE, typically, is the person's right eye.
- Likelihood
- A bucketized representation of likelihood, which is intended to give clients highly stable results across model upgrades.
- SafeSearchDetectionType
- A bucketized representation of likelihood, which is intended to give clients highly stable results across model upgrades.
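Because Likelihood is a bucketized scale rather than a numeric score, client code usually compares a result against a threshold bucket. The sketch below shows one way to do that. The enum is a hypothetical stand-in declared locally, so its value names and ordering are assumptions that may not match the library's own Likelihood enum. The underlying API buckets are UNKNOWN, VERY_UNLIKELY, UNLIKELY, POSSIBLE, LIKELY, and VERY_LIKELY.

```dart
/// Hypothetical stand-in for the library's Likelihood enum, declared in
/// ascending order so that `index` can be used for comparisons.
enum Likelihood {
  unknown,
  veryUnlikely,
  unlikely,
  possible,
  likely,
  veryLikely,
}

/// Returns true when [value] is at least as likely as [threshold].
/// `unknown` never passes, because it carries no information.
bool atLeast(Likelihood value, Likelihood threshold) {
  if (value == Likelihood.unknown) return false;
  return value.index >= threshold.index;
}

void main() {
  print(atLeast(Likelihood.veryLikely, Likelihood.likely)); // true
  print(atLeast(Likelihood.possible, Likelihood.likely)); // false
  print(atLeast(Likelihood.unknown, Likelihood.unknown)); // false
}
```

Comparing by bucket rather than by raw score keeps a check such as "flag anything LIKELY or above" stable across model upgrades, which is the purpose the Likelihood description states.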