Interface with Google Cloud Document AI API
Draw block bounding boxes
Assign tokens to a single new block
Delete processor
Deprecated functions
Disable processor
Run when daiR is attached
Get entities
List ids of available processors of a given type
Get information about processor
List available versions of processor
List created processors
Get project id
Get tables
Get text
List versions of available processors of a given type
Convert images to PDF
Image to base64 tiff
Check that a string is a valid colour representation
Check that a file is JSON
Check that a file is PDF
List available processor types
Build block dataframe
Make hOCR file
Merge shards
PDF to base64 tiff
Assign tokens to new blocks
Build token dataframe
Create processor
Defunct functions
OCR documents asynchronously
Check authentication
Notify on job completion
Check job status
OCR document synchronously
Produce access token
Get user information
Draw entity bounding boxes
Draw line bounding boxes
Draw paragraph bounding boxes
Draw token bounding boxes
Enable processor
Extract block coordinates from labelme files
Inspect revised block bounding boxes
Split a block bounding box
Get tables from output file
Get tables from response object
Get text from output file
Get text from HTTP response object
R interface for the Google Cloud Services 'Document AI API' <https://cloud.google.com/document-ai> with additional tools for output file parsing and text reconstruction. 'Document AI' is a powerful server-based OCR service that extracts text and tables from images and PDF files with high accuracy. 'daiR' gives R users programmatic access to this service and additional tools to handle and visualize the output. See the package website <https://dair.info/> for more information and examples.
Useful links