arrow17.0.0.1 package

Integration to 'Apache' 'Arrow'

compression

Compressed stream classes

concat_arrays

Concatenate zero or more Arrays

dictionary

Create a dictionary type

DictionaryType

class DictionaryType

io_thread_count

Manage the global I/O thread pool in libarrow

RecordBatch-class

RecordBatch class

RecordBatchReader

RecordBatchReader classes

RecordBatchWriter

RecordBatchWriter classes

recycle_scalars

Recycle scalar values in a list of arrays

reexports

Objects exported from other packages

Schema-class

Schema class

acero

Functions available in Arrow dplyr queries

buffer

Create a Buffer

add_filename

Add the data filename as a column

array-class

Array Classes

ArrayData

ArrayData class

arrow_array

Create an Arrow Array

arrow_info

Report information on the package's capabilities

arrow_not_supported

Helpers to raise classed errors

arrow-package

arrow: Integration to 'Apache' 'Arrow'

DataType-class

DataType class

as_arrow_array

Convert an object to an Arrow Array

as_arrow_table

Convert an object to an Arrow Table

as_chunked_array

Convert an object to an Arrow ChunkedArray

as_data_type

Convert an object to an Arrow DataType

as_record_batch_reader

Convert an object to an Arrow RecordBatchReader

as_record_batch

Convert an object to an Arrow RecordBatch

as_schema

Convert an object to an Arrow Schema

Buffer-class

Buffer class

call_function

Call an Arrow compute function

cast_options

Cast options

cast

Change the type of an array or column

chunked_array

Create a Chunked Array

ChunkedArray-class

ChunkedArray class

codec_is_available

Check whether a compression codec is available

Codec

Compression Codec class

concat_tables

Concatenate one or more Tables

contains_regex

Does this string contain regex metacharacters?

copy_files

Copy files between FileSystems

cpu_count

Manage the global CPU thread pool in libarrow

create_package_with_all_dependencies

Create a source bundle that includes all thirdparty dependencies

csv_convert_options

CSV Convert Options

csv_parse_options

CSV Parsing Options

default_memory_pool

Arrow's default MemoryPool

csv_read_options

CSV Reading Options

csv_write_options

CSV Writing Options

CsvFileFormat

CSV dataset file format

CsvReadOptions

File reader options

CsvTableReader

Arrow CSV and JSON table reader classes

data-type

Create Arrow data types

dataset_factory

Create a DatasetFactory

Dataset

Multi-file datasets

enums

Arrow enums

Expression

Arrow expressions

ExtensionArray

ExtensionArray class

ExtensionType

ExtensionType class

FeatherReader

FeatherReader class

Field-class

Field class

Field

Create a Field

FileFormat

Dataset file formats

FileInfo

FileSystem entry info

FileSelector

file selector

FileSystem

FileSystem classes

FileWriteOptions

Format-specific write options

FixedWidthType

FixedWidthType class

flight_connect

Connect to a Flight server

flight_disconnect

Explicitly close a Flight client

flight_get

Get data from a Flight server

flight_put

Send data to a Flight server

format_schema

Get a string representing a Dataset or RecordBatchReader object's sche...

FragmentScanOptions

Format-specific scan options

get_stringr_pattern_options

Get stringr pattern options

gs_bucket

Connect to a Google Cloud Storage (GCS) bucket

hive_partition

Construct Hive partitioning

infer_schema

Extract a schema from an object

infer_type

Infer the arrow Array type from an R object

InputStream

InputStream classes

install_arrow

Install or upgrade the Arrow library

install_pyarrow

Install pyarrow for use with reticulate

JsonFileFormat

JSON dataset file format

list_compute_functions

List available Arrow C++ compute functions

list_flights

See available resources on a Flight server

load_flight_server

Load a Python Flight server

make_readable_file

Handle a range of possible input sources

map_batches

Apply a function to a stream of RecordBatches

schema

Create a schema or extract one from an object.

match_arrow

Value matching for Arrow objects

MemoryPool

MemoryPool class

Message

Message class

MessageReader

MessageReader class

mmap_create

Create a new read/write memory mapped file of a given size

mmap_open

Open a memory mapped file

new_extension_type

Extension types

open_dataset

Open a multi-file dataset

open_delim_dataset

Open a multi-file dataset of CSV or other delimiter-separated format

OutputStream

OutputStream classes

ParquetArrowReaderProperties

ParquetArrowReaderProperties class

show_exec_plan

Show the details of an Arrow Execution Plan

Table-class

Table class

ParquetFileReader

ParquetFileReader class

ParquetFileWriter

ParquetFileWriter class

ParquetReaderProperties

ParquetReaderProperties class

ParquetWriterProperties

ParquetWriterProperties class

Partitioning

Define Partitioning for a Dataset

read_delim_arrow

Read a CSV or other delimited file with Arrow

table

Create an Arrow Table

to_arrow

Create an Arrow object from a DuckDB connection

read_feather

Read a Feather file (an Arrow IPC file)

read_ipc_stream

Read Arrow IPC stream format

read_json_arrow

Read a JSON file

read_message

Read a Message from a stream

read_parquet

Read a Parquet file

read_schema

Read a Schema from a stream

record_batch

Create a RecordBatch

register_binding

Register compute bindings

register_scalar_function

Register user-defined functions

repeat_value_as_array

Take an object of length 1 and repeat it.

s3_bucket

Connect to an AWS S3 bucket

Scalar-class

Arrow scalars

scalar

Create an Arrow Scalar

Scanner

Scan the contents of a dataset

to_duckdb

Create a (virtual) DuckDB table from an Arrow object

unify_schemas

Combine and harmonize schemas

value_counts

table for Arrow objects

vctrs_extension_array

Extension type for generic typed vectors

write_csv_arrow

Write CSV file to disk

write_dataset

Write a dataset

write_delim_dataset

Write a dataset into partitioned flat files.

write_feather

Write a Feather file (an Arrow IPC file)

write_ipc_stream

Write Arrow IPC stream format

write_parquet

Write Parquet file to disk

write_to_raw

Write Arrow data to a raw vector

'Apache' 'Arrow' <https://arrow.apache.org/> is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. This package provides an interface to the 'Arrow C++' library.

  • Maintainer: Jonathan Keane
  • License: Apache License (>= 2.0)
  • Last published: 2024-08-21