Connect to Impala and create a remote dplyr data source
Connect to Impala and create a remote dplyr data source
src_impala creates a SQL backend to dplyr for Apache Impala, the massively parallel processing query engine for Apache Hadoop.
src_impala can work with any DBI-compatible interface that provides connectivity to Impala. Currently, two packages that can provide this connectivity are odbc and RJDBC.
src_impala(drv,..., auto_disconnect =TRUE)
Arguments
drv: an object that inherits from DBIDriver-class. For example, an object returned by odbc or JDBC
...: arguments passed to the underlying Impala database connection method dbConnect. See dbConnect,OdbcDriver-method or dbConnect,JDBCDriver-method
auto_disconnect: Should the connection to Impala be automatically closed when the object returned by this function is deleted? Pass NA
to auto-disconnect but print a message when this happens.