dependent_variable: (Type: character) name of the dependent variable. This variable is logged and then used as the dependent variable in the estimation.
distance: (Type: character) name of the distance variable that should be taken as the key independent variable in the estimation. The distance is logged automatically when the function is executed.
additional_regressors: (Type: character) names of the additional regressors to include in the model (e.g. a dummy variable to indicate contiguity). Unilateral metric variables such as GDPs can be added but those variables have to be logged first. Interaction terms can be added.
Write this argument as c(contiguity, common currency, ...). By default this is set to NULL.
data: (Type: data.frame) the dataset to be used.
...: Additional arguments to be passed to the function.
Returns
The function returns the summary of the estimated gravity model similar to a glm-object.
Details
nls is an estimation method for gravity models belonging to generalized linear models. It is estimated via glm using the gaussian distribution and a log-link.
As the method may not lead to convergence when poor starting values are used, the linear predictions, fitted values, and estimated coefficients resulting from a ppml estimation are used for the arguments etastart, mustart, and start.
For similar functions, utilizing the multiplicative form via the log-link, but different distributions, see ppml, gpml, and nbpml.
nls estimation can be used for both, cross-sectional as well as panel data, but its up to the user to ensure that the functions can be applied to panel data.
Depending on the panel dataset and the variables - specifically the type of fixed effects - included in the model, it may easily occur that the model is not computable.
Also, note that by including bilateral fixed effects such as country-pair effects, the coefficients of time-invariant observables such as distance can no longer be estimated.
Depending on the specific model, the code of the respective function may has to be changed in order to exclude the distance variable from the estimation.
At the very least, the user should take special care with respect to the meaning of the estimated coefficients and variances as well as the decision about which effects to include in the estimation. When using panel data, the parameter and variance estimation of the models may have to be changed accordingly.
For a comprehensive overview of gravity models for panel data see if(!exists(".Rdpack.currefs")) .Rdpack.currefs <-new.env();Rdpack::insert_citeOnly(keys="Egger2003;textual",package="gravity",cached_env=.Rdpack.currefs) , if(!exists(".Rdpack.currefs")) .Rdpack.currefs <-new.env();Rdpack::insert_citeOnly(keys="Gomez-Herrera2013;textual",package="gravity",cached_env=.Rdpack.currefs) and if(!exists(".Rdpack.currefs")) .Rdpack.currefs <-new.env();Rdpack::insert_citeOnly(keys="Head2010;textual",package="gravity",cached_env=.Rdpack.currefs) as well as the references therein.
Examples
# Example for CRAN checks:# Executable in < 5 seclibrary(dplyr)data("gravity_no_zeros")# Choose 5 countries for testingcountries_chosen <- c("AUS","CHN","GBR","BRA","CAN")grav_small <- filter(gravity_no_zeros, iso_o %in% countries_chosen)grav_small <- grav_small %>% mutate( lgdp_o = log(gdp_o), lgdp_d = log(gdp_d))fit <- nls( dependent_variable ="flow", distance ="distw", additional_regressors = c("rta","lgdp_o","lgdp_d"), data = grav_small
)
References
For more information on gravity models, theoretical foundations and estimation methods in general see