Title: | Computing Log-Transformed Kernel Density Estimates for Positive Data |
---|---|
Description: | Computes log-transformed kernel density estimates for positive data using a variety of kernels. It follows the methods described in Jones, Nguyen and McLachlan (2018) <doi:10.21105/joss.00870>. |
Authors: | Hien D. Nguyen, Andrew T. Jones, and Geoffrey J. McLachlan |
Maintainer: | Andrew Thomas Jones <[email protected]> |
License: | GPL-3 |
Version: | 0.3.2 |
Built: | 2024-11-04 04:03:44 UTC |
Source: | https://github.com/andrewthomasjones/logkde |
Computes least squares cross-validation (CV) bandwidth (BW) for log domain KDE.
bw.logCV(x, grid = 21, NB = 512)
bw.logCV(x, grid = 21, NB = 512)
x |
numeric vector of the data. Must be strictly positive, will be log transformed during estimation. |
grid |
number of points used for BW selection CV grid. |
NB |
number of points at which to estimate the KDE at during the CV loop. |
bw the optimal least squares CV bandwidth.
Silverman, B. W. (1986). Density estimation for statistics and data analysis. Monographs on Statistics and Applied Probability. 26.
Stone, C. J. (1984). An asymptotically optimal window selection rule for kernel density estimates. The Annals of Statistics, 12(4), 1285-1297.
bw.logCV(rchisq(100,10), grid=21, NB=512)
bw.logCV(rchisq(100,10), grid=21, NB=512)
Computes bandwidth for log domain KDE using the Silverman rule.
bw.logG(x)
bw.logG(x)
x |
numeric vector of the data. Must be strictly positive, will be log transformed during estimation. |
bw the optimal bandwidth.
Silverman, B. W. (1986). Density estimation for statistics and data analysis. Monographs on Statistics and Applied Probability. 26.
Wand, M. P., Marron, J. S., & Ruppert, D. (1991). Transformations in density estimation. Journal of the American Statistical Association, 86(414), 343-353.
bw.logG(rchisq(100,10))
bw.logG(rchisq(100,10))
The function logdensity
computes kernel density estimates (KDE) of strictly positive distributions by performing the KDE in the log domain and then transforming the result back again. The syntax and function structure is largely borrowed from the function density
in package stats.
logdensity(x, bw = "nrd0", adjust = 1, kernel = "gaussian", weights = NULL, n = 512, from, to, cut = 3, na.rm = FALSE)
logdensity(x, bw = "nrd0", adjust = 1, kernel = "gaussian", weights = NULL, n = 512, from, to, cut = 3, na.rm = FALSE)
x |
the data from which the estimate is to be computed. |
bw |
the smoothing bandwidth to be used. Can also be can also be a character string giving a rule to choose the bandwidth. Like |
adjust |
the bandwidth used is actually |
kernel |
a character string giving the smoothing kernel to be used. Choose from "gaussian", "epanechnikov", "triangular", "uniform", "laplace" and "logistic". Default value is "gaussian". |
weights |
numeric vector of non-negative observation weights of the same length as |
n |
the number of equally spaced points at which the density is to be estimated. Note that these are equally spaced in the original domain. |
from , to
|
the left and right-most points of the grid at which the density is to be estimated; the defaults are cut * bw outside of range(x). |
cut |
by default, the values of from and to are cut bandwidths beyond the extremes of the data |
na.rm |
logical; if TRUE, missing values are removed from x. If FALSE any missing values cause an error. |
An object with class "density". See help(density)
for details.
Charpentier, A., & Flachaire, E. (2015). Log-transform kernel density estimation of income distribution. L'Actualite economique, 91(1-2), 141-159.
Wand, M. P., Marron, J. S., & Ruppert, D. (1991). Transformations in density estimation. Journal of the American Statistical Association, 86(414), 343-353.
density
, plot.density
, logdensity_fft
, bw.nrd
, bw.logCV
, bw.logG
.
logdensity(abs(rnorm(100)), from =.1, to=2, kernel='triangular')
logdensity(abs(rnorm(100)), from =.1, to=2, kernel='triangular')
The function logdensity_fft
computes kernel density estimates (KDE) of strictly positive distributions by performing the KDE via fast fourier transform utilizing the fft
function. The syntax and function structure is largely borrowed from the function density
in package stats.
logdensity_fft(x, bw = "nrd0", adjust = 1, kernel = "gaussian", weights = NULL, n = 512, from, to, cut = log(3), na.rm = FALSE)
logdensity_fft(x, bw = "nrd0", adjust = 1, kernel = "gaussian", weights = NULL, n = 512, from, to, cut = log(3), na.rm = FALSE)
x |
the data from which the estimate is to be computed. |
bw |
the smoothing bandwidth to be used. Can also be can also be a character string giving a rule to choose the bandwidth. Like |
adjust |
the bandwidth used is actually |
kernel |
a character string giving the smoothing kernel to be used. Choose from "gaussian", "epanechnikov", "triangular", "uniform", "laplace" and "logistic". Default value is "gaussian". |
weights |
numeric vector of non-negative observation weights of the same length as |
n |
the number of equally spaced points at which the density is to be estimated. Note that these are equally spaced in the log domain for |
from , to
|
the left and right-most points of the grid at which the density is to be estimated; the defaults are cut * bw outside of range(x). |
cut |
by default, the values of from and to are cut bandwidths beyond the extremes of the data |
na.rm |
logical; if TRUE, missing values are removed from x. If FALSE any missing values cause an error. |
An object with class "density". See help(density)
for details.
Charpentier, A., & Flachaire, E. (2015). Log-transform kernel density estimation of income distribution. L'Actualite economique, 91(1-2), 141-159.
Cooley, J. W., & Tukey, J. W. (1965). An algorithm for the machine calculation of complex Fourier series. Mathematics of computation, 19(90), 297-301.
Wand, M. P., Marron, J. S., & Ruppert, D. (1991). Transformations in density estimation. Journal of the American Statistical Association, 86(414), 343-353.
density
, plot.density
, logdensity
, bw.nrd
, bw.logCV
, bw.logG
.
logdensity_fft(abs(rnorm(100)), from =0.01, to= 2.5, kernel = 'logistic')
logdensity_fft(abs(rnorm(100)), from =0.01, to= 2.5, kernel = 'logistic')