NEWS For R Version 4.0.4 (2021-02-15)
NEWS For R Version 4.0.4 (2021-02-15)
4 (2021-02-15)
NEWS R News
CHANGES IN R 4.0.4
NEW FEATURES:
File ‘share/texmf/tex/latex/jss.cls’ has been updated to work with LaTeX ver-
sions since Oct 2020.
Unicode character width tables (as used by nchar(,type = "w")) have been updated
to Unicode 12.1 by Brodie Gaslam (PR#17781), including many emoji.
The internal table for iswprint (used on Windows, macOS and AIX) has been up-
dated to include many recent Unicode characters.
INSTALLATION on a UNIX-ALIKE:
If an external BLAS is specified by ‘--with-blas=foo’ or via environment variable
BLAS_LIBS is not found, this is now a configuration error. The previous behaviour
was not clear from the documentation: it was to continue the search as if ‘--with-
blas=yes’ was specified.
BUG FIXES:
all.equal(x,y) now “sees” the two different NAs in factors, thanks to Bill Dunlap
and others in PR#17897.
(~ NULL)[1] and similar formula subsetting now works, thanks to a report and patch
by Henrik Bengtsson in PR#17935. Additionally, subsetting leaving an empty for-
mula now works too, thanks to suggestions by Suharto Anggono.
.traceback(n) keeps source references again, as before R 4.0.0, fixing a regression;
introduced by the PR#17580, reported including two patch proposals by Brodie
Gaslam.
unlist(plst,recursive=FALSE) no longer drops content for pairlists with list com-
ponents, thanks to the report and patch by Suharto Anggono in PR#17950.
iconvlist() now also works on MUSL based (Linux) systems, from a report and
patch suggestion by Wesley Chan in PR#17970.
1
2 NEWS
CHANGES IN R 4.0.3
NEW FEATURES:
On platforms using configure option ‘--with-internal-tzcode’, additional values
"internal" and (on macOS only) "macOS" are accepted for the environment variable
TZDIR. (See ?TZDIR.)
On macOS, "macOS" is used by default if the system timezone database is a newer
version than that in the R installation.
When install.packages(type = "source") fails to find a package in a repository
it mentions package versions which are excluded by their R version requirement and
links to hints on why a package might not be found.
The default value for options("timeout") can be set from environment variable
R_DEFAULT_INTERNET_TIMEOUT, still defaulting to 60 (seconds) if that is not set or
invalid.
This may be needed when child R processes are doing downloads, for example during
the installation of source packages which download jars or other forms of data.
BUG FIXES:
The code mitigating stack overflow with PCRE regexps on very long strings is enabled
for PCRE2 < 10.30 also when JIT is enabled, since stack overflows have been seen in
that case.
Fix to correctly show the group labels in dotchart() (which where lost in the ylab
improvement for R 4.0.0).
addmargins(*,..) now also works when fn() is a local function, thanks to bug report
and patch PR#17124 from Alex Bertram.
rank(x) and hence sort(x) now work when x is an object (as per is.object(x)) of
type "raw" and provides a valid `[` method, e.g., for gmp::as.bigz(.) numbers.
chisq.test(*,simulate.p.value=TRUE) and r2dtable() now work correctly for
large table entries (in the millions). Reported by Sebastian Meyer and investigated
by more helpers in PR#16184.
Low-level socket read/write operations have been fixed to correctly signal communi-
cation errors. Previously, such errors could lead to a segfault due to invalid memory
access. Reported and debugged by Dmitriy Selivanov in PR#17850.
quantile(x,pr) works more consistently for pr values slightly outside [0,1], thanks
to Suharto Anggono’s PR#17891.
Further, quantile(x,prN,names=FALSE) now works even when prN contains NAs,
thanks to Anggono’s PR#17892. Ditto for ordered factors or Date objects when
type = 1 or 3, thanks to PR#17899.
Libcurl-based internet access, including curlGetHeaders(), was not respecting the
"timeout" option. If this causes unanticipated timeouts, consider increasing the
default by setting R_DEFAULT_INTERNET_TIMEOUT.
as.Date(<char>) now also works with an initial "", thanks to Michael Chirico’s
PR#17909.
isS3stdGeneric(f) now detects an S3 generic also when it it is trace()d, thanks
to Gabe Becker’s PR#17917.
R_allocLD() has been fixed to return memory aligned for long double type
PR#16534.
fisher.test() no longer segfaults when called again after its internal stack has been
exceeded PR#17904.
Accessing a long vector represented by a compact integer sequence no longer segfaults
(reported and debugged by Hugh Parsonage).
duplicated() now works also for strings with multiple encodings inside a single vector
PR#17809.
phyper(11,15,0,12,log.p=TRUE) no longer gives NaN; reported as PR#17271 by
Alexey Stukalov.
Fix incorrect calculation in logLik.nls() PR#16100, patch from Sebastian Meyer.
A very old bug could cause a segfault in model.matrix() when terms involved logical
variables. Part of PR#17879.
4 NEWS
CHANGES IN R 4.0.2
UTILITIES:
R CMD check skips vignette re-building (with a warning) if the ‘VignetteBuilder’
package(s) are not available.
BUG FIXES:
Paths with non-ASCII characters caused problems for package loading on Windows
PR#17833.
Using tcltk widgets no longer crashes R on Windows.
source(*,echo=TRUE) no longer fails in some cases with empty lines; reported by
Bill Dunlap in PR#17769.
on.exit() now correctly matches named arguments, thanks to PR#17815 (including
patch) by Brodie Gaslam.
regexpr(*,perl=TRUE) no longer returns incorrect positions into text containing
characters outside of the Unicode Basic Multilingual Plane on Windows.
CHANGES IN R 4.0.1
NEW FEATURES:
paste() and paste0() gain a new optional argument recycle0. When set to
true, zero-length arguments are recycled leading to character(0) after the sep-
concatenation, i.e., to the empty string "" if collapse is a string and to the zero-
length value character(0) when collapse = NULL.
A package whose code uses this should depend on ‘R (>= 4.0.1)’.
The summary(<warnings>) method now maps the counts correctly to the warning
messages.
BUG FIXES:
aov(frml,...) now also works where the formula deparses to more than 500 char-
acters, thanks to a report and patch proposal by Jan Hauffa.
Fix a dozen places (code, examples) as Sys.setlocale() returns the new rather than
the previous setting.
Fix for adding two complex grid units via sum(). Thanks to Gu Zuguang for the
report and Thomas Lin Pedersen for the patch.
Fix parallel::mclapply(...,mc.preschedule=FALSE) to handle raw vector results
correctly. PR#17779
Computing the base value, i.e., 2, “everywhere”, now uses FLT_RADIX, as the original
‘machar’ code looped indefinitely on the ppc64 architecture for the longdouble case.
NEWS 5
CHANGES IN R 4.0.0
SIGNIFICANT USER-VISIBLE CHANGES:
Packages need to be (re-)installed under this version (4.0.0) of R.
matrix objects now also inherit from class "array", so e.g., class(diag(1))
is c("matrix","array"). This invalidates code incorrectly assuming that
class(matrix_obj)) has length one.
S3 methods for class "array" are now dispatched for matrix objects.
There is a new syntax for specifying raw character constants similar to the one used
in C++: r"(...)" with ... any character sequence not containing the sequence
‘)"’. This makes it easier to write strings that contain backslashes or both single and
double quotes. For more details see ?Quotes.
R now uses a ‘stringsAsFactors = FALSE’ default, and hence by default no longer
converts strings to factors in calls to data.frame() and read.table().
A large number of packages relied on the previous behaviour and so have needed/will
need updating.
The plot() S3 generic function is now in package base rather than package graphics,
as it is reasonable to have methods that do not use the graphics package. The generic
is currently re-exported from the graphics namespace to allow packages importing it
from there to continue working, but this may change in future.
Packages which define S4 generics for plot() should be re-installed and package code
using such generics from other packages needs to ensure that they are imported rather
than rely on their being looked for on the search path (as in a namespace, the base
namespace has precedence over the search path).
REFERENCE COUNTING:
Reference counting is now used instead of the NAMED mechanism for determining when
objects can be safely mutated in base C code. This reduces the need for copying in
some cases and should allow further optimizations in the future. It should help make
the internal code easier to maintain.
This change is expected to have almost no impact on packages using supported coding
practices in their C/C++ code.
MIGRATION TO PCRE2:
This version of R is built against the PCRE2 library for Perl-like regular expressions,
if available. (On non-Windows platforms PCRE1 can optionally be used if PCRE2
is not available at build time.) The version of PCRE in use can be obtained via
extSoftVersion(): PCRE1 (formerly known as ‘PCRE’) has versions <= 8, PCRE2
versions >= 10.
6 NEWS
NEW FEATURES:
assertError() and assertWarning() (in package tools) can now check for specific
error or warning classes via the new optional second argument classes (which is not
back compatible with previous use of an unnamed second argument).
DF2formula(), the utility for the data frame method of formula(), now works with-
out parsing and explicit evaluation, starting from Suharto Anggono’s suggestion in
PR#17555.
approxfun() and approx() gain a new argument na.rm defaulting to true. If set to
false, missing y values now propagate into the interpolated values.
Long vectors are now supported as the seq argument of a for() loop.
str(x) gets a new deparse.lines option with a default to speed it up when x is a
large call object.
The internal traceback object produced when an error is signalled (.Traceback),
now contains the calls rather than the deparse()d calls, deferring the deparsing
to the user-level functions .traceback() and traceback(). This fulfils the wish of
PR#17580, reported including two patch proposals by Brodie Gaslam.
data.matrix() now converts character columns to factors and from this to integers.
package.skeleton() now explicitly lists all exports in the ‘NAMESPACE’ file.
New function .S3method() to register S3 methods in R scripts.
file.path() has some support for file paths not in the session encoding, e.g. with
UTF-8 inputs in a non-UTF-8 locale the output is marked as UTF-8.
Most functions with file-path inputs will give an explicit error if a file-path input in
a marked encoding cannot be translated (to the native encoding or in some cases
on Windows to UTF-8), rather than translate to a different file path using es-
capes. Some (such as dir.exists(), file.exists(), file.access(), file.info(),
list.files(), normalizePath() and path.expand()) treat this like any other non-
existent file, often with a warning.
There is a new help document accessed by help("file path encoding") detailing
how file paths with marked encodings are handled.
New function list2DF() for creating data frames from lists of variables.
iconv() has a new option sub = "Unicode" to translate UTF-8 input invalid in the
‘to’ encoding using ‘<U+xxxx>’ escapes.
There is a new function infoRDS() providing information about the serialization
format of a serialized object.
S3 method lookup now by default skips the elements of the search path between the
global and base environments.
NEWS 7
New function proportions() and marginSums(). These should replace the unfortu-
nately named prop.table() and margin.table(). They are drop-in replacements,
but also add named-margin functionality. The old function names are retained as
aliases for back-compatibility.
Functions rbinom(), rgeom(), rhyper(), rpois(), rnbinom(), rsignrank() and
rwilcox() which have returned integer since R 3.0.0 and hence NA when the numbers
would have been outside the integer range, now return double vectors (without NAs,
typically) in these cases.
matplot(x,y) (and hence matlines() and matpoints()) now call the corresponding
methods of plot() and lines(), e.g, when x is a "Date" or "POSIXct" object;
prompted by Spencer Graves’ suggestion.
stopifnot() now allows customizing error messages via argument names, thanks to
a patch proposal by Neal Fultz in PR#17688.
unlink() gains a new argument expand to disable wildcard and tilde expansion.
Elements of x of value "~" are now ignored.
mle() in the stats4 package has had its interface extended so that arguments to the
negative log-likelihood function can be one or more vectors, with similar conventions
applying to bounds, start values, and parameter values to be kept fixed. This required
a minor extension to class "mle", so saved objects from earlier versions may need to
be recomputed.
The default for pdf() is now useDingbats = FALSE.
The default fill colour for hist() and boxplot() is now col = "lightgray".
The default order of the levels on the y-axis for spineplot() and cdplot() has been
reversed.
If the R_ALWAYS_INSTALL_TESTS environment variable is set to a true value, R CMD
INSTALL behaves as if the ‘--install-tests’ option is always specified. Thanks to
Reinhold Koch for the suggestion.
New function R_user_dir() in package tools suggests paths appropriate for storing
R-related user-specific data, configuration and cache files.
capabilities() gains a new logical option Xchk to avoid warnings about X11-related
capabilities.
The internal implementation of grid units has changed, but the only visible effects at
user-level should be
– a slightly different print format for some units (especially unit arithmetic),
– faster performance (for unit operations) and
– two new functions unitType() and unit.psum().
Based on code contributed by Thomas Lin Pedersen.
When internal dispatch for rep.int() and rep_len() fails, there is an attempt to
dispatch on the equivalent call to rep().
Object .Machine now contains new longdouble.* entries (when R uses long doubles
internally).
news() has been enhanced to cover the news on R 3.x and 2.x.
For consistency, N <-NULL; N[[1]] <-val now turns N into a list also when val) has
length one. This enables dimnames(r1)[[1]] <-"R1" for a 1-row matrix r1, fixing
PR#17719 reported by Serguei Sokol.
NEWS 9
Windows:
Rterm now works also when invoked from MSYS2 terminals. Line editing is possible
when command winpty is installed.
normalizePath() now resolves symbolic links and normalizes case of long names of
path elements in case-insensitive folders (PR#17165).
md5sum() supports UTF-8 file names with characters that cannot be translated to
the native encoding (PR#17633).
Rterm gains a new option ‘--workspace’ to specify the workspace to be restored.
This allows equals to be part of the name when opening via Windows file associations
(reported by Christian Asseburg).
Rterm now accepts ALT+xxx sequences also with NumLock on. Tilde can be pasted
with an Italian keyboard (PR#17679).
10 NEWS
R falls back to copying when junction creation fails during package checking (patch
from Duncan Murdoch).
C-LEVEL FACILITIES:
installChar is now remapped in ‘Rinternals.h’ to installTrChar, of which it
has been a wrapper since R 3.6.0. Neither are part of the API, but packages using
installChar can replace it if they depend on ‘R >= 3.6.2’.
Header ‘R_ext/Print.h’ defines ‘R_USE_C99_IN_CXX’ and hence exposes Rvprintf
and REvprintf if used with a C++11 (or later) compiler.
There are new Fortran subroutines dblepr1, realpr1 and intpr1 to print a scalar
variable (gfortran 10 enforces the distinction between scalars and length-one arrays).
Also labelpr to print just a label.
R_withCallingErrorHandler is now available for establishing a calling handler in C
code for conditions inheriting from class error.
INSTALLATION on a UNIX-ALIKE:
User-set ‘DEFS’ (e.g., in ‘config.site’) is now used for compiling packages (including
base packages).
There is a new variant option ‘--enable-lto=check’ for checking consistency of
BLAS/LAPACK/LINPACK calls — see ‘Writing R Extensions’.
A C++ compiler default is set only if the C++11 standard is supported: it no longer
falls back to C++98.
PCRE2 is used if available. To make use of PCRE1 if PCRE2 is unavailable, configure
with option ‘--with-pcre1’.
The minimum required version of libcurl is now 7.28.0 (Oct 2012).
New make target distcheck checks
– R can be rebuilt from the tarball created by make dist,
NEWS 11
UTILITIES:
R --help now mentions the option --no-echo (renamed from --slave) and its pre-
viously undocumented short form -s.
R CMD check now optionally checks configure and cleanup scripts for non-Bourne-
shell code (‘bashisms’).
R CMD check --as-cran now runs \donttest examples (which are run by example())
instead of instructing the tester to do so. This can be temporarily circumvented during
development by setting environment variable _R_CHECK_DONTTEST_EXAMPLES_ to a
false value.
PACKAGE INSTALLATION:
There is the beginnings of support for the recently approved C++20 standard, spec-
ified analogously to C++14 and C++17. There is currently only limited support for
this in compilers, with flags such as ‘-std=c++20’ and ‘-std=c++2a’. For the time
being the configure test is of accepting one of these flags and compiling C++17
code.
BUG FIXES:
formula(x) with length(x) > 1 character vectors, is deprecated now. Such use has
been rare, and has ‘worked’ as expected in some cases only. In other cases, wrong x
have silently been truncated, not detecting previous errors.
Long-standing issue where the X11 device could lose events shortly after startup has
been addressed (PR#16702).
The data.frame method for rbind() no longer drops <NA> levels from factor columns
by default (PR#17562).
available.packages() and hence install.packages() now pass their ... argu-
ment to download.file(), fulfilling the wish of PR#17532; subsequently, avail-
able.packages() gets new argument quiet, solving PR#17573.
stopifnot() gets new argument exprObject to allow an R object of class expression
(or other ‘language’) to work more consistently, thanks to suggestions by Suharto
Anggono.
conformMethod() now works correctly in cases containing a “&& logic” bug, reported
by Henrik Bengtsson. It now creates methods with "missing" entries in the signature.
Consequently, rematchDefinition() is amended to use appropriate .local() calls
with named arguments where needed.
format.default(*,scientific = FALSE) now corresponds to a practically most ex-
treme options(scipen = n) setting rather than arbitrary n = 100.
format(as.symbol("foo")) now works (returning "foo").
postscript(..,title = *) now signals an error when the title string contains a
character which would produce corrupt PostScript, thanks to PR#17607 by Daisuko
Ogawa.
Certain Ops (notably comparison such as ==) now also work for 0-length data frames,
after reports by Hilmar Berger.
12 NEWS
smoothEnds(x) now returns integer type in both cases when x is integer, thanks
to a report and proposal by Bill Dunlap PR#17693.
The methods package does a better job of tracking inheritance relationships across
packages.
norm(diag(c(1,NA)),"2") now works.
subset() had problems with 0-col dataframes (reported by Bill Dunlap, PR#17721).
Several cases of integer overflow detected by the ‘undefined behaviour sanitizer’ of
clang 10 have been circumvented. One in rhyper() may change the generated value
for large input values.
dotchart() now places the y-axis label (ylab) much better, not overplotting labels,
thanks to a report and suggestion by Alexey Shipunov.
A rare C-level array overflow in chull() has been worked around.
Some invalid specifications of the day-of-the-year (via %j, e.g. day 366 in 2017) or
week plus day-of-the-week are now detected by strptime(). They now return NA but
give a warning as they may have given random results or corrupted memory in earlier
versions of R.
socketConnection(server = FALSE) now respects the connection timeout also on
Linux.
socketConnection(server = FALSE) no longer leaks a connection that is available
right away without waiting (e.g. on ‘localhost’).
Socket connections are now robust against spurious readability and spurious avail-
ability of an incoming connection.
blocking = FALSE is now respected also on the server side of a socket connection,
allowing non-blocking read operations.
anova.glm() and anova.glmlist() computed incorrect score (Rao) tests in no-
intercept cases. (André Gillibert, PR#17734)
summaryRprof() now should work correctly for the
Rprof(*,memory.profiling=TRUE) case with small chunk size (and "tseries" or
similar) thanks to a patch proposal by Benjamin Tyner, in PR#15886.
xgettext() ignores strings passed to ngettext(), since the latter is handled by
xngettext(). Thanks to Daniele Medri for the report and all the recent work he has
done on the Italian translations.
data(package = "P") for P in base and stats no longer reports the data sets from
package datasets (which it did for back compatibility for 16 years), fixing PR#17730.
x[[Inf]] (returning NULL) no longer leads to undefined behavior, thanks to a report
by Kirill Müller in PR#17756. Further, x[[-Inf]] and x[[-n]] now give more
helpful error messages.
Gamma() family sometimes had trouble storing link name PR#15891