Why have CRS, projections and transformations changed?

Introduction

Changes in the representation of coordinate reference systems (CRS), and of operations on coordinates, that have been occurring over decades must now be implemented in the way spatial objects are handled in R packages. Up to the 1990s, most spatial data simply used the coordinates given by the local mapping authority; for example, the Meuse bank data set used a planar representation in metres, which turned out to be EPSG:28992, “Amersfoort / RD New”. A major resource for finding out why CRS were specified as they were are Clifford J. Mugnier’s columns in Photogrammetric Engineering & Remote Sensing, references to which are available in rgdal; the Netherlands were covered in the February 2003 column:

td <- tempfile()
dir.create(td)
Sys.setenv("PROJ_USER_WRITABLE_DIRECTORY"=td)
library(rgdal)
## Loading required package: sp
## Please note that rgdal will be retired during October 2023,
## plan transition to sf/stars/terra functions using GDAL and PROJ
## at your earliest convenience.
## See https://r-spatial.org/r/2023/05/15/evolution4.html and https://github.com/r-spatial/evolution
## rgdal: version: 1.6-7, (SVN revision 1203)
## Geospatial Data Abstraction Library extensions to R successfully loaded
## Loaded GDAL runtime: GDAL 3.8.4, released 2024/02/08
## Path to GDAL shared files: /usr/share/gdal
##  GDAL does not use iconv for recoding strings.
## GDAL binary built with GEOS: TRUE 
## Loaded PROJ runtime: Rel. 9.4.0, March 1st, 2024, [PJ_VERSION: 940]
## Path to PROJ shared files: /tmp/Rtmp2wkAla/file135232004c44:/usr/share/proj
## PROJ CDN enabled: FALSE
## Linking to sp version:2.1-4
## To mute warnings of possible GDAL/OSR exportToProj4() degradation,
## use options("rgdal_show_exportToProj4_warnings"="none") before loading sp or rgdal.
data("GridsDatums")
GridsDatums[grep("Netherlands", GridsDatums$country),]
##                                country      month year ISO
## 50  Aruba and the Netherlands Antilles     (July) 2002 ABW
## 57      The Kingdom of the Netherlands (February) 2003 NLD
## 243     The Kingdom of the Netherlands (February) 2021 NLD

While most national mapping agencies defined their own standard geographical and projected CRS, supranational bodies, such as military alliances and colonial administrations often imposed some regularity to facilitate operations across national boundaries. This also led to the creation of the European Petroleum Survey Group (EPSG), because maritime jurisdiction was not orderly, and mattered when countries sharing the same coastal shelf tried to assign conflicting exploration concessions. Experts from oil companies accumulated vast experience, which fed through to the International Standards Organization (ISO, especially TC 211) and the Open Geospatial Consortium (OGC).

Defining the CRS became necessary when integrating other data with a different CRS, and for displaying on a web map background. Many legacy file formats, such as the ESRI Shapefile format, did not mandate the inclusion of the CRS of positional data. Most open source software then used PROJ.4 strings as a flexible representation, but as internationally accepted standards have been reached, in particular ISO 19111, and improved over time by iteration, it is really necessary to change to a modern text representation, known as WKT2 (2019). Now it looks as though almost all corporations and mapping agencies accommodate this representation, and it has been adopted by sp through rgdal, sf and other packages.

demo(meuse, ask=FALSE, package="sp", echo=FALSE)
library(mapview)
x <- mapview(meuse, zcol="zinc")
mapshot(x, file="meuse.png")
knitr::include_graphics(system.file("misc/meuse.png", package="rgdal"))

Coordinate reference system representation

The mapproj package provided coordinate reference system and projection support for the maps package. From mapproj/src/map.h, line 20, we can see that the eccentricity of the Earth is defined as 0.08227185422, corrresponding to the Clark 1866 ellipsoid (Iliffe and Lott 2008):

ellps <- projInfo("ellps")
(clrk66 <- ellps[ellps$name=="clrk66",])
##      name       major         ell description
## 16 clrk66 a=6378206.4 b=6356583.8 Clarke 1866
#eval(parse(text=clrk66$major))
#eval(parse(text=clrk66$ell))
a <- 6378206.4
b <- 6356583.8
print(sqrt((a^2-b^2)/a^2), digits=10)
## [1] 0.08227185422

With a very few exceptions, projections included in mapproj::mapproject() use the Clarke 1866 ellipsoid, with the remainder using a sphere with the Clarke 1866 major axis radius. The function returns coordinates for visualization in an unknown metric; no inverse projections are available.

Like many other free and open source software projects around 2000, R spatial development chose to use the best open source library and infrastructure then available, PROJ.4. Version 4.4 was published by Frank Warmerdam in 2000, based on Gerald Evenden’s earlier work. This earlier work was a library for forward and inverse projection using a key-value string interface to describe the required projection (Evenden 1990). The key-value string is taken as +key=value, where =value could be omitted for some keys, and the definition of each projection is built up from space-separated key-value string, such as +proj=utm +zone=25 +south for Universal Transverse Mercator zone 25 in the southern hemisphere.

PROJ.4 2000-2018

Unlike mapproj, PROJ.4 had begun to introduce the +ellps= key in addition to projection parameters before PROJ.4.4, as some users would not treat Clark 1866 as their natural preference. The need for more care in geodetic specification became pressing from 2000, when civilian use of GPS became as accurate as military use; GPS was typically registered to a more modern geographical CRS than digitized printed maps had been, and most mapping agencies scrambled to update their products and services to “GPS coordinates”.

The ellps= key was followed by the +datum=, +nadgrids= and +towgs84= keys in successive releases to attempt to specify the geodetic model. The +init= key appeared to permit the look-up of sets of key-value strings in the packaged version of a given table with a known authority, typically +init=epsg:<code>, where <code> was the EPSG code of the coordinate reference system. Where +towgs84= was given, a three or seven parameter transformation to the WGS84 datum was provided as a comma-separated string, so that the coordinate reference system also included the inverse coordinate operation from projected coordinates to geographical coordinates defined by the WGS84 datum. This led to the need for a placeholder for geographical coordinates, set as +proj=longlat (or lonlat, or perhaps with reversed axis order latlong or latlon). Some of the issues involved were discussed in a 2010 blog post by Frank Warmerdam.

The PROJ.4 framework functioned well for projection before it was expected to handle datum tranformation too. Within the remit of single mapping agencies, some adaptation could still provide help, say using +nadgrids= in parts of North America (NAD, North American Datum), but where positional data from multiple agencies was being integrated, the framework was showing its age. For example, from about 2010 it was observed that the +datum= and +towgs84= keys sometimes provided contradictory values, leading functions in GDAL reading raster files to prefer +towgs84= values to +datum= values. For users for whom accuracy of better than about 150m was irrelevant, using coordinate reference systems correctly defined in terms of the underlying geographical coordinates was less important, but this is still about five Landsat cells.

While the representation of coordinate reference systems (sometimes supplemented by coordinate operations to transform the underlying geographical coordinates to WGS84) as PROJ.4 strings continued to work adequately, changes were occurring. Many file formats chose to use WKT (well-known text) string representations, starting from the 2007 edition of the ISO standard (ISO 2019). This placed the file reading and writing functions offered by GDAL under stress, especially the exportToProj4() function, as an increasing number of specification components really did not map adequately from the WKT string representation to the PROJ.4 string representation. Another change was that pivoting through a chosen hub when transforming coordinates (in PROJ.4 WGS84) meant that accuracy was lost transforming from the source to the hub, and more was lost from the hub to the target. Why not transform from source to target in one step if possible?

PRϕJ

Signalling changes, PROJ.4 changed its name to PRϕJ, and began burning through major version numbers. PROJ 5 (2018) introduced transformation pipelines (Knudsen and Evers 2017; Evers and Knudsen 2017), representing coordinate operations using a syntax similar to PROJ.4 strings, but showing the whole operation pipeline. PROJ 6 (2019) followed up by shifting from ad-hoc text files for holding coordinate reference system and coordinate operation metadata to an SQLite database. In an increasing number of cases, more accurate coordinate operations could be supported using open access transformation grids, and the grid files needed were now tabulated in the SQLite database. This database is distributed with PROJ, and kept in a directory on the PROJ search path, usually the only or final directory (get_proj_search_paths() returns a vector of current search paths).

(shpr <- get_proj_search_paths())
## [1] "/tmp/Rtmp2wkAla/file135232004c44" "/usr/share/proj"
library(RSQLite)
db <- dbConnect(SQLite(), dbname=file.path(shpr[length(shpr)], "proj.db"))
dbListTables(db)
##  [1] "alias_name"                               
##  [2] "authority_list"                           
##  [3] "authority_to_authority_preference"        
##  [4] "axis"                                     
##  [5] "celestial_body"                           
##  [6] "compound_crs"                             
##  [7] "concatenated_operation"                   
##  [8] "concatenated_operation_step"              
##  [9] "conversion"                               
## [10] "conversion_method"                        
## [11] "conversion_param"                         
## [12] "conversion_table"                         
## [13] "coordinate_metadata"                      
## [14] "coordinate_operation_method"              
## [15] "coordinate_operation_view"                
## [16] "coordinate_operation_with_conversion_view"
## [17] "coordinate_system"                        
## [18] "crs_view"                                 
## [19] "deprecation"                              
## [20] "ellipsoid"                                
## [21] "extent"                                   
## [22] "geodetic_crs"                             
## [23] "geodetic_datum"                           
## [24] "geodetic_datum_ensemble_member"           
## [25] "geoid_model"                              
## [26] "grid_alternatives"                        
## [27] "grid_packages"                            
## [28] "grid_transformation"                      
## [29] "helmert_transformation"                   
## [30] "helmert_transformation_table"             
## [31] "metadata"                                 
## [32] "object_view"                              
## [33] "other_transformation"                     
## [34] "prime_meridian"                           
## [35] "projected_crs"                            
## [36] "scope"                                    
## [37] "sqlite_stat1"                             
## [38] "supersession"                             
## [39] "unit_of_measure"                          
## [40] "usage"                                    
## [41] "versioned_auth_name_mapping"              
## [42] "vertical_crs"                             
## [43] "vertical_datum"                           
## [44] "vertical_datum_ensemble_member"
(metadata <- dbReadTable(db, "metadata"))
##                              key
## 1  DATABASE.LAYOUT.VERSION.MAJOR
## 2  DATABASE.LAYOUT.VERSION.MINOR
## 3                      EPSG.DATE
## 4                   EPSG.VERSION
## 5                      ESRI.DATE
## 6                   ESRI.VERSION
## 7                      IGNF.DATE
## 8                    IGNF.SOURCE
## 9                   IGNF.VERSION
## 10                      NKG.DATE
## 11                    NKG.SOURCE
## 12                   NKG.VERSION
## 13                  PROJ.VERSION
## 14             PROJ_DATA.VERSION
##                                                                              value
## 1                                                                                1
## 2                                                                                3
## 3                                                                       2024-02-24
## 4                                                                          v11.004
## 5                                                                       2023-11-02
## 6                                                                   ArcGIS Pro 3.2
## 7                                                                       2019-05-24
## 8  https://raw.githubusercontent.com/rouault/proj-resources/master/IGNF.v3.1.0.xml
## 9                                                                            3.1.0
## 10                                                                      2020-12-21
## 11                          https://github.com/NordicGeodesy/NordicTransformations
## 12                                                                           1.0.0
## 13                                                                           9.4.0
## 14                                                                            1.17

PROJ 7 (2020) reconfigured the transformation grids, now using the Geodetic TIFF Grid (GTG) format, and created pathways for on-demand download (typically using a content download network (CDN) over CURL) of chunks of such grids to a local user-writable cache held in another SQLite database. After little change from the late 1990s to early 2018, PROJ.4 has incremented its major version number three times in three years, and by 2021 (PROJ 8), the pre-existing application programming interface will be history. In addition, GDAL 3 (2019) has tightened its links with PROJ >= 6, and exportToProj4() now says:

“Use of this function is discouraged. Its behavior in GDAL >= 3 / PROJ >= 6 is significantly different from earlier versions. In particular +datum will only encode WGS84, NAD27 and NAD83, and +towgs84/+nadgrids terms will be missing most of the time. PROJ strings to encode CRS should be considered as a a legacy solution. Using a AUTHORITY:CODE or WKT representation is the recommended way” (https://gdal.org/api/ogrspatialref.html).

For these reasons, sf and sp are changing from PROJ.4 strings representing coordinate reference systems to WKT2:2019 strings, as described in this r-spatial blog. Most users who had been relying on file reading to set the coordinate reference systems of objects will not notice much difference, and legacy PROJ.4 strings can still be used to create new, authority-free definitions if need be.

It will be useful to know that in general "OGC:CRS84" should be used instead of "EPSG:4326", because the latter presupposes that Latitude is always ordered before Longitude. "OGC:CRS84" is the standard representation used by GeoJSON, with coordinates always ordered Longitude before Latitude. This is also the standard GIS and visualization order:

cat(wkt(CRS(SRS_string="OGC:CRS84")), "\n")
## GEOGCRS["WGS 84 (CRS84)",
##     DATUM["World Geodetic System 1984",
##         ELLIPSOID["WGS 84",6378137,298.257223563,
##             LENGTHUNIT["metre",1]]],
##     PRIMEM["Greenwich",0,
##         ANGLEUNIT["degree",0.0174532925199433]],
##     CS[ellipsoidal,2],
##         AXIS["geodetic longitude (Lon)",east,
##             ORDER[1],
##             ANGLEUNIT["degree",0.0174532925199433]],
##         AXIS["geodetic latitude (Lat)",north,
##             ORDER[2],
##             ANGLEUNIT["degree",0.0174532925199433]],
##     USAGE[
##         SCOPE["unknown"],
##         AREA["World"],
##         BBOX[-90,-180,90,180]],
##     ID["OGC","CRS84"]]

Note that all "GEOGCRS" "BBOX" CRS bounding boxes are always specified latitude minimum, longitude minimum, latitude maximum, longitude maximum, even though the declared axis order is the opposite.

Coordinate operations

The introduction of interactive mapping using mapview and tmap among other packages highlights the need to set the coordinate reference system (CRS) of objects correctly, so that zooming does not reveal embarrassing divergences. Using the location of the Broad Street pump disabled by Dr John Snow to stop a cholera epidemic in Soho, London, in 1854 (Brody et al. 2000), we can start to see what steps are being taken. The point location of the pump is given in projected coordinates, defined in the British National Grid. The workflow used by mapview::mapview() is to transform first to WGS84 (OGC:CRS84) using sf::st_transform() if need be, before permitting leaflet to project to Web Mercator (EPSG:3857) internally.

b_pump <- readOGR(system.file("vectors/b_pump.gpkg", package="rgdal"))
## Warning in readOGR(system.file("vectors/b_pump.gpkg", package = "rgdal")): OGR
## support is provided by the sf and terra packages among others
## Warning in ogrListLayers(dsn = dsn): OGR support is provided by the sf and
## terra packages among others
## Warning in ogrInfo(dsn = dsn, layer = layer, encoding = encoding, use_iconv =
## use_iconv, : OGR support is provided by the sf and terra packages among others
## Warning in ogrFIDs(dsn = dsn, layer = layer): OGR support is provided by the sf
## and terra packages among others
## Warning in OGRSpatialRef(dsn, layer, morphFromESRI = morphFromESRI, dumpSRS =
## dumpSRS, : OGR support is provided by the sf and terra packages among others
## Warning in ogrListLayers(dsn): OGR support is provided by the sf and terra
## packages among others
## Warning in ogrFIDs(dsn = dsn, layer = layer): OGR support is provided by the sf
## and terra packages among others
## OGR data source with driver: GPKG 
## Source: "/tmp/RtmpMXM6SG/Rinstf4f9b2c122/rgdal/vectors/b_pump.gpkg", layer: "b_pump"
## with 1 features
## It has 1 fields
## Integer64 fields read as strings:  cat

Reading the file loses the PROJ.4 +datum= key-value pair, but the WKT2:2019 string is complete:

proj4string(b_pump)
## [1] "+proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +units=m +no_defs"
if (packageVersion("sp") > "1.4.1") {
  WKT <- wkt(b_pump)
} else {
  WKT <- comment(slot(b_pump, "proj4string"))
}
cat(WKT, "\n")
## PROJCRS["Transverse_Mercator",
##     BASEGEOGCRS["GCS_OSGB 1936",
##         DATUM["OSGB_1936",
##             ELLIPSOID["Airy 1830",6377563.396,299.3249646,
##                 LENGTHUNIT["metre",1]]],
##         PRIMEM["Greenwich",0,
##             ANGLEUNIT["Degree",0.0174532925199433]],
##         ID["EPSG",4277]],
##     CONVERSION["unnamed",
##         METHOD["Transverse Mercator",
##             ID["EPSG",9807]],
##         PARAMETER["Latitude of natural origin",49,
##             ANGLEUNIT["Degree",0.0174532925199433],
##             ID["EPSG",8801]],
##         PARAMETER["Longitude of natural origin",-2,
##             ANGLEUNIT["Degree",0.0174532925199433],
##             ID["EPSG",8802]],
##         PARAMETER["Scale factor at natural origin",0.999601,
##             SCALEUNIT["unity",1],
##             ID["EPSG",8805]],
##         PARAMETER["False easting",400000,
##             LENGTHUNIT["metre",1],
##             ID["EPSG",8806]],
##         PARAMETER["False northing",-100000,
##             LENGTHUNIT["metre",1],
##             ID["EPSG",8807]]],
##     CS[Cartesian,2],
##         AXIS["easting",east,
##             ORDER[1],
##             LENGTHUNIT["metre",1,
##                 ID["EPSG",9001]]],
##         AXIS["northing",north,
##             ORDER[2],
##             LENGTHUNIT["metre",1,
##                 ID["EPSG",9001]]]]

Pipelines

PROJ.4 assumed that the from/source and to/target coordinate reference system definitions involved in coordinate operations each contained the specifications necessary to get from source to WGS84 and then on from WGS84 to target. PRϕJ drops this assumption, searching among many candidate coordinate operations for viable pipelines. The search is conducted using the tables given in the proj.db SQLite database, which is now backed by authorities, and regularly updated at each release to the current upstream state (see https://proj.org/operations/operations_computation.html for more details). The tables are searched to find lists of candidates.

cov <- dbReadTable(db, "coordinate_operation_view")
## Warning: Column `code`: mixed type, first seen values of type integer, coercing
## other values of type string
## Warning: Column `method_code`: mixed type, first seen values of type integer,
## coercing other values of type string
## Warning: Column `source_crs_code`: mixed type, first seen values of type
## integer, coercing other values of type string
## Warning: Column `target_crs_code`: mixed type, first seen values of type
## integer, coercing other values of type string
cov[grep("OSGB", cov$name), c("table_name", "code", "name", "method_name", "accuracy")]
##                  table_name   code                           name
## 162     grid_transformation   5338           OSGB36 to ETRS89 (1)
## 163     grid_transformation   5339           OSGB36 to WGS 84 (7)
## 213     grid_transformation   7709           OSGB36 to ETRS89 (2)
## 214     grid_transformation   7710           OSGB36 to WGS 84 (9)
## 853     grid_transformation 108057  ETRS_1989_To_OSGB_1936_OSTN15
## 854     grid_transformation 108058   WGS_1984_To_OSGB_1936_OSTN15
## 855     grid_transformation 108059      OSGB_1936_To_Xrail84_NTv2
## 1072 helmert_transformation   1195           OSGB36 to WGS 84 (1)
## 1073 helmert_transformation   1196           OSGB36 to WGS 84 (2)
## 1074 helmert_transformation   1197           OSGB36 to WGS 84 (3)
## 1075 helmert_transformation   1198           OSGB36 to WGS 84 (4)
## 1076 helmert_transformation   1199           OSGB36 to WGS 84 (5)
## 1174 helmert_transformation   1314           OSGB36 to WGS 84 (6)
## 1175 helmert_transformation   1315             OSGB36 to ED50 (1)
## 1729 helmert_transformation   5622           OSGB36 to WGS 84 (8)
## 2566 helmert_transformation 108089 OSGB_1936_To_WGS_1984_8_BAD_DX
## 2718 helmert_transformation 108336 OSGB_1936_To_WGS_1984_NGA_7PAR
##                                         method_name accuracy
## 162                                            NTv2     0.03
## 163                                            NTv2     1.00
## 213                                            NTv2     0.03
## 214                                            NTv2     1.00
## 853                                            NTv2     0.03
## 854                                            NTv2     1.00
## 855                                            NTv2     0.50
## 1072        Geocentric translations (geog2D domain)    21.00
## 1073        Geocentric translations (geog2D domain)    10.00
## 1074        Geocentric translations (geog2D domain)    21.00
## 1075        Geocentric translations (geog2D domain)    18.00
## 1076        Geocentric translations (geog2D domain)    35.00
## 1174 Position Vector transformation (geog2D domain)     2.00
## 1175 Position Vector transformation (geog2D domain)     2.00
## 1729        Geocentric translations (geog2D domain)     3.00
## 2566        Geocentric translations (geog2D domain)     5.00
## 2718      Coordinate Frame rotation (geog2D domain)    21.00

The same search can be conducted directly without using RSQLite to query the database tables, searching by source and target CRS, and in the near future also by area of interest. If we search using only the degraded PROJ.4 string, we only find a ballpark accuracy coordinate operation, yielding a pipeline with two steps, inverse projection to geographical coordinates in radians, and conversion from radians to degrees. Note that rgdal::spTransform() and its wrapper sp::spTransform() use PROJ for coordinate operations. Here we will use "EPSG:4326" briefly for exposition:

list_coordOps(paste0(proj4string(b_pump), " +type=crs"), "EPSG:4326")
## Candidate coordinate operations found:  1 
## Strict containment:  FALSE 
## Visualization order:  TRUE 
## Source: +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000
##         +y_0=-100000 +ellps=airy +units=m +no_defs +type=crs
## Target: EPSG:4326 
## Best instantiable operation has only ballpark accuracy 
## Description: Inverse of unknown + Ballpark geographic offset from unknown to
##              WGS 84 + axis order change (2D)
## Definition:  +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
##              +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
##              +step +proj=unitconvert +xy_in=rad +xy_out=deg

The description component “+ axis order change (2D)” refers to the EPSG:4326 definition, which specifies Northings/Latitude as the first axis, and Eastings/Longitude as the second axis; in sp/rgdal workflows, it is assumed that GIS/visualization order with Eastings/Longitude as the first 2D axis and Northings/Latitude as the second axis is preferred. Because the input data are in GIS/visualization order already, the steps to swap axes to standards conformity and then back to GIS/visualization order cancel each other out.

We can see what is happening if we unset enforcing GIS/visualization order, with an axisswap coming into play:

set_enforce_xy(FALSE)
list_coordOps(paste0(proj4string(b_pump), " +type=crs"), "EPSG:4326")
## Candidate coordinate operations found:  1 
## Strict containment:  FALSE 
## Visualization order:  FALSE 
## Source: +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000
##         +y_0=-100000 +ellps=airy +units=m +no_defs +type=crs
## Target: EPSG:4326 
## Best instantiable operation has only ballpark accuracy 
## Description: Inverse of unknown + Ballpark geographic offset from unknown to
##              WGS 84
## Definition:  +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
##              +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
##              +step +proj=unitconvert +xy_in=rad +xy_out=deg
##              +step +proj=axisswap +order=2,1
set_enforce_xy(TRUE)

While rgdal tries to impose GIS/visualization axis order on the "EPSG:4326" specification, one may end up with a WKT2 string with Latitude first, followed by Longitude without wishing to do so. Using "OGC:CRS84" is more secure, because it does not require such ad-hoc re-specification.

Setting the internal control option set_transform_wkt_comment() to FALSE, we use only the degraded PROJ.4 string when transforming. spTransform() undertakes the same search, chooses the best instantiable coordinate operation on its first pass, then uses that pipeline on all objects. The pipeline specification of the coordinate operation may be retrieved using get_last_coordOp():

set_transform_wkt_comment(FALSE)
isballpark <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84"))
## Warning in project(t(bbox(obj))[, 1:2], tg, inv = TRUE, use_aoi = FALSE): PROJ
## support is provided by the sf and terra packages among others
get_last_coordOp()
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=unitconvert +xy_in=rad +xy_out=deg"

The coordinate returned is unfortunately in Ingestre Place, not Broad Street.

print(coordinates(isballpark), digits=10)
##          coords.x1  coords.x2
## [1,] -0.1351070464 51.5127926

Let us repeat the search using the WKT2 string; here we see that providing a well-specified CRS representation allows us to choose 2m accuracy for the coordinate operation. Further, we can also see that, had we had access to a named grid, we could have achieved 1m accuracy:

The Helmert transformation has parameters retrieved from the PROJ SQLite database (code 1314):

helm <- dbReadTable(db, "helmert_transformation_table")
## Warning: Column `code`: mixed type, first seen values of type integer, coercing
## other values of type string
## Warning: Column `source_crs_code`: mixed type, first seen values of type
## integer, coercing other values of type string
## Warning: Column `target_crs_code`: mixed type, first seen values of type
## integer, coercing other values of type string
helm[helm$code == "1314", c("auth_name", "code", "name", "tx", "ty", "tz", "rx", "ry", "rz", "scale_difference")]
##     auth_name code                 name      tx       ty     tz   rx    ry
## 239      EPSG 1314 OSGB36 to WGS 84 (6) 446.448 -125.157 542.06 0.15 0.247
##        rz scale_difference
## 239 0.842          -20.489
dbDisconnect(db)

Using the WKT2 CRS representation, we can achieve 2m accuracy (or as the table says in other fields: “Oil exploration. Accuracy better than 4m and generally better than 2m”:

set_transform_wkt_comment(TRUE)
is2m <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84"))
## Warning in project(t(bbox(obj))[, 1:2], tg, inv = TRUE, use_aoi = FALSE): PROJ
## support is provided by the sf and terra packages among others
get_last_coordOp()
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=push +v_3 +step +proj=cart +ellps=airy +step +proj=helmert +x=446.448 +y=-125.157 +z=542.06 +rx=0.15 +ry=0.247 +rz=0.842 +s=-20.489 +convention=position_vector +step +inv +proj=cart +ellps=WGS84 +step +proj=pop +v_3 +step +proj=unitconvert +xy_in=rad +xy_out=deg"

The output point is close to the Broad Street pump:

print(coordinates(is2m), digits=10)
##          coords.x1   coords.x2
## [1,] -0.1367127374 51.51330218

It is over 100m West-North-West of the Ingestre Place position:

c(spDists(isballpark, is2m)*1000)
## [1] 125.0577
c(maptools::gzAzimuth(coordinates(isballpark), coordinates(is2m)))

This was about as good as one could get prior to PROJ 7 without downloading the missing grid file manually, and installing the downloaded file in a directory that would usually not be user-writable. The whole set of grids can be downloaded and installed manually for workgroups needing to be sure that the same grids are available to all users, as has been the case in the past as well.

The rgdal::project() uses the underlying geographical coordinate reference system, and does not transform, so using the degraded PROJ.4 string and WKT2 give the same output, and because the input is projected, we take the inverse:

(a <- project(coordinates(b_pump), proj4string(b_pump), inv=TRUE, verbose=TRUE))
## Warning in project(coordinates(b_pump), proj4string(b_pump), inv = TRUE, : PROJ
## support is provided by the sf and terra packages among others
## Warning in project(bb, proj, inv = inv, use_aoi = FALSE): PROJ support is
## provided by the sf and terra packages among others
## +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601
## +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=unitconvert +xy_in=rad
## +xy_out=deg
##      coords.x1 coords.x2
## [1,] -0.135107  51.51279
(b <- project(coordinates(b_pump), WKT, inv=TRUE))
## Warning in project(coordinates(b_pump), WKT, inv = TRUE): PROJ support is
## provided by the sf and terra packages among others
## Warning in project(coordinates(b_pump), WKT, inv = TRUE): PROJ support is
## provided by the sf and terra packages among others
##      coords.x1 coords.x2
## [1,] -0.135107  51.51279

The projected points only inverse project the projected coordinates using the specified projection

all.equal(a, b)
## [1] TRUE
c(spDists(coordinates(isballpark), a)*1000)
## [1] 0
list_coordOps(WKT, "OGC:CRS84")
## Candidate coordinate operations found:  9 
## Strict containment:  FALSE 
## Visualization order:  TRUE 
## Source: PROJCRS["Transverse_Mercator", BASEGEOGCRS["GCS_OSGB 1936",
##         DATUM["OSGB_1936", ELLIPSOID["Airy 1830", 6377563.396,
##         299.3249646, LENGTHUNIT["metre", 1]]],
##         PRIMEM["Greenwich", 0, ANGLEUNIT["Degree",
##         0.0174532925199433]], ID["EPSG", 4277]],
##         CONVERSION["unnamed", METHOD["Transverse Mercator",
##         ID["EPSG", 9807]], PARAMETER["Latitude of natural
##         origin", 49, ANGLEUNIT["Degree", 0.0174532925199433],
##         ID["EPSG", 8801]], PARAMETER["Longitude of natural
##         origin", -2, ANGLEUNIT["Degree", 0.0174532925199433],
##         ID["EPSG", 8802]], PARAMETER["Scale factor at natural
##         origin", 0.999601, SCALEUNIT["unity", 1], ID["EPSG",
##         8805]], PARAMETER["False easting", 400000,
##         LENGTHUNIT["metre", 1], ID["EPSG", 8806]],
##         PARAMETER["False northing", -100000,
##         LENGTHUNIT["metre", 1], ID["EPSG", 8807]]],
##         CS[Cartesian, 2], AXIS["easting", east, ORDER[1],
##         LENGTHUNIT["metre", 1, ID["EPSG", 9001]]],
##         AXIS["northing", north, ORDER[2], LENGTHUNIT["metre",
##         1, ID["EPSG", 9001]]]]
## Target: OGC:CRS84 
## Best instantiable operation has accuracy: 2 m
## Description: Inverse of unnamed + OSGB36 to WGS 84 (6) + axis order change
##              (2D)
## Definition:  +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
##              +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
##              +step +proj=push +v_3 +step +proj=cart +ellps=airy
##              +step +proj=helmert +x=446.448 +y=-125.157
##              +z=542.06 +rx=0.15 +ry=0.247 +rz=0.842 +s=-20.489
##              +convention=position_vector +step +inv +proj=cart
##              +ellps=WGS84 +step +proj=pop +v_3 +step
##              +proj=unitconvert +xy_in=rad +xy_out=deg
## Operation 8 is lacking 1 grid with accuracy 1 m
## Missing grid: uk_os_OSTN15_NTv2_OSGBtoETRS.tif 
## URL: https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif

Areas of interest

The search for candidate coordinate operations may be expedited if the area of interest is provided. This is a vector of minumum longitude and latitude followed by maximum longitude and latitude, so we need to inverse project the bounding box of projected sp objects and coerce that matrix into the required order:

if (is.projected(b_pump)) { 
  o <- project(t(unclass(bbox(b_pump))), wkt(b_pump), inv=TRUE)
} else {
  o <- t(unclass(bbox(b_pump)))
}
## Warning in project(t(unclass(bbox(b_pump))), wkt(b_pump), inv = TRUE): PROJ
## support is provided by the sf and terra packages among others
## Warning in project(bb, proj, inv = inv, use_aoi = FALSE): PROJ support is
## provided by the sf and terra packages among others
(aoi <- c(t(o + c(-0.1, +0.1))))
## [1] -0.23510705 51.41279260 -0.03510705 51.61279260

Without an area of interest, the coordinate operation look-up found 8 candidate operations, with a given area of interest, only 5 are found with the strict_containment= argument taking its default value of FALSE, so that the candidates found do not have to contain the area of interest strictly:

nrow(list_coordOps(WKT, "OGC:CRS84", area_of_interest=aoi))
## [1] 5

In this case, changing this argument to TRUE makes no further difference:

nrow(list_coordOps(WKT, "OGC:CRS84", strict_containment=TRUE, area_of_interest=aoi))
## [1] 5

Methods for projection and transformation use the area of interest implicit in the data object, controlled by an argument: use_aoi, default TRUE, for +proj=ob_tran,FALSE`.

Transformation grid files

PROJ 7 introduced on-demand downloading of (chunks of) transformation grids from a content delivery network to a user-writable directory on the PROJ search path (usually the first path component). The status of the downloaded grids is stored in another SQLite database, cache.db. The PRϕJ user-writable CDN directory is set as soon as the internal search path is queried, and for most uses, the default value will allow all programs using PRϕJ such as R packages, QGIS, GRASS, etc., to access any downloaded grids. Grids are checked for staleness at regular intervals. This directory may be set to a non-default value with the PROJ_USER_WRITABLE_DIRECTORY environment variable before rgdal (and any other package using PRϕJ) is loaded and attached, from PRϕJ >= 7.1.0. The code used at the beginning of this vignette is repeated here for convenience:

td <- tempfile()
dir.create(td)
Sys.setenv("PROJ_USER_WRITABLE_DIRECTORY"=td)
library(rgdal)

Let us check that rgdal is running with on-demand downloading disabled:

is_proj_CDN_enabled()
## [1] FALSE

We can enable on-demand download using a function that reports the value of the PRϕJ CDN user-writable directory; we see that with this setting, network download is enabled:

enable_proj_CDN()
## [1] "Using: /tmp/Rtmp2wkAla/file135232004c44"
is_proj_CDN_enabled()
## [1] TRUE

The reader will recall that the search path was stored in shpr above, the first element being the user-writable directory; at this point no SQLite database has been created:

shpr[1]
## [1] "/tmp/Rtmp2wkAla/file135232004c44"
try(file.size(file.path(shpr[1], "cache.db")))
## [1] NA

When we then search for candidate coordinate operations, we see that the operation using an absent grid now sees that download is enabled, and proposes the 1m accuracy candidate, because the required grid can be downloaded (again using the area of interest):

list_coordOps(WKT, "OGC:CRS84", area_of_interest=aoi)
## Candidate coordinate operations found:  5 
## Search specification: Area of interest:  -0.235107046388033, 51.4127925984147, -0.0351070463880326, 51.6127925984147 
## Strict containment:  FALSE 
## Visualization order:  TRUE 
## Source: PROJCRS["Transverse_Mercator", BASEGEOGCRS["GCS_OSGB 1936",
##         DATUM["OSGB_1936", ELLIPSOID["Airy 1830", 6377563.396,
##         299.3249646, LENGTHUNIT["metre", 1]]],
##         PRIMEM["Greenwich", 0, ANGLEUNIT["Degree",
##         0.0174532925199433]], ID["EPSG", 4277]],
##         CONVERSION["unnamed", METHOD["Transverse Mercator",
##         ID["EPSG", 9807]], PARAMETER["Latitude of natural
##         origin", 49, ANGLEUNIT["Degree", 0.0174532925199433],
##         ID["EPSG", 8801]], PARAMETER["Longitude of natural
##         origin", -2, ANGLEUNIT["Degree", 0.0174532925199433],
##         ID["EPSG", 8802]], PARAMETER["Scale factor at natural
##         origin", 0.999601, SCALEUNIT["unity", 1], ID["EPSG",
##         8805]], PARAMETER["False easting", 400000,
##         LENGTHUNIT["metre", 1], ID["EPSG", 8806]],
##         PARAMETER["False northing", -100000,
##         LENGTHUNIT["metre", 1], ID["EPSG", 8807]]],
##         CS[Cartesian, 2], AXIS["easting", east, ORDER[1],
##         LENGTHUNIT["metre", 1, ID["EPSG", 9001]]],
##         AXIS["northing", north, ORDER[2], LENGTHUNIT["metre",
##         1, ID["EPSG", 9001]]]]
## Target: OGC:CRS84 
## Best instantiable operation has accuracy: 1 m
## Description: Inverse of unnamed + OSGB36 to WGS 84 (9) + axis order change
##              (2D)
## Definition:  +proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2
##              +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy
##              +step +proj=hgridshift
##              +grids=uk_os_OSTN15_NTv2_OSGBtoETRS.tif +step
##              +proj=unitconvert +xy_in=rad +xy_out=deg

On making the transformation, we may see that the coordinate operation takes longer than expected, because on first pass the grid is downloaded from the network:

system.time(is1m <- spTransform(b_pump, CRS(SRS_string="OGC:CRS84")))
## Warning in project(t(bbox(obj))[, 1:2], tg, inv = TRUE, use_aoi = FALSE): PROJ
## support is provided by the sf and terra packages among others
##    user  system elapsed 
##   0.061   0.022   0.338

The coordinate operation used now specifies the grid in the pipeline:

get_last_coordOp()
## [1] "+proj=pipeline +step +inv +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.999601 +x_0=400000 +y_0=-100000 +ellps=airy +step +proj=hgridshift +grids=uk_os_OSTN15_NTv2_OSGBtoETRS.tif +step +proj=unitconvert +xy_in=rad +xy_out=deg"

The coordinate values differ little from the 2m accuracy Helmert pipeline:

print(coordinates(is1m), digits=10)
##         coords.x1  coords.x2
## [1,] -0.136687626 51.5133037

as we can see, the 1m accuracy point is 1.7m from the 2m accuracy point, just to the West:

c(spDists(is2m, is1m)*1000)
## [1] 1.751474
c(maptools::gzAzimuth(coordinates(is1m), coordinates(is2m)))

Now the SQLite grid cache database has been created and has grown in size:

try(file.size(file.path(shpr[1], "cache.db")))
## [1] 319488

If we look in the SQLite database of downloaded grids, we see that the grid components that were downloaded. Here we have not yet used the area of interest to limit the number of chunks involved:

library(RSQLite)
db <- dbConnect(SQLite(), dbname=file.path(shpr[1], "cache.db"))
(tbs <- dbListTables(db))
## [1] "chunk_data"                 "chunks"                    
## [3] "downloaded_file_properties" "linked_chunks"             
## [5] "linked_chunks_head_tail"    "properties"                
## [7] "sqlite_sequence"
if (any("chunks" %in% tbs)) print(dbReadTable(db, "chunks"))
##    id                                                   url  offset data_id
## 1   1 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif       0       1
## 2   2 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2621440       2
## 3   3 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2637824       3
## 4   4 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2654208       4
## 5   5 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2670592       5
## 6   6 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2686976       6
## 7   7 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2703360       7
## 8   8 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2719744       8
## 9   9 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2736128       9
## 10 10 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2752512      10
## 11 11 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2768896      11
## 12 12 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2785280      12
## 13 13 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2801664      13
## 14 14 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2818048      14
## 15 15 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2834432      15
## 16 16 https://cdn.proj.org/uk_os_OSTN15_NTv2_OSGBtoETRS.tif 2850816      16
##    data_size
## 1      16384
## 2      16384
## 3      16384
## 4      16384
## 5      16384
## 6      16384
## 7      16384
## 8      16384
## 9      16384
## 10     16384
## 11     16384
## 12     16384
## 13     16384
## 14     16384
## 15     16384
## 16     16384
dbDisconnect(db)

Finally, we disable grid download to return to the status existing when rgdal was attached; the cached grids in this case, using an R temporary directory, will be discarded, but in usual workflows, grids are downloaded once, used often and updated seldom when the server versions change.

disable_proj_CDN()
is_proj_CDN_enabled()
## [1] FALSE

The outcome positions are shown here; at zoom 18 we can see that the 1m accuracy green point matches the Open Street Map location of the pump very well:

library(mapview)
x <- mapview(is2m, map.type="OpenStreetMap", legend=FALSE) + mapview(is1m, col.regions="green", legend=FALSE) + mapview(isballpark, col.regions="red", legend=FALSE)
mapshot(x, file="snow.png")
knitr::include_graphics(system.file("misc/snow.png", package="rgdal"))

Rebuilding CRS objects

Package authors of the almost 150 packages with sp objects with outdated "CRS" objects stored in *.rda or *RData files as data sets should update the "proj4string" slots of these objects and re-store. The GGHB.IG data set in the CARBayesdata package (version 2.1) can serve as an example.

# library(CARBayesdata)
library(sp)
# data(GGHB.IG)
# orig <- slot(GGHB.IG, "proj4string")
(load(system.file("misc/GGHB.IG_CRS.rda", package="rgdal")))
## [1] "orig"
orig
## Coordinate Reference System:
## Deprecated Proj.4 representation:
##  +proj=tmerc +lat_0=49 +lon_0=-2 +k=0.9996012717 +x_0=400000
## +y_0=-100000 +datum=OSGB36 +units=m +no_defs +ellps=airy
## +towgs84=446.448,-125.157,542.060,0.1502,0.2470,0.8421,-20.4894

The original "CRS" object contains a +datum= key (deprecated in GDAL’s exportToProj4() from GDAL 3) and a +towgs84= key, the handling of which has varied in different releases of PROJ >= 6 and GDAL 3. From rgdal 1.5-17 (development version), a PROJ function proj_get_source_crs is used by default, but may be avoided by setting the sp::CRS() get_source_if_boundcrs= argument to FALSE. The function will return the source CRS of a BOUNDCRS object. In some versions of PROJ/GDAL, the inclusion of a +towgs84= key is taken as meaning that the user wishes to create an object with the given source CRS, but because a +towgs84= key is present, it is assumed that the target is WGS 84 with the given transformation method. Proj4 strings with +towgs84= keys have no authority for the parameters given, and do not harmonise with the use of the EPSG database.

To run the examples, we need the current development version of sp from "rsbivand/sp":

sp_version_ok <- length(grep("get_source_if_boundcrs", deparse(args(sp::CRS)))) > 0L

Running sp::CRS() takes the original Proj4 key-value pairs, and converts them into a WKT2 representation as well as returning an (unused) Proj4 string. This is kept in place for packages still expecting to see/use it, but all rgdal functions and methods use the WKT2 representation. Here, we do not try to handle the BOUNDCRS special case, something that leads to difficulties with some versions of PROJ later in workflows. The BOUNDCRS contains a SOURCECRS and a TARGETCRS (which also has swapped axes), while really all that was needed was the contents of the SOURCECRS:

orig1 <- CRS(slot(orig, "projargs"), get_source_if_boundcrs=FALSE)
cat(wkt(orig1), "\n")
## BOUNDCRS[
##     SOURCECRS[
##         PROJCRS["unknown",
##             BASEGEOGCRS["unknown",
##                 DATUM["OSGB 1936",
##                     ELLIPSOID["Airy 1830",6377563.396,299.3249646,
##                         LENGTHUNIT["metre",1]],
##                     ID["EPSG",6277]],
##                 PRIMEM["Greenwich",0,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8901]]],
##             CONVERSION["unknown",
##                 METHOD["Transverse Mercator",
##                     ID["EPSG",9807]],
##                 PARAMETER["Latitude of natural origin",49,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8801]],
##                 PARAMETER["Longitude of natural origin",-2,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8802]],
##                 PARAMETER["Scale factor at natural origin",0.9996012717,
##                     SCALEUNIT["unity",1],
##                     ID["EPSG",8805]],
##                 PARAMETER["False easting",400000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8806]],
##                 PARAMETER["False northing",-100000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8807]]],
##             CS[Cartesian,2],
##                 AXIS["(E)",east,
##                     ORDER[1],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]],
##                 AXIS["(N)",north,
##                     ORDER[2],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]]]],
##     TARGETCRS[
##         GEOGCRS["WGS 84",
##             DATUM["World Geodetic System 1984",
##                 ELLIPSOID["WGS 84",6378137,298.257223563,
##                     LENGTHUNIT["metre",1]]],
##             PRIMEM["Greenwich",0,
##                 ANGLEUNIT["degree",0.0174532925199433]],
##             CS[ellipsoidal,2],
##                 AXIS["latitude",north,
##                     ORDER[1],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##                 AXIS["longitude",east,
##                     ORDER[2],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##             ID["EPSG",4326]]],
##     ABRIDGEDTRANSFORMATION["Transformation from unknown to WGS84",
##         METHOD["Position Vector transformation (geog2D domain)",
##             ID["EPSG",9606]],
##         PARAMETER["X-axis translation",446.448,
##             ID["EPSG",8605]],
##         PARAMETER["Y-axis translation",-125.157,
##             ID["EPSG",8606]],
##         PARAMETER["Z-axis translation",542.06,
##             ID["EPSG",8607]],
##         PARAMETER["X-axis rotation",0.1502,
##             ID["EPSG",8608]],
##         PARAMETER["Y-axis rotation",0.247,
##             ID["EPSG",8609]],
##         PARAMETER["Z-axis rotation",0.8421,
##             ID["EPSG",8610]],
##         PARAMETER["Scale difference",0.9999795106,
##             ID["EPSG",8611]]]]

Using the default setting to remove the unintended BOUNDCRS representation, we can get what was (probably) intended (when the sp version is as was before this work-around was added, the following two chunks yield BOUNDCRS for some versions of PROJ/GDAL:

orig1a <- CRS(slot(orig, "projargs"))
cat(wkt(orig1a), "\n")
## BOUNDCRS[
##     SOURCECRS[
##         PROJCRS["unknown",
##             BASEGEOGCRS["unknown",
##                 DATUM["OSGB 1936",
##                     ELLIPSOID["Airy 1830",6377563.396,299.3249646,
##                         LENGTHUNIT["metre",1]],
##                     ID["EPSG",6277]],
##                 PRIMEM["Greenwich",0,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8901]]],
##             CONVERSION["unknown",
##                 METHOD["Transverse Mercator",
##                     ID["EPSG",9807]],
##                 PARAMETER["Latitude of natural origin",49,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8801]],
##                 PARAMETER["Longitude of natural origin",-2,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8802]],
##                 PARAMETER["Scale factor at natural origin",0.9996012717,
##                     SCALEUNIT["unity",1],
##                     ID["EPSG",8805]],
##                 PARAMETER["False easting",400000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8806]],
##                 PARAMETER["False northing",-100000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8807]]],
##             CS[Cartesian,2],
##                 AXIS["(E)",east,
##                     ORDER[1],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]],
##                 AXIS["(N)",north,
##                     ORDER[2],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]]]],
##     TARGETCRS[
##         GEOGCRS["WGS 84",
##             DATUM["World Geodetic System 1984",
##                 ELLIPSOID["WGS 84",6378137,298.257223563,
##                     LENGTHUNIT["metre",1]]],
##             PRIMEM["Greenwich",0,
##                 ANGLEUNIT["degree",0.0174532925199433]],
##             CS[ellipsoidal,2],
##                 AXIS["latitude",north,
##                     ORDER[1],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##                 AXIS["longitude",east,
##                     ORDER[2],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##             ID["EPSG",4326]]],
##     ABRIDGEDTRANSFORMATION["Transformation from unknown to WGS84",
##         METHOD["Position Vector transformation (geog2D domain)",
##             ID["EPSG",9606]],
##         PARAMETER["X-axis translation",446.448,
##             ID["EPSG",8605]],
##         PARAMETER["Y-axis translation",-125.157,
##             ID["EPSG",8606]],
##         PARAMETER["Z-axis translation",542.06,
##             ID["EPSG",8607]],
##         PARAMETER["X-axis rotation",0.1502,
##             ID["EPSG",8608]],
##         PARAMETER["Y-axis rotation",0.247,
##             ID["EPSG",8609]],
##         PARAMETER["Z-axis rotation",0.8421,
##             ID["EPSG",8610]],
##         PARAMETER["Scale difference",0.9999795106,
##             ID["EPSG",8611]]]]

The sp::rebuild_CRS() method for "CRS" objects (and thus that for "Spatial" objects) can be used to do the same as using sp::CRS() directly, with special treatment for a number of other objects inheriting from "Spatial":

orig1b <- rebuild_CRS(orig)
cat(wkt(orig1b), "\n")
## BOUNDCRS[
##     SOURCECRS[
##         PROJCRS["unknown",
##             BASEGEOGCRS["unknown",
##                 DATUM["OSGB 1936",
##                     ELLIPSOID["Airy 1830",6377563.396,299.3249646,
##                         LENGTHUNIT["metre",1]],
##                     ID["EPSG",6277]],
##                 PRIMEM["Greenwich",0,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8901]]],
##             CONVERSION["unknown",
##                 METHOD["Transverse Mercator",
##                     ID["EPSG",9807]],
##                 PARAMETER["Latitude of natural origin",49,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8801]],
##                 PARAMETER["Longitude of natural origin",-2,
##                     ANGLEUNIT["degree",0.0174532925199433],
##                     ID["EPSG",8802]],
##                 PARAMETER["Scale factor at natural origin",0.9996012717,
##                     SCALEUNIT["unity",1],
##                     ID["EPSG",8805]],
##                 PARAMETER["False easting",400000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8806]],
##                 PARAMETER["False northing",-100000,
##                     LENGTHUNIT["metre",1],
##                     ID["EPSG",8807]]],
##             CS[Cartesian,2],
##                 AXIS["(E)",east,
##                     ORDER[1],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]],
##                 AXIS["(N)",north,
##                     ORDER[2],
##                     LENGTHUNIT["metre",1,
##                         ID["EPSG",9001]]]]],
##     TARGETCRS[
##         GEOGCRS["WGS 84",
##             DATUM["World Geodetic System 1984",
##                 ELLIPSOID["WGS 84",6378137,298.257223563,
##                     LENGTHUNIT["metre",1]]],
##             PRIMEM["Greenwich",0,
##                 ANGLEUNIT["degree",0.0174532925199433]],
##             CS[ellipsoidal,2],
##                 AXIS["latitude",north,
##                     ORDER[1],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##                 AXIS["longitude",east,
##                     ORDER[2],
##                     ANGLEUNIT["degree",0.0174532925199433]],
##             ID["EPSG",4326]]],
##     ABRIDGEDTRANSFORMATION["Transformation from unknown to WGS84",
##         METHOD["Position Vector transformation (geog2D domain)",
##             ID["EPSG",9606]],
##         PARAMETER["X-axis translation",446.448,
##             ID["EPSG",8605]],
##         PARAMETER["Y-axis translation",-125.157,
##             ID["EPSG",8606]],
##         PARAMETER["Z-axis translation",542.06,
##             ID["EPSG",8607]],
##         PARAMETER["X-axis rotation",0.1502,
##             ID["EPSG",8608]],
##         PARAMETER["Y-axis rotation",0.247,
##             ID["EPSG",8609]],
##         PARAMETER["Z-axis rotation",0.8421,
##             ID["EPSG",8610]],
##         PARAMETER["Scale difference",0.9999795106,
##             ID["EPSG",8611]]]]

In practice, sp::rebuild_CRS() methods may be used, but users should check the output for coherence, and the PROJ function for extracting SOURCECRS from BOUNDCRS avoided if it has unfortunate consequences.

References

Brody, H., M. R. Rip, P. Vinten-Johansen, N. Paneth, and S. Rachman. 2000. “Map-Making and Myth-Making in Broad Street: The London Cholera Epidemic, 1854.” Lancet 356: 64–68.
Evenden, Gerald I. 1990. Cartographic Projection Procedures for the UNIX Environment — a User’s Manual. http://download.osgeo.org/proj/OF90-284.pdf.
Evers, Kristian, and Thomas Knudsen. 2017. Transformation Pipelines for PROJ.4. https://www.fig.net/resources/proceedings/fig_proceedings/fig2017/papers/iss6b/ISS6B_evers_knudsen_9156.pdf.
Iliffe, Jonathan, and Roger Lott. 2008. Datums and Map Projections: For Remote Sensing, GIS and Surveying. Boca Raton: CRC.
ISO. 2019. ISO 19111:2019 Geographic Information – Referencing by Coordinates. https://www.iso.org/standard/74039.html.
Knudsen, Thomas, and Kristian Evers. 2017. Transformation Pipelines for PROJ.4. https://meetingorganizer.copernicus.org/EGU2017/EGU2017-8050.pdf.