
ColOpenData is a package designed to access curated and wrangled Colombian demographic, geospatial and climate data, retrieved from various open Colombian data sources. The package addresses the challenge of scattered Colombian data across multiple web sources by providing functions that enable users to select and load desired datasets without the need for extensive data acquisition processes. Additionally, the tidy data structure offered for demographic and climate data facilitates analysis and visualization.

ColOpenData is developed at Universidad de Los Andes as part of the Epiverse-TRACE program.

Installation

You can install the development version of ColOpenData from GitHub with:

# install.packages("pak")
pak::pak("epiverse-trace/ColOpenData")
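As a minimal sketch, assuming the installation step above has completed, the following base R snippet checks that the package can be loaded and lists whatever it exports. It deliberately avoids naming any ColOpenData function, since the exported names depend on the installed version.

```r
# Check that ColOpenData is installed before trying to use it.
if (requireNamespace("ColOpenData", quietly = TRUE)) {
  library(ColOpenData)

  # Report the installed version.
  print(packageVersion("ColOpenData"))

  # List the exported functions and objects; the names shown depend on
  # the version that was installed.
  print(sort(getNamespaceExports("ColOpenData")))
} else {
  message("ColOpenData is not installed; run the pak::pak() call first.")
}
```

From there, `help(package = "ColOpenData")` opens the package index with the documentation for each exported function.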

Quick Overview

ColOpenData contains data from two public data sources: the National Administrative Department of Statistics (DANE) and the Institute of Hydrology, Meteorology and Environmental Studies (IDEAM). The available data is divided into three categories:

  • Demographic: Demographic and socioeconomic data comes from the 2018 National Population and Dwelling Census (CNPV) and from Population Projections. The CNPV is the most recent census available to date, and its information is organized around three questions: How many are we? Where are we? How do we live? Further information can be consulted here. The Population Projections data contains projections calculated from the 2018 CNPV results, including the post-COVID-19 update. Further information can be consulted here.

  • Geospatial: This data is retrieved from the National Geostatistical Framework (MGN), which includes maps and a summarized version of the 2018 census, aggregated to spatial geometries. The data is available at several aggregation levels: blocks, urban and rural sections, urban and rural sectors, urban areas, municipalities and departments. More information is available here.

  • Climate: Climate data is retrieved from backup information provided by IDEAM, containing historical data from the first station in the country until May 31, 2023. This backup includes temperature, precipitation, sunshine duration and wind direction, among other variables. More information can be found here.

Documentation for the three modules is available in the user vignettes.

Similar R packages are offered for international communities, allowing the user to download census, geospatial and climate data.

Lifecycle

This package is currently experimental, as defined by the RECON software lifecycle. It is a functional draft that can be tested outside the development team, but it may still change over time.

Contributions

Contributions are welcome via pull requests.

Code of Conduct

Please note that the ColOpenData project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.