Model Data Variable Descriptions #19

Open
cpritcha opened this issue Mar 7, 2019 · 15 comments

@cpritcha
Collaborator

cpritcha commented Mar 7, 2019

from https://github.com/sostenibilidad-unam/SHV/tree/master/ABM_Rversion/MEGADAPT_APP/system_models/flooding_ponding/encharcamientos

  • f_prec_v: precipitation volume.
  • f_esc: volume of runoff.
  • n_tramos: number of drainage sections.
  • q100: eviction capacity.
  • bombeo_tot: number of pumping plants.
  • rejillas: number of grids.

What are the units of f_prec_v, f_esc, q100 etc? @abaezacastro @serranoycandela?

cpritcha changed the title from "Flood ponding models" to "Flood ponding models variable units" on Mar 7, 2019
@cpritcha
Collaborator Author

cpritcha commented Mar 7, 2019

I created a wiki page https://github.com/comses/megadapt/wiki/Model-Units to put all the unit information in one place.

If we know what the units are, we can come up with more descriptive variable names.

@alee
Member

alee commented Mar 7, 2019

Is this a good time to test out the CSDMS standard names?

@cpritcha
Collaborator Author

cpritcha commented Mar 7, 2019

That would be good as documentation but the tilde and hyphen characters cannot be used as R identifiers (they'll get converted to periods in a dataframe using make.names). We can definitely create a mapping between our identifiers and the standardized names.
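
A minimal sketch of both points, assuming nothing beyond base R. The strings below are illustrative only: they are not taken from the CSDMS registry, and `example_var` is a hypothetical identifier, not one of our columns.

```r
# Illustrative strings only; not actual CSDMS standard names.
standard_like <- c("sat-zone", "object~qualifier")

# make.names() replaces characters that are invalid in R identifiers with "."
make.names(standard_like)

# One way to keep both: a named character vector that maps a valid
# R identifier (hypothetical) to its standardized-style name (hypothetical).
name_map <- c(
  example_var = "hypothetical_object~qualifier_sat-zone__quantity"
)

# Look up the documentation name for an identifier used in code
name_map[["example_var"]]
```

Keeping the mapping in one place would let the data frames keep valid R column names while the documentation cites the standardized names.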

@abaezacastro
Contributor

We will provide the final table of variables with units and meaning by tomorrow.

@abaezacastro
Contributor

@cpritcha what do you mean by "but the tilde and hyphen characters cannot be used as R identifiers"? Do we need to change the name of the variables?

@cpritcha
Collaborator Author

CSDMS has a variable naming convention that sometimes uses tildes and hyphens. For example, https://csdms.colorado.edu/wiki/CSN_Quantity_Templates#Elevation has sat-zone in its name.

cpritcha changed the title from "Flood ponding models variable units" to "Model Data Variable Descriptions" on Mar 28, 2019
@abaezacastro
Contributor

abaezacastro commented Mar 28, 2019

| Name | Column short name | New name proposed | Description |
| --- | --- | --- | --- |
| abastecimi | abastecimi | watersupply | Represents the population of each census block that does not have access to piped water in their houses |
| agebs_cdmx | ageb_id, cvgeo, area | censusblock_id | A universal identifier of each census block that includes the municipality and the state |
| agebs_presion_hidraulica | pres_hid | waterpressure | Estimate of the water pressure in pipes as a function of altitude and distance to the Cutzamala system |
| antiguedad | antiguedad | age_infra | Proxy for the age of infrastructure based on the sequence of urbanization periods in CDMX |
| basura | basura | garbage | Proxy for the amount of garbage produced in each census block |
| bombeo | bombeo_tot | pumping | Number of pumps to direct stormwater in the sewer system |
| calidad_agua | cal_agua | waterquality | Reports about the quality of the water per census block |
| crec_urb | crec_urb | urbangrowth | Proportion of urbanized area in a census block |
| critical_zones | critic_z | criticalzone | Areas of the city SACMEX considers critical because of the difficulties of providing potable water |
| desp_agua | desp_agua | wastewate | Perception of residents about exportation of potable water |
| desv_agua | desv_agua | waterdeviation | Perception of residents about net potable water exports |
| encharcamientos | prom_en | ponding | Frequency of ponding events in a year |
| fail_claim | fail_claim | fail_claim | Number of reports of failures in the distribution of potable water |
| falla_distribucion | falla_dist | failure_dist | Number of reports collected by SACMEX about leakages in the system and failures in the distribution of potable water |
| falla_drenaje | falla_dren | failure_dren | Number of reports collected by SACMEX about failures in the sewer system |
| falta_dist | falta_dist | lack_dist | Proportion of households without connection to potable water distribution infrastructure |
| falta_dren | falta_dren | lack_dren | Proportion of households without connection to the sewer system |
| health | enf_14 | diarrhea | Number of cases of diarrheal diseases per 10,000 individuals in the year 2014 |
| income_pc | income_pc | income_pc | Income per capita in dollars (by municipality) |
| ingreso | ingreso | income_index | Resident asset index (by census block) |
| inundaciones | inunda | flooding | Frequency of flooding events in a year |
| lambdas | lambdas | days_no_water | Average number of days in a week a census block is without water. Data from INEGI |
| n_tramos | n_tramos | segments | Number of segments of the sewer system in each census block |
| peticiones_delegaciones_dren | pet_del_dr | delegation_pressure | Proxy for the attention the government gives to different delegations |
| poptot_10 | poblacion | population | Total population per census block |
| pop_growth | tc_pob | pop_growth | Rate of population change per census block |
| pres_med | pres_med | media | Proxy for media reports about water scarcity or flooding events |
| q_100 | q100 | capacity_dren | A proxy for the total capacity of the sewer system |
| regiones | region | region | Clusters for which the flooding and ponding models were fitted |
| rejillas | rejillas | drainages | Number of drainages per census block |
| subsidencia | subsidenci | subsidence | Rate of subsidence per year |
| tanks | tanks | tanks | Proportion of houses with water storage tanks |
| wo_water | wo_water | wo_water | |

@cpritcha
Collaborator Author

cpritcha commented Mar 29, 2019

What's the difference between capacity_dren and drainages? From looking at the UNAM repository, the metadata says drainages is the number of grids. What does that mean? @abaezacastro

@abaezacastro
Contributor

capacity_dren is the capacity of the pipes of the combined storm and waste-water system. Drainages are the places of the entrance of the stormwaters to the system.

@cpritcha
Collaborator Author

cpritcha commented Apr 5, 2019

Thanks @abaezacastro. Do you know anything about the climate data units?
Here is what I know.

| name | description | units |
| --- | --- | --- |
| prec | total precipitation in a year | volume |
| runoff | total runoff in a year. It should be the difference between prec and infiltration | volume |

I'm not sure yet about the unit of volume.

@cpritcha
Collaborator Author

cpritcha commented Apr 6, 2019

So volume isn't being normalized by area? From what I can tell, larger census blocks do seem to have larger precipitation values.

library(readr)
library(dplyr)
library(ggplot2)
climate <- read_csv('../../../data/climate_landuse_scenarios/df_prec_escorrentias_excl_0_ff45.csv')
# areaPolygon() returns the area of each census block polygon in square meters
study_area@data$area <- geosphere::areaPolygon(study_area)
climate_area <- climate %>% inner_join(study_area@data %>% select(ageb_id, area)) %>% mutate(prec_per_area = prec / area)
ggplot(climate_area, aes(x=prec, y=area)) + geom_point()
ggplot(climate_area, aes(x=prec_per_area)) + geom_density()

@seanbergin
Contributor

We had talked about displaying data based on the 9 municipality units. I'm unable to identify which column holds the municipality. The documentation above implies that the municipality number is included in the ageb_id/census_block, e.g. 0900700014096. Is it the digits between 090 and 14096? Can we map the municipality numbers to the actual names?

@abaezacastro
Contributor

@seanbergin

The first two digits are the state and the following three digits are the municipality. Those are the numbers you need. So, in the example you provided, 0900700014096:
09: State (all the polygons should have IDs starting with 09)
007: Municipality
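
The split described above can be sketched in base R as follows. It assumes the ID is stored as a character string; the variable names are only for illustration.

```r
# Example ID from this thread, kept as a character string so the
# leading zero of the state code is not dropped.
ageb_id <- "0900700014096"

# Characters 1-2 are the state, characters 3-5 are the municipality
state_code <- substr(ageb_id, 1, 2)
municipality_code <- substr(ageb_id, 3, 5)

state_code
municipality_code
```

If the column were read as a number, the leading zero would be lost and the character positions would shift by one, so reading it as text is the safer choice.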

@abaezacastro
Contributor

| Delegation | ID |
| --- | --- |
| Azcapotzalco | 9002 |
| Coyoacán | 9003 |
| Cuajimalpa de Morelos | 9004 |
| Gustavo A. Madero | 9005 |
| Iztacalco | 9006 |
| Iztapalapa | 9007 |
| La Magdalena Contreras | 9008 |
| Milpa Alta | 9009 |
| Álvaro Obregón | 9010 |
| Tláhuac | 9011 |
| Tlalpan | 9012 |
| Xochimilco | 9013 |
| Benito Juárez | 9014 |
| Cuauhtémoc | 9015 |
| Miguel Hidalgo | 9016 |
| Venustiano Carranza | 9017 |
@cpritcha
Collaborator Author

cpritcha commented May 20, 2019

Proposed Name Changes - Source Data

| Old Name | New Name |
| --- | --- |
| abastecimi | resident_potable_water_lacking_count |
| ageb_id | censusblock_id |
| antiguedad | infrastructure_age |
| area | area |
| basura | garbage_index |
| bombeo | sewer_system_pump_count |
| cal_agua | waterquality_index |
| crec_urb | urbangrowth |
| critic_z | criticalzone |
| cvgeo | geographic_id |
| desp_agua | resident_potable_water_waste_perception |
| desv_agua | resident_potable_water_export_perception |
| enf_14 | resident_diarrhea_per_capita |
| fail_claim | resident_reports_potable_water_failure_count_per_year |
| falla_dren | resident_reports_sewer_failure_count_per_year |
| falla_dist | |
| falta_dist | household_potable_system_lacking_percent |
| falta_dren | household_sewer_system_lacking_percent |
| income_pc | resident_income_per_capita |
| ingreso | resident_asset_index |
| inunda | resident_reports_flooding_per_year |
| lambdas | household_days_no_potable_water_per_week_mean |
| n_tramos | sewer_system_segment_count |
| pet_del_dr | delegation_social_pressure |
| poblacion | resident_count |
| pres_hid | potable_system_pressure |
| pres_med | media_social_pressure |
| prom_en | resident_reports_ponding_per_year |
| q100 | sewer_system_capacity_max |
| region | --delete-- |
| rejillas | sewer_system_storm_drain_count |
| runoff_bin | runoff_presence |
| subsidenci | subsidence_rate_per_year |
| tanks | household_water_storage_tank_available_percent |
| wo_water | resident_reports_potable_water_failure_count_per_area_year |
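
One possible way to apply a rename table like this is sketched below in base R. Only four of the pairs above are included, `rename_columns` is a hypothetical helper, and the data frame values are placeholders rather than real data.

```r
# A few old -> new pairs from the table above
rename_map <- c(
  ageb_id    = "censusblock_id",
  antiguedad = "infrastructure_age",
  poblacion  = "resident_count",
  q100       = "sewer_system_capacity_max"
)

# Hypothetical helper: rename only the columns present in the map,
# leaving all other columns untouched.
rename_columns <- function(df, rename_map) {
  hits <- names(df) %in% names(rename_map)
  names(df)[hits] <- unname(rename_map[names(df)[hits]])
  df
}

# Placeholder data frame; the values are made up for illustration
df <- data.frame(ageb_id = "0900700014096", q100 = 1.5, area = 100,
                 stringsAsFactors = FALSE)
renamed <- rename_columns(df, rename_map)
names(renamed)
```

Keeping the old-to-new pairs in a single vector means the same mapping can be reused for every source file and for documentation.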
