gdphelper.gdpcleaner

Module Contents

Functions

gdpcleaner(gdpdata: pandas.DataFrame)

Author: Gabe Fairbrother

gdphelper.gdpcleaner.gdpcleaner(gdpdata: pandas.DataFrame)[source]

Author: Gabe Fairbrother

Remove spurious columns, Rename relevant columns, Remove NaNs

Parameters
  • gdpdata (DataFrame) –

  • (https (a loaded dataframe based on a downloaded Open Government GDP at basic prices dataset) –

Returns

  • DataFrame (A cleaned and simplified DataFrame of the relevant columns for summary and visualization.)

  • Possible columns (dataset dependent) include – Date: Date of data Location: Province or Jurisdiction Scale: Scale of the Value column (Percent, Millions, etc) Unit: Unit of Measure Value: Portion of the GDP for the Location and Date NAICS_Class: North American Industry Classification System ID Industry: Industry of Record Sub-sector: Non-profit sub-sector Special_Industry: Special Industry Aggregate

Examples

>>> result = gdpcleaner(example_data)