Pipeline objectives
Data preparation
- obtaining programmatic (API) access to external data sources that provide indicators of socio-economic and spatial development
- downloading and parsing raw data
- confirming the completeness of primary data from sources (open data, official statistics, Federal Tax & Customs services, budget systems)
- maintaining reference directories (indicators, analytic dimensions, scenarios, versions, processing stages), source and target data models, and mapping tables, taking into account changes over time
- collecting data on investment projects from external sources
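
The completeness check on primary data mentioned above could be sketched roughly as follows. This is only an illustration under stated assumptions: the record layout (`indicator`, `territory`, `year`, `value`) and the function name `find_missing` are hypothetical and do not come from the pipeline itself.

```python
from itertools import product


def find_missing(records, indicators, territories, years):
    """Return the (indicator, territory, year) keys that have no usable value.

    `records` is an iterable of dicts with the hypothetical keys
    'indicator', 'territory', 'year', and 'value'.
    """
    # Keys for which a non-empty value was actually received.
    present = {
        (r["indicator"], r["territory"], r["year"])
        for r in records
        if r.get("value") is not None
    }
    # Every combination we expect the source to cover.
    expected = product(indicators, territories, years)
    return sorted(key for key in expected if key not in present)


# Made-up sample data, for illustration only.
records = [
    {"indicator": "population", "territory": "A", "year": 2020, "value": 100.0},
    {"indicator": "population", "territory": "A", "year": 2021, "value": None},
    {"indicator": "population", "territory": "B", "year": 2020, "value": 50.0},
]
missing = find_missing(records, ["population"], ["A", "B"], [2020, 2021])
```

Here a record with an empty value is treated the same as an absent record; a real check might report the two cases separately.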
Data processing
- elimination of technical errors in primary data, including errors in field names and value formats, shifts in data series, and gaps in analytical data slices
- aggregation of downloaded primary data and conversion to a reference structure
- detection of changes in primary data, including retrospective changes, in the data structure, directories, and actual values
- filling of gaps and elimination of duplicate data
- validation of indicator values and proposed adjustments using econometric methods, according to a set of rules, including:
  - satisfying balance ratios
  - falling within acceptable intervals
  - satisfying the ratios of the principal components (eigenvectors)
- normalization of the project database by implementing a unified system of indicators and characteristics
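
Two of the validation rules listed above, balance ratios and acceptable intervals, can be illustrated with a small sketch. The row layout (`total`, `components`, `share`), the tolerance, and the function names are assumptions made for the example; the actual rule set is not specified here.

```python
import math


def within_interval(value, low, high):
    """True if the value lies in the acceptable interval [low, high]."""
    return low <= value <= high


def balance_holds(total, components, rel_tol=1e-6):
    """True if the components sum to the total within a relative tolerance."""
    return math.isclose(total, sum(components), rel_tol=rel_tol)


def validate(row):
    """Return a list of human-readable rule violations for one row."""
    problems = []
    if not balance_holds(row["total"], row["components"]):
        problems.append("balance: components do not sum to total")
    if not within_interval(row["share"], 0.0, 1.0):
        problems.append("interval: share outside [0, 1]")
    return problems


# Made-up rows, for illustration only.
good = {"total": 100.0, "components": [60.0, 40.0], "share": 0.4}
bad = {"total": 100.0, "components": [60.0, 30.0], "share": 1.2}
good_problems = validate(good)
bad_problems = validate(bad)
```

Returning a list of violations rather than a single boolean keeps every failed rule visible, which may be useful when proposing adjustments.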
Model calibration
- calibration of models of the dynamics of the territory's macroeconomic indicators
- calculation of eigenvectors (7 items) of socio-economic development for the territory
- calculation of transition matrices from initial and target macroeconomic data to eigenvectors
- calculation of correlation matrices of influence (sensitivity of changes in indicators to each other and to external factors)
- calculation of intersectoral balance models (multiplier matrices) by detailing macroeconomic data at the industry level
- calibration of interterritorial balance models (stress matrices) by estimating commuting, passenger traffic, and cargo traffic
- calibration of supply and demand models across a range of products
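
As an illustration of the intersectoral balance item above, the sketch below computes a multiplier matrix under the assumption that it is the Leontief inverse, (I - A)^-1, of a matrix A of technical coefficients. That interpretation, the function names, and the sample coefficients are all assumptions; the text does not define how the multiplier matrices are built.

```python
def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Augment each row with the matching row of the identity matrix.
    aug = [
        [float(x) for x in row] + [1.0 if i == j else 0.0 for j in range(n)]
        for i, row in enumerate(matrix)
    ]
    for col in range(n):
        # Pick the row with the largest absolute entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def multiplier_matrix(coefficients):
    """Leontief inverse (I - A)^-1 for a technical coefficient matrix A."""
    n = len(coefficients)
    i_minus_a = [
        [(1.0 if i == j else 0.0) - coefficients[i][j] for j in range(n)]
        for i in range(n)
    ]
    return invert(i_minus_a)


# Made-up two-sector coefficients, for illustration only.
A = [[0.2, 0.3],
     [0.4, 0.1]]
L = multiplier_matrix(A)
```

For these sample coefficients the result is approximately [[1.5, 0.5], [0.667, 1.333]]. In practice a linear algebra library would normally replace the hand-written inversion.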
Analysis, evaluation, and forecasting
- scenario forecasting for macroeconomic indicators of territories
- sectoral forecasting by territory
- calculation of summary indicators (efficiency, reliability, safety, sustainability) for individual territories, and of median values for a sample of territories
- calculation of the development potential of the territory
- calculation of the deviation of the actual dynamics of socio-economic development indicators from the target dynamics (determined by strategies and national goals)
- calculation of the impact of the projects in the project database on the socio-economic development of territories
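
The deviation of actual from target dynamics, and a median across a sample of territories, might be computed along these lines. The relative deviation formula, the territory names, and the values are assumptions chosen for the example, not the pipeline's defined methodology.

```python
from statistics import median


def relative_deviation(actual, target):
    """Per-period relative deviation of actual values from target values.

    Assumes both series cover the same periods and no target is zero.
    """
    return [(a - t) / t for a, t in zip(actual, target)]


# Made-up series for three hypothetical territories.
actual = {"A": [100.0, 99.0], "B": [50.0, 49.0], "C": [80.0, 88.0]}
target = {"A": [100.0, 100.0], "B": [50.0, 50.0], "C": [80.0, 80.0]}

# Deviation in the latest period for each territory.
latest = {
    name: relative_deviation(actual[name], target[name])[-1]
    for name in actual
}
# Median deviation across the sample of territories.
median_deviation = median(latest.values())
```

With these sample values the latest deviations are -1%, -2%, and +10%, so the median is -1%.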
Planning
- determining the magnitude and rhythm of the necessary impact to achieve target indicators for a given vector of regulated indicators
- determining the impact of the program on the territory to achieve the target trajectory, taking into account the specified restrictions
- calculation of a comprehensive plan (by industries, spheres, and territories), taking into account the size of the reproduced resource
Data provision
- exporting data sets and master data in csv, xlsx, parquet, qs, and fst formats
- loading data into the data warehouse for access via API
- providing an interactive dashboard for indicator sets
- providing matrices for evaluating effectiveness
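
A minimal sketch of the csv export, using only the Python standard library, is shown below. The column names and the helper `to_csv` are hypothetical; the other listed formats would need additional libraries and are not covered here.

```python
import csv
import io


def to_csv(rows, fieldnames):
    """Serialize a list of dict rows to csv text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up row, for illustration only.
rows = [
    {"indicator": "population", "territory": "A", "year": 2020, "value": 100.0},
]
text = to_csv(rows, ["indicator", "territory", "year", "value"])
```

Writing to an in-memory buffer keeps the example self-contained; the same writer works on a file opened with `newline=""`.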
Documentation of data collection, processing, and provision processes
- providing scenarios (actual data, assessment, forecast, plan, scenario, and goal), stages (initial, amended, corrected, stage number), versions, and methods of accounting for indicator values, all of which are necessary for the correct interpretation and tracing of indicator values
- maintaining a library of methods and rules for the verification and validation of indicator values
- preparing interactive reports on data volume, completeness, and identified errors
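
One way to carry the scenario, stage, version, and accounting method alongside each indicator value is a small record type, sketched below. The scenario names come from the list above; the class names, field names, and the `latest_version` helper are assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class Scenario(Enum):
    ACTUAL = "actual data"
    ASSESSMENT = "assessment"
    FORECAST = "forecast"
    PLAN = "plan"
    SCENARIO = "scenario"
    GOAL = "goal"


@dataclass(frozen=True)
class IndicatorValue:
    indicator: str
    territory: str
    period: int
    value: float
    scenario: Scenario
    stage: str    # e.g. "initial", "amended", "corrected"
    version: int
    method: str   # how the value was accounted for (hypothetical free text)


def latest_version(values):
    """Return the record with the highest version number."""
    return max(values, key=lambda v: v.version)


# Made-up records, for illustration only.
history = [
    IndicatorValue("population", "A", 2020, 100.0,
                   Scenario.ACTUAL, "initial", 1, "as published"),
    IndicatorValue("population", "A", 2020, 101.5,
                   Scenario.ACTUAL, "corrected", 2, "as published"),
]
current = latest_version(history)
```

Keeping earlier versions in `history` instead of overwriting them is what makes a value traceable back to its initial stage.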