Euro-SDMX Metadata Structure (ESMS)
Contact | |||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Contact organisation | National Statistical Institute | ||||||||||||||||||||
Contact organisation unit | Statistics on Living Conditions Department, Demographic and Social Statistics Directorate
| ||||||||||||||||||||
Contact name | Desislava Dimitrova, PhD | ||||||||||||||||||||
Contact person function | head of department | ||||||||||||||||||||
Contact mail address | 2 P.Volov street, 1038 Sofia | ||||||||||||||||||||
Contact email address | |||||||||||||||||||||
Contact phone number | +359 2 9857 183 | ||||||||||||||||||||
Contact fax number | |||||||||||||||||||||
Metadata update | |||||||||||||||||||||
Metadata last certified | 29 April 2024 | ||||||||||||||||||||
Metadata last posted | 29 April 2024 | ||||||||||||||||||||
Metadata last update | 29 April 2024 | ||||||||||||||||||||
Statistical presentation | |||||||||||||||||||||
Data description | The Survey on Income and Living Conditions (SILC) is a tool for providing timely and comparable data on income distribution and on the level and structure of poverty and social exclusion. The survey is carried out according to a common European methodology and provides information on the current state (cross-sectional data) and on changes over time (longitudinal data) in income level and in the structure of poverty and social exclusion. EU-SILC provides four basic files containing target variables based on common concepts and definitions. Annual data for the countries contain the following components:
• Household register (D-file)
• Personal register (R-file)
• Household data (H-file)
• Personal data of people aged 16 and more (P-file)
Each year additional data on the household and household members are collected on specific topics, the so-called ad-hoc modules.
The indicators on poverty and social inclusion are calculated on the basis of the survey "Statistics on income and living conditions" and a common methodology, approved by Eurostat, for data collection, derivation of target variables and calculation of common indicators. The poverty rate is the share of households that are below the poverty line, which is defined as 60% of the median equivalised disposable income.
| ||||||||||||||||||||
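The poverty line definition in the data description can be illustrated with a short Python sketch. It is a simplified, unweighted illustration and not the NSI or Eurostat production algorithm, which applies survey weights. The function names and the income figures are hypothetical.

```python
from statistics import median


def poverty_line(equivalised_incomes, share=0.60):
    """Poverty line as a share (60% by default) of the median equivalised income."""
    return share * median(equivalised_incomes)


def poverty_rate(equivalised_incomes, share=0.60):
    """Share of units whose equivalised income is below the poverty line.

    Assumption: every unit counts equally (no survey weights).
    """
    line = poverty_line(equivalised_incomes, share)
    below = sum(1 for income in equivalised_incomes if income < line)
    return below / len(equivalised_incomes)


# Made-up equivalised disposable incomes, for illustration only.
incomes = [4000, 7000, 10000, 12000, 20000]
line = poverty_line(incomes)  # 60% of the median (10000)
rate = poverty_rate(incomes)  # one unit out of five is below the line
```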
Classification system |
| ||||||||||||||||||||
Sector coverage | Data refer to all private households and individuals living in private households in the national territory at the time of data collection. The EU-SILC survey is a key instrument for providing the information required by the European Semester and the European Pillar of Social Rights, in particular on income distribution, poverty and social exclusion, as well as for various related EU policies on living conditions and poverty, such as those on child poverty, access to health care and other services, housing, over-indebtedness and quality of life. It is also the main source of data for microsimulation purposes and for flash estimates of income distribution and poverty rates. The following social fields are included in the survey methodology:
| ||||||||||||||||||||
Statistical concepts and definitions | Total household income: Two main concepts for total household income are applied:
Total household gross income (HY010) is computed as the sum for all household members of gross personal income components:
Total disposable household income (HY020) is computed as total household gross income (HY010) reduced by:
Household definition: A household is two or more persons living in one dwelling or part of a dwelling, sharing a common budget and eating together. A household is also a single person living in a dwelling, a room or part of a dwelling, who has a separate budget for meals and for other needs.
Equivalence scale: For the calculation of poverty and social inclusion indicators, total disposable household income is "equivalised". Because households differ in size and composition, equivalence scales are applied. The modified OECD scale is used, which gives a weight of 1.0 to the first person aged 14 or more, a weight of 0.5 to other persons aged 14 or more and a weight of 0.3 to persons aged 0-13. The weights assigned to the members of a household are summed to obtain the equivalent household size. The total disposable net income of each household is divided by its equivalent size to give the total disposable net income per equivalent unit. | ||||||||||||||||||||
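As a minimal sketch of the modified OECD scale described above, the following Python functions compute the equivalent household size and the income per equivalent unit. The function names and the example household are hypothetical. The handling of a household with no member aged 14 or more is an assumption, since the text does not cover that case.

```python
def equivalised_size(ages):
    """Equivalent household size under the modified OECD scale.

    1.0 for the first person aged 14 or more, 0.5 for each other person
    aged 14 or more, 0.3 for each person aged 0-13.
    """
    if not ages:
        raise ValueError("a household needs at least one member")
    adults = sum(1 for age in ages if age >= 14)
    children = len(ages) - adults
    # Assumption: with no member aged 14+, only the 0.3 weights are summed.
    adult_part = 1.0 + 0.5 * (adults - 1) if adults else 0.0
    return adult_part + 0.3 * children


def equivalised_income(disposable_income, ages):
    """Total disposable household income per equivalent unit."""
    return disposable_income / equivalised_size(ages)


# Hypothetical household: two adults, a 15-year-old and an 8-year-old.
ages = [40, 38, 15, 8]
size = equivalised_size(ages)              # 1.0 + 0.5 + 0.5 + 0.3
income = equivalised_income(23000, ages)   # 23000 divided by that size
```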
Statistical unit | Units of observation are households and household members. | ||||||||||||||||||||
Statistical population | The BG-SILC target population consists of all private households and their current members residing in the country. Persons living in collective households and in institutions are generally excluded from the target population. | ||||||||||||||||||||
Reference area | Entire territory of Republic of Bulgaria | ||||||||||||||||||||
Time coverage | 2006 - 2023 | ||||||||||||||||||||
Base period | Not applicable. | ||||||||||||||||||||
Unit of measure | |||||||||||||||||||||
BGN, euro, percent (%), number of persons | |||||||||||||||||||||
Reference period | |||||||||||||||||||||
BG-SILC uses the following reference periods for the different variables included in the survey:
The income reference period is the previous calendar year;
| |||||||||||||||||||||
Institutional mandate | |||||||||||||||||||||
Legal acts and other agreements | Basic regulations
Implementing regulations
| ||||||||||||||||||||
Data sharing | Not applicable. | ||||||||||||||||||||
Confidentiality | |||||||||||||||||||||
Confidentiality - policy |
| ||||||||||||||||||||
Confidentiality - data treatment | According to Art. 25 of the Statistics Act, individual data are not published (they are suppressed). Dissemination of individual data is possible only in accordance with Art. 26 of the Statistics Act. | ||||||||||||||||||||
Release policy | |||||||||||||||||||||
Release calendar | Statistical information is published according to the Release Calendar presenting the results of the statistical surveys carried out by the National Statistical Institute. | ||||||||||||||||||||
Release calendar access | The Calendar is available on the NSI's website: https://www.nsi.bg/en/node/480 | ||||||||||||||||||||
User access | The data on income and living conditions (EU-SILC) are published on the NSI website, in the section Social Inclusion and Living Conditions, in accordance with the Law on Statistics (Chapter 5) and the European Statistics Code of Practice, respecting professional independence and in an objective, professional and transparent manner in which all users are treated equitably. | ||||||||||||||||||||
Frequency of dissemination | |||||||||||||||||||||
Annual | |||||||||||||||||||||
Accessibility and clarity | |||||||||||||||||||||
News release | Poverty and Social Inclusion Indicators. | ||||||||||||||||||||
Publications | Not applicable. | ||||||||||||||||||||
On-line database | Detailed results are available to all users of the NSI website under the heading Social Inclusion and Living Conditions - Poverty and Social Inclusion Indicators: https://www.nsi.bg/en/node/8292 and INFOSTAT
| ||||||||||||||||||||
Micro-data access | Anonymised individual data can be made available for scientific research purposes upon individual request, in accordance with the Rules for the provision of anonymised individual data for scientific and research purposes. | ||||||||||||||||||||
Other | Information service on request, according to the Rules for the dissemination of statistical products and services of NSI. | ||||||||||||||||||||
Documentation on methodology |
Detailed information about the list of social inclusion indicators, their definitions and the algorithms for their calculation at European level can be found on the following site: | ||||||||||||||||||||
Quality documentation | National Quality Report. | ||||||||||||||||||||
Quality management | |||||||||||||||||||||
Quality assurance | The Survey on Income and Living Conditions (SILC) is an annual survey implemented in the framework of Regulation (EU) 2019/1700, which defines the scope, definitions, time coverage, characteristics of the data, sample size, publication and access to data. The National Statistical Institute is certified according to ISO 9001. In practical terms for the EU-SILC survey, this means:
| ||||||||||||||||||||
Quality assessment | Data are accompanied by quality reports analysing the accuracy, coherence and comparability of the data. The quality of the BG-SILC survey can be assumed to be high. Its concepts and methodology have been developed according to European and international standards, using best practices from all EU Member States. BG-SILC indicators are considered to be sufficiently accurate for the practical purposes to which they are put. The indicators are disseminated following a predetermined Release calendar. Further work is ongoing to improve the quality and in particular the comparability of the indicators. Key priorities are greater harmonisation of methods for quality adjustment and sampling. Internal and external ISO 9001 audits are carried out yearly for the whole department | ||||||||||||||||||||
Relevance | |||||||||||||||||||||
User needs | The main users of BG-SILC are:
| ||||||||||||||||||||
User satisfaction | Not applicable. | ||||||||||||||||||||
Completeness | SILC covers only people living in private households (all persons aged 16 and over within the household are eligible for the operation), i.e. persons living in collective households and in institutions are generally excluded from the target population. | ||||||||||||||||||||
Accuracy and reliability | |||||||||||||||||||||
Overall accuracy | As with any other statistical survey, SILC may be affected by sampling errors and by other errors, such as those arising from the inability to interview some of the units in the sample and those occurring at the stages of data recording, data processing, etc. In terms of precision requirements, the representativeness of the sample and the effective sample size must be achieved. The effective sample size combines the sample size and the sampling design effect, which depends on the sampling design, the population structure and the non-response rate. Regulation (EU) 2019/1700 defines the minimum effective sample sizes to be achieved to compensate for all kinds of non-response. Precision requirements for all data sets are expressed in standard errors and are defined as continuous functions of the actual estimates and of the size of the statistical population in a country or in a NUTS 2 region. The estimated standard error of a particular estimate shall not be bigger than the following amount:
| ||||||||||||||||||||
Sampling error | Computations of standard errors were carried out using SAS programs for the SILC Quality Reports and Complex Sample analysis in IBM SPSS ver.27. | ||||||||||||||||||||
Non-sampling error | Non-sampling errors are basically of 4 types:
| ||||||||||||||||||||
Timeliness and punctuality | |||||||||||||||||||||
Timeliness | SILC cross-sectional and longitudinal data are available in the form of tables 10 months after the end of the data collection period. | ||||||||||||||||||||
Punctuality | Not applicable. | ||||||||||||||||||||
Coherence and comparability | |||||||||||||||||||||
The coherence of two or more statistical outputs refers to the degree to which the statistical processes, by which they were generated, used the same concepts and harmonised methods. A comparison with external sources for all income target variables and the number of persons who receive income from each ‘income component’ will be provided, where the Member States concerned consider such external data to be sufficiently reliable. | |||||||||||||||||||||
Comparability - geographical | Comparability across EU Member States is considered high due to use of harmonised concepts, variables, definitions and classifications. Comparability between different regions of the country is considered high. | ||||||||||||||||||||
Comparability - over time | In Bulgaria there were no breaks in series or significant changes in 2022. A number of income measures were implemented during the year which could be explained by taking into consideration the following:
| ||||||||||||||||||||
Coherence - cross domain | The cross-sectional data for BG-SILC 2023 were compared to the Labour Force Survey 2023 and HBS 2023. When comparing SILC and HBS, the discrepancies must be taken into account. The differences are to a great extent brought about by methodological diversity. The main methodological differences are:
| ||||||||||||||||||||
Coherence - internal | |||||||||||||||||||||
Cost and burden | |||||||||||||||||||||
The average total interview duration per household is 65.9 minutes. | |||||||||||||||||||||
Data revision | |||||||||||||||||||||
Data revision - policy | Not applicable. | ||||||||||||||||||||
Data revision - practice | No revisions to report. | ||||||||||||||||||||
Statistical processing | |||||||||||||||||||||
Source data |
The sample for BG-SILC 2023 is selected from the sampling frame based on the Population Census 2011. The database includes all private households and their current members residing in the country. Persons living in collective households and in institutions are excluded from the target population. Students' and workers' hostels are excluded at the first stage of selection of PSUs, because students' and workers' households rarely stay at the same address and are difficult to trace. The frame is regularly updated according to the administrative changes made. Household data within the selected PSUs are updated according to the Information System "Demography" (ISD) data. The longitudinal component consists of the sub-samples R1, R2, R3, R4 and R6.
All personal/household income variables were collected by interview. Where the information is available, the data from the administrative source are used directly. The National Revenue Agency provides data from the register of insured persons. This register is used for the PY010, PY030, PY050 and HY090 variables. The National Social Security Institute provides data on income from pensions and other social security payments. This register is used for the PY090, PY100, PY110, PY120, PY130, HY050 and HY110 variables. The Social Assistance Agency provides data on income from social benefits. This register is used for the HY050, HY060 and HY070 variables.
Sampling unit: Two-stage sampling on a territorial principle is implemented as follows:
- at the first stage, the census enumeration units (PSUs) are selected;
- at the second stage, the households are identified.
Sampling rate and sample size: For the SILC instrument, three different sample size definitions can be applied:
- the actual sample size, which is the number of sampling units selected in the sample;
- the achieved sample size, which is the number of observed sampling units (households or individuals) with an accepted interview;
- the effective sample size, which is defined as the achieved sample size divided by the design effect with regard to the at-risk-of-poverty rate indicator.
Given that the effective sample size has already been treated in the section dealing with sampling errors, this section focuses mainly on the achieved sample size. The total gross sample size (number of households) has been calculated by analysing the non-response rates and design effects of the previous EU-SILC surveys. The total sample size in 2023 is 9389 households:
Number of households for which an interview is accepted for the database. Rotational group breakdown and total
Number of persons aged 16 or older who are members of the households for which the interview is accepted for the database, and who completed a personal interview (RB250 = 11, 14). Rotational group breakdown and total
The sample size for the longitudinal component was 27712 households and 52731 persons aged 16 and over.
Number of households in longitudinal component for which an interview is accepted for the database
Number of persons 16 years and older who are members of the households for which the interview is accepted for the database, and who completed a personal interview
| ||||||||||||||||||||
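The definition of the effective sample size given above (achieved sample size divided by the design effect) can be written as a one-line calculation. The sketch below is only an illustration: the function name is hypothetical and the figures in the example are invented, not BG-SILC results.

```python
def effective_sample_size(achieved_sample_size, design_effect):
    """Achieved sample size divided by the design effect.

    The design effect is assumed to be the one estimated for the
    at-risk-of-poverty rate indicator, as in the definition above.
    """
    if design_effect <= 0:
        raise ValueError("design effect must be positive")
    return achieved_sample_size / design_effect


# Invented figures: 8000 accepted household interviews, design effect 2.0.
n_eff = effective_sample_size(8000, 2.0)  # 4000.0
```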
Frequency of data collection | Yearly | ||||||||||||||||||||
Data collection | SILC2021 data are collected with CAPI questionnaires through personal interview with households included in the sample as well as all household members aged 16 and more.
Mean interview duration: The mean interview duration per household is calculated as the sum of the duration of all household interviews plus the sum of the duration of all personal interviews, divided by the number of household questionnaires completed. Only households accepted for the database are considered. The average household interview duration was 65.9 minutes, while the average individual interview duration was about 21.4 minutes. | ||||||||||||||||||||
Data validation | During data entry, logical controls are carried out: checks of extreme values, of the completeness of answers to all questions, of data comparability and of the links between individual questionnaires and registers. After the primary data are processed and the target variables are derived, the data are verified and validated with the SAS program provided by Eurostat. Additional compatibility checks are performed before the information is published. | ||||||||||||||||||||
Data compilation | The database of each country contains different types of weights:
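The calculation of the mean interview duration per household described above can be sketched in Python as follows. The function name and the durations in the example are hypothetical. The sketch assumes that both lists already contain only interviews from households accepted for the database.

```python
def mean_interview_duration(household_minutes, personal_minutes):
    """Mean interview duration per household, in minutes.

    household_minutes: one duration per completed household questionnaire.
    personal_minutes: durations of all personal interviews in those households.
    """
    if not household_minutes:
        raise ValueError("at least one household questionnaire is required")
    total = sum(household_minutes) + sum(personal_minutes)
    return total / len(household_minutes)


# Invented durations: two households, three personal interviews in total.
duration = mean_interview_duration([20, 30], [20, 25, 15])  # (50 + 60) / 2 = 55.0
```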
Weighting factors were calculated as required to take into account the units’ probability of selection, non-response and to adjust the sample to external data relating to the distribution of households and persons in the target population, such as sex and age, residence or administrative-territorial districts (NUTS 3).
Design factor: For the first year of the panel, each household from the new rotation group received a sampling weight inversely proportional to the probability of selection of the household. These were the household design weights DB080.
To adjust for non-responding households the procedure “weighting classes” was used. The households were divided into classes where the probability to respond was assumed to be homogenous within the classes. Due to lack of information (demographic characteristics) for the non-responding households these classes were the sampling strata. The ratio of the weights of the responding households to the weights of all households in the given class was calculated.
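A minimal sketch of a weighting-class adjustment of this kind is shown below. The text states only that the ratio of responding to total weights was calculated per class. Dividing each respondent's design weight by that ratio is the usual way such a ratio is applied and is an assumption here. The field names (`stratum`, `design_weight`, `responded`) are hypothetical.

```python
from collections import defaultdict


def adjust_for_nonresponse(households):
    """Weighting-class adjustment with sampling strata as classes.

    Returns the responding households, each with an added 'adjusted_weight'.
    """
    total = defaultdict(float)       # sum of design weights per class
    responding = defaultdict(float)  # sum of design weights of respondents
    for hh in households:
        total[hh["stratum"]] += hh["design_weight"]
        if hh["responded"]:
            responding[hh["stratum"]] += hh["design_weight"]

    adjusted = []
    for hh in households:
        if not hh["responded"]:
            continue
        ratio = responding[hh["stratum"]] / total[hh["stratum"]]
        # Assumption: the design weight is divided by the response ratio.
        adjusted.append({**hh, "adjusted_weight": hh["design_weight"] / ratio})
    return adjusted


# Invented example: one stratum, four households, three respondents.
sample = [
    {"stratum": "A", "design_weight": 100.0, "responded": True},
    {"stratum": "A", "design_weight": 100.0, "responded": True},
    {"stratum": "A", "design_weight": 100.0, "responded": True},
    {"stratum": "A", "design_weight": 100.0, "responded": False},
]
result = adjust_for_nonresponse(sample)
```

With this choice the adjusted weights of the respondents in a class add up to the total design weight of the class.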
After adjusting for the non-responding households, the base weights for the new rotation group were calibrated to the population as of 31.12.2022. For the calibration the following variables at individual and at household level were used:
The information on individuals as of 31.12.2022 was available from the ISD. The information on the households was an estimate based on the updated Census 2011 file and on data on the split-off households from the SILC survey. Persons born in 2023 were not included in the calibration as they were not part of the population as of the end of 2022. For the calibration of weights the SAS macro Calmar 2 was used. The logit method (M=3 in Calmar) was used for the calibration, setting upper and lower limits for the g-weights. The g-weights were the ratio of the assigned weights and the final calibrated weights. The calibrated weights, adjusted for non-responding households, were the base weights (RB060) for the new rotation group and will be used in the weighting procedure in the following years. These weights were also the longitudinal weights (DB095) of the households from the new rotation group.
Weighting procedure for rotation groups from previous survey waves: To obtain the base weights for the current year, the base weights (RB060) for each rotation group from the previous year were adjusted for non-response. The adjustment was made at individual level and not at household level.
To adjust for non-response, first all persons from the 2022 register (DB135 = 1 & RB110 in (1,2,3,4)) who were followed up in 2023 were marked as responding (current members of the household). Persons who had left the household between the two survey waves were marked as non-responding. A logistic regression was used to calculate the probability of each individual being enumerated between 2022 and 2023. The weights of the enumerated persons were adjusted by the probability of being followed up (the result of the logistic regression), and thus the base weights (RB060) for 2023 were obtained. The model was applied to each rotation group separately. The independent variables used in the model were: poverty indicators, education, economic activity, age, sex, household size, household type, income and dwelling type. The dependent variable indicated whether the individual was enumerated or not. New members of the household after the first year who were not part of the sample received base weights for the current year as follows:
Weight sharing: Each person in the household should receive an equal weight within the household (RB050, cross-sectional weight). For this reason, each household member, with zero or non-zero base weight, received the average base weight within the household.
After the non-response adjustment procedures, each of the 5 rotation groups was calibrated separately to the population as of 31.12.2022 according to the method described above. The same variables and levels as for the new rotation group were used for calibration.
Combining all (6) sub-samples: After applying all procedures for non-response adjustment and calibration, all sub-samples (rotation groups) were combined. Each sub-sample separately represented the whole population of the country. To combine the sub-samples, all weights were multiplied by an appropriate scaling factor. The scaling factor used was 1/6, as there were 6 rotation groups in the panel.
Final cross-sectional weights: Calibration of all rotation groups to the current population. After all the procedures had been applied, the weights were calibrated to the population as of 31.12.2022. The following variables at individual and household level were used for calibration:
Age groups: (0-15) (16-19) (20-24) (25-29) (30-34) (35-39) (40-44) (45-49) (50-54) (55-59) (60-64) (65-69) (70-74) (75+)
In 2016 the number of pensioners was used as a calibration variable for the first time. This variable had 3 levels: 1 - old-age pensions; 2 - social pensions; 3 - all others (rest of the population). To allocate each person to the correct sub-population, NSSI data on the number of personal pensions as of 31.12 were used. There were two reasons to use this variable as a calibration variable: first, to obtain a better estimate of the number of pensioners and, second, to reduce the standard error of the AROPE indicator.
After calibration, the final cross-sectional weight DB090 of the household was obtained. The individual cross-sectional weight RB050 was equal to the corresponding household weight DB090 (RB050=DB090). The newborn in 2022 were not included in the calibration. They received the corresponding household weight after calibration. The personal cross-sectional weight for all individuals aged 16 and more (PB040) was calculated after the age group (0-15) was removed. Only the individuals who responded (or were imputed) to the individual questionnaire (RB250 in (11,14)) were used. After one more calibration, the weight PB040 (personal cross-sectional weight for all household members aged 16 and more) was obtained.
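The weight-sharing step can be illustrated with a short Python sketch in which every member of a household receives the average base weight of that household, with zero weights included in the average. The function name and the input layout (pairs of household identifier and base weight) are assumptions made for the example.

```python
from collections import defaultdict


def share_base_weights(members):
    """Assign each member the average base weight within the household.

    members: list of (household_id, base_weight) pairs. Members with a
    zero base weight count towards the average.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for household_id, weight in members:
        totals[household_id] += weight
        counts[household_id] += 1
    return [
        (household_id, totals[household_id] / counts[household_id])
        for household_id, _ in members
    ]


# Invented example: household h1 has two panel members and one new member
# with zero base weight; household h2 has a single member.
members = [("h1", 300.0), ("h1", 300.0), ("h1", 0.0), ("h2", 150.0)]
shared = share_base_weights(members)
```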
| ||||||||||||||||||||
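The step of combining the rotation groups, described in the data compilation notes, can be sketched as follows. Each sub-sample is assumed to represent the whole population on its own, so the weights are multiplied by one over the number of groups (1/6 for six rotation groups). The function name and the weights are hypothetical, and the final calibration that follows this step is not shown.

```python
def combine_rotation_groups(groups):
    """Pool the weights of all rotation groups, scaled by 1 / number of groups.

    groups: list of lists, one list of weights per rotation group.
    """
    if not groups:
        raise ValueError("at least one rotation group is required")
    factor = 1 / len(groups)
    return [weight * factor for weights in groups for weight in weights]


# Invented example: six rotation groups, each summing to a population of 600.
groups = [[100.0] * 6 for _ in range(6)]
combined = combine_rotation_groups(groups)
```

After scaling, the pooled weights again add up to the population total that each group represented separately.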
Adjustment | Not applicable. | ||||||||||||||||||||
Comment |