Measuring Productivity at the Industry Level - The India KLEMS Database - ਆਰਬੀਆਈ - Reserve Bank of India
Measuring Productivity at the Industry Level - The India KLEMS Database
AUTHORS Deb Kusum Das Abdul Azeez Erumban Suresh Aggarwal Pilu Chandra Das Changes in India KLEMS Database 2018 over the India KLEMS 2017 version Overall
New Additions to KLEMS Variables
Certain Changes Made in Data Sources and Methodology Capital Input
Intermediate input
Factor Income Share
This document describes the procedures, methodologies and approaches used in constructing the India KLEMS database version 2018. This database is part of a research project, supported by the Reserve Bank of India (RBI) to analyse productivity performance in the Indian economy at disaggregate industry level. This work is meant to support empirical research in the area of economic growth. In addition, the database is meant to support the conduct of policies aimed at supporting acceleration of productivity growth in the Indian economy, requiring comprehensive measurement tools to monitor and evaluate progress. Finally, the construction of the database would also support the systematic production of reliable statistics on growth and productivity using the methodologies of national accounts and input-output analysis. In its definitive version the India KLEMS research project will include measures of economic growth, employment creation, capital formation and productivity at the industry level from 1980-81 onwards. The input measures will incorporate various categories of capital (K), labour (L), energy (E), material (M) and services (S) inputs. A major advantage of growth accounting is that it is embedded in a clear analytical framework rooted in production functions and the theory of economic growth. It provides a conceptual framework within which the interaction between variables can be analysed, which is of fundamental importance for policy evaluation. (Timmer et al., 2007)1. The present document describes the India KLEMS database version 2018. The present version is an extended India KLEMS research project, “Disaggregate Industry Level Productivity Analysis for India – the KLEMS Approach” being undertaken at the Centre for Development Economics, Delhi School of Economics. This one builds on the previous project, which was undertaken at ICRIER, New Delhi2. The Data Manual is intended to guide researchers about the variables (and their construction) used to measure both inputs and total factor productivity (TFP) at the industry level using the dataset. In addition, it is also intended to support national officials statistical agencies in future work on productivity database within the agencies. The dataset includes measures of Gross Value Added (GVA), Gross Value of Output (GVO), Labour (L) , Capital (K), Energy (E) , Material (M), Services (S), Labour Quality (LQ), Labour Productivity (LP) and Total Factor Productivity (TFP) at the industry and economy level from 1980-81 onwards. The database covering the period 1980-81 to 2016-17 has been constructed on the basis of data compiled from CSO, NSSO, ASI, Input-Output tables (I-O tables) and processed according to appropriate procedures. These procedures were developed to ensure harmonisation of the basic data, and to generate growth accounts in a consistent and uniform way. Harmonisation of the basic data has focused on a number of areas such as industrial classification, aggregation levels. The data base covers 27 industries comprising the entire Indian economy. The industries are shown in Table 1.1 below. The variables in the data set are given in Table 1.2. 1.2 Coverage: Industries and Variables In this section we describe the coverage of the India KLEMS database in terms of industries and variables. The time period covered is from 1980-81 (1980) to 2016-17 (2016). At a disaggregated level, database is created for 27 industries. The industrial classification is constructed by building concordance between NIC 2008, NIC 2004, NIC 1998, NIC 1987 and NIC 1970 so as to generate continuous time series from 1980 to 2016. This classification is very close to the International Standard Industrial Classification (ISIC) revision 3. The 27 industries are aggregated to form six broad sectors, namely:
Table 1.1 below provides a listing of the 27 industries, including the higher aggregates. Further the detailed classification and concordance of study industries with NICs is provided in Appendix. Table 1.2 provides an overview of all the series included in our database. Measures of capital (K), labour (L), energy (E), material (M) and service (S) inputs as well as gross output (GO), have been constructed using National Accounts Statistics (NAS), Annual Survey of Industries (ASI), NSSO rounds and Input-Output Tables (IO). In building annual time series on gross output, five inputs and factor income shares, various assumptions are made to fill up gaps in industry details and link series over time. As we know that NSSO rounds of unregistered manufacturing, Input Output Transaction Tables, and Employment and Unemployment Surveys by NSSO are available only for certain benchmark years. Thus, the use of information from these data sources necessitates interpolation and assumption of constant shares for building series of output and inputs. The construction of growth accounting series like total factor productivity, labour productivity are based on theoretical models of production and needs additional assumptions that are spelt out in subsequent chapters of the manual. Finally, the Other Series like NDP at factor cost, compensation of employees etc. are additional series which are used in generating the growth accounts and are informative by themselves. Chapter 2: Gross Value Added Series at the Industry Level For an individual firm or industry, productivity measure can be based on a value added concept where value added is considered as an industry’s output and only primary inputs such as labour and capital are considered as industry input. Value added based productivity measures reflect an industry’s capacity to contribute to economy wide income and final demand. In this sense they are valid complements to gross output based measures. This chapter describes the data sources and methodology used to construct the Gross Value Added (GVA) series at current and constant prices for 27 study industries for the period of 1980-81 (1980) to 2016-17 (2016). GVA of a sector is defined as the value of output less the value of its intermediary inputs. This value added created by a sector is shared among the primary factors of production, labour and capital. The National Accounts Statistics (NAS) brought out by the CSO (Central Statistics Office, Government of India) is the basic source of data for the construction of series on GVA for INDIA KLEMS-industries. NAS provides estimates of GVA (i.e., gross value added) for Indian economy at a disaggregate industry level at both current and constant prices for the period since 1950-51. CSO provide GVA at constant (2011-12) prices onwards 2011-12, and prior to that at constant (2004-05) prices. Up to 2011-12, estimates of GVA at both current and constant (2004-05) prices for all industries are directly obtained from Back Series 2011 and NAS 2014. From 2011-12 onwards, estimates of GVA at both current and constant (2011-12) prices for all industries are directly obtained from NAS 2018. GVA series at current and constant (2011-12) prices are extended backwards up to 1980-81 using annual growth rate estimated from Back Series 2011 and NAS 2014 (this has been done in order to construct India KLEMS GVA series with base 2011-12). NAS estimates of value added for a few industry groups are at a more aggregate level, requiring the splitting of the aggregates. In such cases, the NAS estimates of value added have been split to obtain estimates of value added at a higher level of disaggregation. NAS provide separate estimates of GVA for registered and unregistered manufacturing. However, onwards 2011-12 NAS disaggregated the manufacturing sector in corporate sector and household sector. For splitting the aggregate estimates of GVA for registered manufacturing industries and corporate sector, we have used data from the Annual Survey of Industries (ASI) based on the National Industrial Classification 2004 and 2008 (NIC-2004 & NIC-2008). Whereas, for the unregistered manufacturing sector, we have used results from six rounds of NSSO surveys- [40th (1984-85), 45th round (1989-90), 51st round (1994-95), 56th round (2000-01), 62nd round (2005-06), 67th round (2010-11) and 73rd round (2015-16)] to obtain value added estimates. In India, GDP for unregistered manufacturing is constructed using the labour input method. The estimates of GVA for the unregistered manufacturing sector are obtained as a product of the work force and the corresponding GVA per worker. The information about employment in the unorganised sector is only available in the benchmark years for which NSSO survey data are available. Therefore, there is no consistent source of employment data for the years between these Quinquennial surveys. Even the information on value added per worker is equally limited, since the value-added data are also updated on an approximate 5-year interval (for details, see CSO, 2007). Therefore, estimates of value added for the unregistered manufacturing sectors for the years between the benchmarks have been obtained by interpolation and for years outside the benchmark years by linear extrapolation. For splitting the aggregate estimates of GVA for household sector for recent years (i.e., 2011 onwards), we have used GVA data from 67th round (2010-11) and 73rd round (2015-16). The construction of Gross valued added series involves three steps. Step 1: A concordance table between the classification used in the NAS and the 27 study industry classification used for this project has been prepared. Further, concordance between all the 27 sectors has been constructed with NIC - 1970, 1987, 1998, 2004 and 2008. Out of the 27 study industries, for 20 industries, GVA series both in current and constant prices is directly available from NAS3. The sectors for which data are provided in NAS are Agriculture, Forestry & logging, Fishing, Mining and Quarrying, Manufacturing (registered and un-registered), Electricity, Construction, Trade, Hotels & Restaurants, Railways, Transport by other means, Storage, Communication, Banking & insurance, Real estate, Ownership of Dwelling & Business Services, Pubic Administration & Defense and Other Services. Step 2: For manufacturing industries where direct estimates of GVA were not available from NAS, estimates have been made using additional information from ASI and NSSO unorganised manufacturing data. For 6 out of 13 manufacturing sectors GVA data are directly available from NAS. The list of these industries is provided in the table below. For the remaining 7 industries GVA data is constructed by splitting the NAS data using ASI or NSSO distributions. ASI data (annual) has been used for registered manufacturing whereas interpolated ratios from NSSO 40th (1984-85), 45th (1989-90), 51st (1994-95), 56th (2000-01), 62nd (2005-06), 67th round (2010-11) and 73rd round (2015-16) rounds have been used for Unregistered Manufacturing segments. A list of study industries is presented in Table 2.2 showcasing the methodology used to split GVA of certain NAS sectors to match concordance with our classification. It is important to note that the industry level value added volume indices are based on NAS. CSO provides single deflated value added estimates for all sectors except Agriculture. Following are the details of steps taken in splitting NAS sectors into India KLEMS industries for which direct GVA series are not available. Wood and wood products and manufacturing of furniture NAS back-series 2011 (based on 2004-05 prices) provides GVA of Wood and Wood products, furniture, fixtures etc. (20+361) for registered and unregistered manufacturing sectors. Since 2004-05 (from NAS2011 onwards) we have separate series for Wood and Wood products (20) and Manufacturing of furniture and fixtures (361). For our study, we need these two industries separately and 20 would be India KLEMS sector 20 and 361 would be part of India KLEMS Manufacturing n.e.c and Recycling (36+37) i.e., 361 would be added to 369+37. Thus since 2004-05, we have used the separate GVA series of Wood and Wood Products (20) and Manufacturing of furniture (361) obtained directly from NAS disaggregated series of registered and unregistered statement of GDP by economic activity. NAS back-series 2011 of GVA of Wood and Wood products, furniture, fixtures etc. (20+361) for registered manufacturing from 1980-81 to 2003-04 has been split using the ratio of GVA at current price of Wood and Wood Product (20) to Manufacturing of furniture (361) obtained from Annual Survey of industries (ASI). In case of unorganised manufacturing GVA for these two industries, separate GVA series have been obtained by using the ratio created from NSS unorganised manufacturing surveys of the benchmark years. The ratio of GVA for the interim years between two benchmark years have been linearly interpolated till 2003-04, and from 1980-81 to 1984-85, the ratio of 1984-85 has been used.4 Coke, Refined Petroleum Products and Nuclear Fuel and Rubber and Plastic Products We split ‘rubber, petroleum products’ (which are clubbed under one group in NAS) to arrive at two industry groups i.e., Coke, Refined Petroleum Products and Nuclear Fuel (23) and Rubber and Plastic Products (25). For the organised segment, we use the ASI (annual) data to get the individual sector shares and split the NAS data using these individual shares. Likewise, we use the relevant data from the four NSS surveys mentioned earlier to get the individual sector shares for the unorganised segment of this sector. The ratio of GVA for the interim years between two benchmark years have been linearly interpolated till 2003-04, and from 1980-81 to 1984-85, the ratio of 1984-85 has been used. Basic Metals and Fabricated Metal Products In our industry classification, basic metals and fabricated metal products (27+28) and machinery (29) are separate groups whereas ‘manufacture of fabricated metal products’ (28); ‘manufacture of machinery and equipment n.e.c’ (29) and ‘manufacture of office, accounting and computing machinery’ (30) are clubbed together as metal products and machinery (28, 29 and 30) in NAS. To arrive at individual industry result, we use ASI shares for organised sectors and NSSO surveys for the unorganised sector. We add to the fraction of fabricated metal products (28) from metal products and machinery to basic metals (271+272+2731+2732) already available. Electrical and Optical Equipment In our study classification, ‘electrical and optical equipment’ includes all sectors from 30 to 33. However, ‘electrical machinery’ in NAS includes industries ‘manufacture of electrical machinery and apparatus n.e.c.’ (31) + ‘manufacture of radio, television and communication equipment and apparatus’ (32) and excludes ‘manufacture of office, accounting and computing machinery’ (30) and ‘manufacture of medical, precision and optical instruments’ (33). However, 30 is part of ‘metal products and machinery’ and 33 is part of ‘other manufacturing’ in NAS. We take out 28 from metal products and machinery in NAS, with 29 and 30 being left, which we split using ASI. NSSO surveys have been useful here as well to compute the unorganised segment share. Likewise, we also take out the share of 33 from ‘other manufacturing’ in NAS separately for both organised and unorganised segments to arrive at GVA for electrical and optical equipment. Step 3: According to India KLEMS, output is adjusted for Financial Intermediation Services Indirectly Measured (FISIM). The value of such services forms a part of the income originating in the banking and insurance sector and, as such, is deducted from the GVA. The NAS provides output net of FISIM for some industry groups at a more aggregate level. For instance, in the estimates of GVA obtained for the registered manufacturing sector, adjustment for FISIM in NAS is made only at the aggregate level in the absence of adequate details at a disaggregate level. However, we have allocated FISIM to all the sectors of manufacturing by redistributing total FISIM across sectors proportional to their sectoral GDP shares. Similar redistribution of FISIM has been done in case of Trade sector and Other Services sector. First, the value added series presented in the project are at factor cost (as published in NAS), however, according to the KLEMS methodology as adopted in EU KLEMS , value added data has to be presented in basic prices as adopted in System of National Accounts 1993 (SNA 1993). However, the basic price is the amount receivable by the producer from the purchaser for a unit of a good or service produced as output minus any tax payable, and plus any subsidy receivable, on that unit as a consequence of its production or sale. It excludes any transport charges invoiced separately by the producer. Secondly, in order to make international comparisons, we need to convert the given ‘GDP at factor cost’ to ‘GDP in basic prices’. For this, we require net indirect taxes on production (indirect taxes less subsidies) for 27 industries and for every year since 1980. At present, we have GDP at basic prices from 2004-05 to 2016-17 for some industries of India KLEMS, which has been provided to us by CSO according to the NAS industrial classification. Since the information about indirect taxes and subsidies is not readily available for 27 study industries and also for the given time period, the challenge is to extend the series backwards by splitting up aggregate indirect taxes and subsidies data. Chapter 3: Gross Output Series at the Industry Level This chapter describes the procedures and methodologies used in constructing the database for gross output series at the industry level over the period 1980-81(1980) to 2016-17(2015). We discuss both the raw data sources and the adjustments that have been made to generate the time series on output and value added consistent with the official National Accounts. The methodology for measuring industry output, and value added was developed by Jorgenson, Gallop and Fraumeni (1987) and extended by Jorgenson (1990 a). Following a similar approach as explained in Jorgenson et al., (2005, Chapter 4) and Timmer et al., (2010, Chapter 3), the time series on gross output and intermediate inputs for the Indian economy have been constructed. The gross output of an industry is defined as the value of industry production using primary factors like labour, capital and intermediate inputs purchased from other industries. The gross output production function is separable in inputs and technology. An important advantage of gross output approach is that it provides a complete measure of production and treats all inputs - labour, capital and intermediate inputs symmetrically. In contrast the value added measure of output does not explicitly account for the flow of intermediate inputs which may be the primary component of an industry’s output. We use the more restrictive value added concept primarily because it is useful for aggregation purposes. It is to be noted that aggregate output (aggregated over industry value added) is a value added concept and the detailed methodology of aggregation of output across industries is explained in chapter 8. To construct the gross output series at industry level we use multiple data sources namely National Accounts Statistics, Annual Survey of Industries, NSSO rounds for unorganised manufacturing and Input Output Transaction tables. The data source and methodology used are documented below: National Accounts Statistics: The National Accounts Statistics (NAS) published by the CSO (Central Statistics Office, Government of India) is the basic source of data for the construction of time series on gross output. NAS provides estimates of gross output (GVO) for Indian economy at a disaggregate industry level at current and constant prices since 1950-51. Gross output data is available in NAS for Agriculture, Mining and Quarrying, Construction and Manufacturing sectors (Registered and Unregistered Manufacturing). From 2011-12 onwards, estimates of GVO at both current and constant (2011-12) prices for all industries are directly obtained from NAS 2018. GVO series at current and constant (2011-12) prices are extended backward up to 1980-81 using annual growth rate estimated from Back Series 2011 and NAS 2014. (a) Filling procedures of National Accounts series: It is to be noted that the NAS estimates of gross output for a few industry groups are at a more aggregate level, requiring splitting of the aggregates. In such cases, NAS estimates of output have been split using additional information from Annual Survey of Industries and NSSO rounds of Unregistered Manufacturing to obtain estimates at higher level of disaggregation. Secondly, for Unregistered manufacturing gross output data is available in NAS from 2004-05 onwards. In this case, information from NSSO survey rounds has been used for missing years to derive output estimates of unregistered manufacturing industries at current and constant prices. Annual Survey of Industries and NSSO Quinquennial Survey Reports: As mentioned above, gross output data are available at a more disaggregated level in Annual Survey of Industries (ASI) and NSSO Quinquennial surveys for Registered and Unregistered Manufacturing industries, respectively. These secondary data sources are used in this study for two purposes: (a) in certain cases NAS provides combined estimates of GVO and GVA for two manufacturing industries. In such cases separate estimates for individual study industries are obtained with the help of ASI or NSSO unorganised manufacturing sector data. (b) For the period prior to 2004, NAS does not provide estimates of GVO for unorganised manufacturing industries. To make our estimate of GVO for this period, the NSSO data are used. The Major NSSO Rounds for Unregistered Manufacturing used are 40th Round (1984-85), 45th Round (1989-90), 51st Round (1994-95), 56th Round (2000-01), 62nd Round (2005-06), 67th round (2010-11) and 73rd round (2015-16). Input-Output Transaction Tables: As motioned earlier, for GVA series of service sectors we obtain our estimates from NAS. However, prior to 2011-12 National Accounts do not provide any estimates of gross output of service sectors and hence we rely on Input-Output transaction tables which are available at an interval of 5 years or so. This necessitates interpolation and assumption of constant shares for measuring output of services sectors. The Input-Output transaction tables for benchmark years of 1978-79, 1983-84, 1989-90, 1993-94, 1998-99, 2003-04, 2007-08 and 2013-14 (prepared by National Council of Applied Economic Research5 (NCAER)) are used to derive gross output series for service sectors. The construction of the gross output series from 1980 to 2016 at current and constant prices involves the following steps: Step 1: Measuring Gross Output of Agricultural Sector, Mining and Quarrying, and Construction NAS provides nominal and real GVO series for a) Crops and Plantation, b) Animal Husbandry c) Forestry and Logging d) Fishing. By aggregating the GVO of these four subsectors we derive the GVO of Agricultural sector. The Gross output estimates of Mining and Quarrying and Construction at current and constant prices from 1980-2016 is also directly taken from NAS. Step 2: Measuring Gross Output of Manufacturing Industries For manufacturing industries time series on gross output is obtained by adding the magnitudes for registered and unregistered segments of manufacturing. As mentioned earlier, NAS estimates of gross output for manufacturing industries are at a more aggregate level. In such cases the aggregate output of NAS at current prices has been split using additional information from ASI and NSSO unorganised sector reports. Gross output data for 6 out of 13 manufacturing industries listed in table 3.1 are directly picked up from NAS. For the remaining 7 industries, output is constructed by splitting the NAS output data using ASI or NSSO distributions. ASI data (annual) has been used for registered manufacturing whereas interpolated ratios from NSSO 56th (2000-01), 62nd (2005-06), 67th (2010-11) and 73rd (2015-16) rounds have been used for unregistered manufacturing segments. A list of study industries is presented in Table 3.2 showcasing the methodology used to split GVO of certain NAS sectors to match concordance with our classification. The detailed method of splitting the output of NAS sectors to derive output of individual industries is given as follows: Basic Metals and Fabricated Metal Products; Machinery, n.e.c.; Electrical and Optical Equipment Metal products and Machinery of NAS is split into three parts: Manufacture of Fabricated Metal Products, Manufacture of Machinery and Equipment, Manufacturing of Office Accounting and Computing Machinery. For registered segments individual industry shares from ASI are used to split the data. For unregistered segments sectoral shares are calculated from 56th (2000-01), 62nd (2005-06), 67th (2010-11) and 73rd (2015-16) NSSO rounds of unregistered manufacturing. The shares for interim years have been estimated by interpolation and applied to the combined output of NAS to split it into three industries. Machinery forms a separate study sector. Next, a fraction of Manufacture of Fabricated Metal Products is added to Basic Metals of NAS to form output for study sector Basic Metals and Fabricated Metal products. A fraction of Manufacturing of Office Accounting and Computing Machinery is added with Electrical Machinery and Manufacture of Medical and Optical Instruments to form Electrical and Optical Equipment sector. Coke, Refined Petroleum Products and Nuclear Fuel; Rubber and Plastic Products We split ‘rubber, petroleum products’ (which are clubbed under one group in NAS) to arrive at two industry groups, i.e., Coke, Refined Petroleum Products and Nuclear Fuel (23) and Rubber and Plastic Products (25). For the registered segment, we use the ASI (annual) data to get the individual sector shares and split the NAS data using these individual shares. Likewise, we use the relevant data from the four NSS surveys mentioned earlier to get the individual sector shares for the unregistered segment of this sector. Wood and Products of Wood; Manufacturing, n.e.c.; Recycling NAS back-series 2011 (based on 2004-05 prices) provides GVO of Wood and Wood products, Furniture, Fixtures etc. (20+361) for Registered and Unregistered Manufacturing segments. Since 2004-05 (from NAS2011 onwards) we have separate series for Wood and Wood products (20) and Manufacturing of furniture and fixtures (361). In our study, Wood and Wood Products (20) form a separate industry. Manufacturing of furniture and fixture (361) adds up with Manufacturing n.e.c6 (369) and Recycling (37) to form study industry Manufacturing n.e.c and Recycling. Thus since 2004-05, we have used the separate output series of Wood and Wood Products (20) and Manufacturing of furniture (361) obtained directly from NAS disaggregated series of registered and unregistered segments of GDP by economic activity. Prior to 2004, we use ASI data for registered segments to spit the NAS sectors to arrive at estimates of individual study industry. Prior to 2004, we observed some fluctuation in the estimated GVO series for Manufacturing, n.e.c.; Recycling industry. We tried to correct that by applying GVO to GVA ratio to GVA estimates. These ratios are separately obtained from ASI and NSSO survey rounds and then combined using organised and unorganised industry share in aggregate value added. However, for the period prior to 2004, separate output estimates for Unregistered Manufacturing segments are not available in NAS. Thus, to estimate output of Unregistered Manufacturing for the period 1980 to 2003 the following has been done.
Step 3: Measuring Gross Output for Services Sectors and Electricity, Gas and Water Supply Onwards 2011-12 NAS provided estimates of GVO at current and constant prices. Prior to 2011-12 Gross Output series for Services sectors and Electricity, Gas and Water Supply has been constructed using information from Input-Output Transaction Tables of the Indian economy published by CSO.
Firstly, the present study provides estimates for manufacturing and its sub-branches without segregating manufacturing (and its sub-branches) into organised and unorganised segments. However, given the employment potential and sizable presence of the unorganised segment in many of the manufacturing industries, it would be worthwhile in the Indian context to examine separately the productivity performances of both the organised and unorganised components. Some work to construct the output and input series separately for organised and unorganised components of Indian manufacturing has been done. A paper based on this analysis has been prepared. The data series for organised and unorganised manufacturing is not included in this release of India KLEMS database. Finally, National Accounts do not provide any estimates of gross output of services sector and hence we rely on IOTT which are available at an interval of about 5 years. This necessitates interpolation and assumption of constant shares for measuring output of services sectors. This issue is analogous to those explained in Timmer et al., (2010, Chapter 3) for the EU economy. Griliches (1994) paid particular attention to services sector output as a key source of uncertainty. Chapter 4: Labour Input Series at the Industry Level This chapter provides information on the sources of data and method of measuring labour services. The aim is to estimate labour input so that it reflects the actual changes in the quantity (number of persons) and quality of labour input over time. Labour input is measured by combining data on labour persons and data on labour quality. In the KLEMS framework it is desirable to estimate changes in labour composition by industries on the basis of age, gender and education. The measurement of labour composition is essentially an attempt to distinguish one labour type from the other taking into account the embodied human capital in each person. The source of human capital could be through investment in education, experience, training, etc. The contribution to output by each person also comes from this embodied capital and the reward (wages and earnings) to each person also includes the reward for investment in human capital. Therefore, it is essential to separate out these differences in labour to clearly understand the underlying differences in labour characteristics. It is in this context that an initiative has been taken to estimate labour composition index. Nevertheless, many limitations of India’s employment statistics, especially the availability of information on wages/ earnings of different category of workers which could be used as an indication of their differences in ability makes it difficult to quantify these changes in the labour force in a pertinent way. The problems of employment statistics in India has been widely discussed in the literature (Sivasubramonian; 2004, & Himanshu; 2011). The KLEMS project aims to build a time series of employment series for 27 industrial sectors. However, there exists no time-series data on employment for the Indian economy, except for the organised segment. Therefore, it was essential to make certain assumptions regarding the annual changes in the employment series using available information. Subsequently we discuss these issues in detail. The large scale Employment and Unemployment Surveys (EUS) by National Sample Survey Organization (NSSO)7 and the estimated population series based on the decennial population census are the main data sources for estimating the workforce by industry groups, as per the National Industrial Classification (NIC). The other data sources on employment are Economic Survey (for public enterprises), Annual Survey of Industries (ASI for organised manufacturing Industries) and Labour Bureau Surveys (available since 2009-10). Interpolated population is used for intervening years8. In India, major or Quinquennial rounds of EUS which have been conducted by NSSO since 1980 are 38th (1983), 43rd (1987-88), 50th (1993-94), 55th (1999-2000), 61st (2007-08), 66th (2009-10) and 68th (2011-12) rounds. The major round 32nd (1977-78) has been used for extrapolating the labour series to 1980-81. Since 1989-90, the NSSO has also conducted annual surveys with small sample sizes. While the annual surveys or thin rounds have shorter reference periods, six months in some cases, they also have limited coverage. The thin rounds relate to both rural and urban sectors of the economy. So while some economists have preferred to ignore them almost completely (Sundaram, 2007), others have supported their use (Bhalla and Das, 2005; Srinivasan, 2008). Because of the limitations of thin rounds9, they have not been used in constructing the time series of labour input. In the NSS surveys, the workers are classified on the basis of their activity status into usual principal status (UPS), usual principal and subsidiary status (UPSS), current weekly status (CWS) and current daily status (CDS) for Quinquennial rounds (also known as major rounds) and Usual Status & CWS for annual rounds (also known as thin rounds). While UPS, UPSS and CWS measure number of persons, the CDS gives number of person days. UPSS is the most liberal and widely used of these concepts and despite its limitations10 this seems to be the best measure to use given the data. UPSS, which includes all workers who have worked for a longer time of the preceding 365 days in either the principal or in one or more subsidiary economic activity has been used because of its advantages over others. Advantages of using UPSS, which gives number of persons employed, are: i) It provides more consistent and long term trend, ii) More comparable over the different EUS rounds, iii) NAS’s labour Input Method (LIM) is also now based on Principal and Subsidiary Status, and iv) Wider agreement on its use for measuring employment (Visaria, 1996; Bosworth, Collins and Virmani (BCV), 2007; Sundaram, 2008; Rangarajan, 2009). NSSO has used NIC 1970 for classification of workers by industry in 38th and 43rd rounds, NIC 1987 for 50th round, NIC 1998 for 55th and 61st rounds, NIC 2004 for 66th round and NIC 2008 for 68th round. Therefore as a starting point concordance between India KLEMS 27-sector industrial classification, and NIC-1970, 1987, 1998, 2004 and 2008 was worked out. There are, however, some data problems which need a mention:
Measuring Labour Persons at the Industry Level The construction of time series of labour input requires estimation of numbers of persons. While in India, number of persons has been used as a measure of labour input, OECD (2001) and EU KLEMS have estimated labour productivity in terms of output per labour hour worked. OECD does not favour using count of jobs and has published international comparisons of productivity for OECD countries that uses unadjusted hours. Efforts are made to estimate persons and adjust it for changes in labour composition by calculating the grand labour composition index, thus obtaining the composition change corrected labour input. The methodological issue is how to estimate number of persons employed. In India the total workforce in the country and its distribution over economic activities may be obtained from the decennial Population Census and the Employment and Unemployment Surveys (EUS) of the NSSO12. Out of the two, the latter are more dependable and have been used to assess the changes in employment and unemployment for employment planning and policy analysis. The preference for the use of EUS is generally based on the notion that prior to 2001, the three Censuses have clearly under reported the participation of women in economic activities; whereas the EUS has provided reasonably reliable estimates of the level and pattern of employment (Visaria, 1996). While Population Census underestimates work force participation rates (WPRs), the EUS estimates of total population are significantly lower than the Population Census based estimates – by over 20 percent in Urban India13. However, for the Census 2001, the WFPRs are closer to the rates from the 1999-2000 NSSO round. Due to these advantages of EUS, the present study has used the EUS14 for the survey years, i.e., till 2011-12. Thus, the surveys used are 38th, 43rd, 50th 55th, 61st and 68th rounds15. Between the survey periods interpolation of estimates was done to find employment for the intervening period. However, in few survey periods where the disaggregate employment of an industry was an outlier, it was ignored for interpolation and the interpolation in such cases was done between the other two adjacent rounds. The interpolation was supplemented by data from Economic Survey and ASI also. The growth rates, since 1984-85, from Economic Survey data has been used for the two industries- Electricity, Gas &water Supply; and Public Administration and Defence - where public sector mainly dominates. Similarly the growth in employment, since 1984-85, in the two other industries-Coke, Refined Petroleum products; and Transport Equipment, where a large proportion of employment is in the organised sector is taken from ASI. The interpolation of the employment series from 2012-13 to 2015-16 is based on the growth rates provided by the Labour Force surveys16 for the non-manufacturing industries. For the manufacturing industries, the growth rates obtained from the NSSO surveys of 67th round (2012-13) and 73rd round (2015-16) have been used for the unorganised manufacturing industries and the growth rates from ASI for the corresponding years have been used for the organised manufacturing industries. The estimates for the year 2016-17 are just extrapolated from the 2015-16 by using the average growth rates of employment between 2013-14 & 2014-15 and 2014-15 & 2015-16. The current employment series makes use of all the different available sources of data on employment to obtain a smooth and long time series which captures the relevant information provided by these sources. Between 2005 and 2015, we observed very high growth rate in number of employed persons series for the industry Electrical and Optical Equipment as compared to other manufacturing industries and the overall growth in employment in this industry during 2005-2015 was found to be higher than that for Construction (which seems somewhat unrealistic). It was also observed that there was a negative growth in employment in Textiles, Textile Products, Leather and Footwear although real GVA of this industry more than doubled between 2005 and 2015. To address these issues, some adjustments to the initial employment estimates for Electrical & Optical Equipment and Textiles, Textile Products, Leather and Footwear have been done. Onwards 2005, employment series for Electrical & Optical Equipment has been estimated applying annual growth rates obtained from ASI and NSSO rounds to the number of employed persons for the year 2005-06. Then, for the years 2005-06 to 2015-16, we compute the difference between estimated employment series from EUS rounds and that based on ASI & NSSO rounds for the Electrical and Optical Equipment industry and add it to Textiles, Textile Products, Leather and Footwear industry. This ensures that for the manufacturing as a whole our estimates remain the same as obtained from EUS rounds. Measuring Labour Composition Index The composition of labour force is of considerable importance in the context of productivity measurement, as it provides not only a more accurate indication of the contribution of labour to production but also the impact of compositional changes on productivity. Any improvement in labour skills or if proportions of each labour type in the labour force change, will have an impact on the growth of labour input beyond any change in total persons worked. It would increase the amount of labour input actually used in the process of production. One widely used methodology to capture changes in labour composition is given by Jorgenson, Gollop and Fraumeni (1987), which is that the aggregate labour input of sector ‘j’ is defined as a Törnqvist volume index of persons worked by individual labour types ‘l’ as follows:17 There is a second approach to the measurement of skill levels. The procedure is to use a simple index of educational attainment to adjust for skill differences. So improvement in educational attainment is adjusted by incorporating average years of schooling as the proxy for skill levels. For example, an index of the form: L* = easL assumes that each year of schoolings, raises the average worker’s productivity by a constant percentage, ‘a’. Such studies have been carried out for different time periods and for a large number of countries around the world, typically finding a return to each additional year of education in the range of 7 to 12 percent (BCV, 2007). Most of the recent indices of composition of labour input are based on the methodology of Jorgenson, Gollop, and Fraumeni (JGF) (1987) and uses the Törnqvist translog index. However, this methodology requires large volume of data. Using this methodology Aggarwal (2004) estimated labour composition for the Indian manufacturing labour force. There are however, lot of disagreements on the use of this methodology in the Indian context, as it assumes the existence of perfectly competitive labour markets where wage rate is the indicator of a person’s marginal productivity. The analysts argue that the observed wage differences may reflect factors other than productivity differences, such as age or gender. Its use is also questioned because data for large segment of Indian labour force- the self -employed is not available. Since the Indian labour market is still not very competitive and there are data weaknesses, therefore the researchers in India have generally avoided applying the JGF methodology. For this reason, most Indian researchers have avoided to account for the differences in age and gender characteristics. However, to account for educational differences they have preferred to exercise either of the two choices- one to use Barrow and Lee (1993) methodology and presume a constant rate of return for education (BCV, 2007) or second is to use the limited information available on wages for few rounds and only for casual and regular employees (Sivasubramonian, 2004) to determine the weights of different types of persons. Bosworth and Collins (2008) assumed a constant annual return of 7 percent for each additional year of education irrespective of the level of education. The problem with the assumption of uniform returns for each year is that it ignores all variations- across levels of education, over gender, over age groups, over industries, etc. and only includes education. It is thus not able to capture the impact of change in gender composition, age composition (a proxy for experience) and industry composition. The present version of the manual has included only one of these characteristics and a Grand labour composition index has been estimated based on JGF methodology using suitable data. The gender characteristic is avoided as Indian females still have very low level of education in general and higher education in particular, especially among the middle and senior age group. Therefore, either no data is available or there are very few sample observations for many such cells with cross classification of education, age and industry. The data for the survey years on total number of persons employed by each industry is taken from the current employment series. The estimates for the year 2016-17 are just extrapolated from the 2015-16 by using the average growth rates of labour quality between the last two rounds. The data on employment is essentially derived from the unit level record data of National Sample Survey (NSS) which is made available by NSSO in the form of CD-ROMS for the five Quinquennial rounds beginning from 38th. We estimated the number of employed persons according to UPSS as follows:
For extrapolation backward to 1980-81 to 1982-83, the interpolation of the broad industrial classification of 32nd round and 38th round is used. So the estimates from 32nd round are mainly used as control numbers. For the construction of the labour composition indices we require data on employment and earnings by education for the broad sectors and for each of the 27 industries. We distinguish three types of educational categories for broad sectors and three types of educational categories for each of 27 industries because of large data requirement at disaggregate level. The three education categories used are ‘up to primary’, ‘above primary to higher secondary’, and ‘above higher secondary’. Therefore the following additional steps have also been performed:
Secondly, earnings of self-employed have also been estimated from the monthly consumption expenditure of these households. In this, first the total monthly consumption expenditure is divided by the number of employed persons in the household to get total monthly consumption expenditure per employed person. Then the ratio of wage earnings by total monthly consumption expenditure per employed person has been calculated for each industry by UPSS status for the selected rounds. Assuming the consumption earnings ratio to be same for casual and self-employed persons, the ratio for casual labour is used for self-employed persons and this ratio is multiplied to the total monthly consumption expenditure per self-employed person so as to get the earnings of the self-employed persons. However, if the earnings thus obtained are higher than the earnings obtained from Mincer equation, then the latter are used. So the lower of the two - earnings obtained from Mincer equation or the earnings based on consumption expenditure, are taken to be the earnings of the self-employed persons. Once the above steps are taken to find out the educational distribution of all employed persons in all the five rounds and their corresponding wages, the computation of the grand labour composition index is carried based on the JGF (1987) methodology with 1980-81 equal to 100. The index is estimated for total persons working in 27 different industries in India for the 38th, 50th, 55th, 61st, and 68throunds of NSSO with 1983 (38th round) equal to 100 so as to assess the temporal changes in labour skill. Since the series required is from 1980-81, we have extrapolated it backwards from 1983 to 1980-81. The next phase of this project aims to update the series and perform some robustness checks of the current employment series with the help of the next round of EUS data, which is expected in 2019 and bring out policy implications from the employment trends. Appendix C: Definitions of Employment in NSSO Employment & Unemployment Surveys The surveys of NSSO on employment and unemployment (EUS) aim to measure the extent of ‘employment’ and ‘unemployment’ in quantitative terms disaggregated by various household and population characteristics following the three reference periods of (i) one year, (ii) one week, and (iii) each day of the week. Based on these three reference periods three different measures, termed as usual status, current weekly status, and the current daily status, are arrived at. While all these three approaches are used for collection of data on employment and unemployment in the Quinquennial surveys, the first two approaches only are used for the purpose in the annual surveys. Usual principal status: In the NSS 27th round, the usual principal activity category of the persons was determined by considering the normal working pattern, i.e.,, the activity pursued by them over a long period in the past and which was likely to continue in the future. For the identification of the usual principal status of an individual based on the major time criterion, in NSS 27th, 32nd, 38th, 43rd rounds, a trichotomous classification of the population was followed, that is, a person was classified into one of the three broad groups ‘employed’, ‘unemployed’ and ‘out of labour force’ based on the major time criterion. From NSS 50th round onwards, the procedure was changed and the prescribed procedure was a two-stage dichotomous one which involved a classification into ‘labour force’ and ‘out of labour force’ in the first stage, and thereafter, the labour force into ‘employed’ and ‘unemployed’ in the second stage. Usual subsidiary status: In the usual status approach, besides principal status, information in respect of subsidiary economic status of an individual was collected in all employment and unemployment surveys. For deciding the subsidiary economic status of an individual, no minimum number of days of work during the last 365 days was mentioned prior to NSS 61st round. In NSS 61st round, a minimum of 30 days of work, among other things, during the last 365 days, was considered necessary for classification as usual subsidiary economic activity of an individual. Current weekly status: It is important to note at the beginning that in the EUS of NSSO, a person is considered as worker if he/she has performed any economic activity at least for one hour on any day of the reference week and uses the priority criteria in assigning work activity status. This definition is consistent with the ILO convention and used by most of the countries in the world for their labour force surveys. In NSSO, prior to NSS 50th round and in all the annual surveys till NSS 59th round, data on employment and unemployment in the CWS approach was collected by putting a single-shot question ‘whether worked for at least one hour on any day during the last 7 days preceding the date of survey’. The information so collected was used to determine the CWS of the individuals. This procedure was criticized for being not able to identify the entire workforce, particularly among the females. It was then decided to derive the CWS of a person from the time disposition of the household members for the 7 days preceding the date of survey. The procedure was used for the first time in NSS 50th round. It is seen that the change in the method of determining the current weekly activity had resulted in increasing the WPR in current weekly status approach - more so for the females in both rural and urban areas than for males. The trend observed in NSS 50th round in respect of the WPR according to CWS suggested continuing with the procedure for data collection in CWS in NSS 55th and NSS 61st rounds. Current Daily Status Current Daily Status (CDS) rates are used for studying intensity of work. These are computed on the basis of the information on employment and unemployment recorded for the 14 half days of the reference week. The employment status during the seven days is recorded in terms of half or full intensities. An hour or more but less than four hours is taken as half intensity and four hours or more is taken as full intensity. An advantage of this approach is that it is based on more complete information; it embodies the time utilisation, and does not accord priority to labour force over outside the labour force or work over unemployment, except in marginal cases. A disadvantage is that it relates to person-days, not persons. Hence, it has to be used with some caution.
Chapter 5: Capital Input Series at the Industry Level This chapter outlines the methodology employed to estimate capital services for the 27 industries in the India KLEMS database version 2018. Following an overview of the theoretical method developed by Jorgenson and Griliches (1967), and outlined in Jorgenson, Gollop and Fraumeni, (JGF, 1987), the Chapter discusses the specific empirical approaches we follow to implement these methods within the constraints of data availability for Indian industries. For the measurement of capital services we need capital stock estimates for detailed asset types and the shares of each of these assets in total capital remuneration. Using the Törnqvist approximation to the continuous Divisia index under the assumption of instantaneous adjustability of capital, aggregate capital services growth rate is derived as a weighted growth rate of individual capital assets, the weights being the compensation shares of each asset, i.e., Since our measure of capital input takes account of asset heterogeneity, it was essential to obtain investment data by asset type. We distinguish between 3 different asset types – construction, transport equipment, machinery (includes ICT and non-ICT machinery).23 We exploit multiple sources of information for the construction of our database on capital services. This includes the National Accounts Statistics (NAS) that provide information on broad sectors of the economy, the Annual Survey of Industries (ASI) covering the organised manufacturing sector, the National Sample Survey Organizations (NSSO) rounds for unorganised manufacturing and Input-Output transaction tables. Even though we use multiple sources of data, our final estimates are fully consistent with the aggregate data obtained from the NAS. In addition, our approach to capital measurement is consistent with international practices such as the EU KLEMS24, which ensures the possibility of international comparisons. In what follows we discuss the various sources of data for asset-wise investment and the construction of the relevant variables, in detail. (a) Asset-wise investment for broad sectors of the economy Industry-level estimates of capital input require detailed asset-by-industry investment matrices. NAS provides information on aggregate capital formation by industry of use for 9 broad sectors, which, nevertheless, was not sufficient for our purpose. Therefore, we have collected more detailed data on assets and industries from the CSO.25 This is the data underlying the published aggregate gross fixed capital formation by the broad industry groups, separately for public and private sectors. For those sectors for which the investment matrices were not available from CSO, we gathered information from other sources (e.g. ASI for organised manufacturing and NSSO surveys for unorganised manufacturing) and benchmark it to the aggregate investment series from the National Accounts. The data used in the current version of the India KLEMS is based on the revised NAS with 2011-2012 base, and is available only since 2012. Therefore, for earlier years, we extrapolate the series using growth rates from previous version of the data. However, there were slight differences between the industry groups available in the current release of the data and the previous ones (see Table 5.1), which required some matching of sectors before combining the two series. Table 5.2 provides an overview of asset types available in NAS and their corresponding asset types used in our study. Investment in education and health are obtained directly from national accounts for the period after 2012, for each asset. For years before 2012, we assume the trend in the distribution of output, in order to split the total investment in the aggregates of these sectors into sub-sectors.
Total investment in each asset category is calculated as the sum of private and public sector investment in each asset. Investment in transport equipment is not available separately for private sector. We tried several approaches to impute the private sector transport equipment data. The first was to use the share of transport equipment in non-departmental enterprises. More specifically, we apply the non-departmental enterprise transport equipment to machinery & equipment (including transport equipment) ratio to machinery & equipment in private sector for each industry to obtain industry-wise transport equipment for private sector. We take non-departmental enterprises only, rather than the entire public sector, as it may be more realistic as it consists of public sector companies and statutory corporations, excluding administrative sector. However, the sum of industry estimates generated by this approach was not consistent with the reported aggregate private sector transport equipment. Therefore, we take a second step here, which is to use the industry distribution from this series and apply it to the published total private sector transport equipment data. The estimated transport equipment is then subtracted from each industry’s total machinery & transport equipment data, to obtain machinery in private sector as a residual. However, this approach generates many negative numbers in transport equipment in private sector, particularly in transport services. Therefore, we follow a third approach, which is what we finally use in the database. We first distribute the NAS total machinery in the private sector using the industry distribution of machinery and transport equipment. These estimates are then subtracted from each industry’s total machinery & transport equipment data to obtain the transport equipment investment in the private sector industries. (b) Asset-wise investment for non-NAS sectors NAS provides data only for 9 broad sectors, while we have 27 industries, which necessitated further splitting of some of the NAS sectors. This includes aggregate manufacturing (registered and unregistered separately) with 13 sub-sectors; other services into 4 sub-sectors; and real estate activities and business services into 2 sub-sectors. The manufacturing sector investment data was disaggregated into 13 sub-sectors at the 2-digit level of NIC 1998 using ASI and NSSO data, which will be discussed in detail subsequently. Investment series in service sector has been split into sub-sectors using two alternative approaches–value added shares, and capital/labour ratio in the higher aggregate industry. However, the final data used are based on value added shares, as a sensitivity analysis did not show a significant difference between the two. In order to split the aggregate capital formation in organised manufacturing sector into 13 study sectors, we use the Annual Survey of Industries. However, the published data does not provide any asset--wise investment information; it consists of only the aggregate capital formation or the book value of fixed capital. Most studies in the past have measured gross investment as the difference between book value of asset in period t and in period t-1 and add depreciation in period t to that. This approach has the deficiency of comparing two different samples reported in two different years, where the number of firms/factories might be different. In particular, while using this approach at industry level, for detailed asset categories, it might generate massive negative investment. We follow an alternative approach, following ASI’s definition of gross fixed capital formation (GFCF). ASI defines GFCF as actual additions (newly purchased, second hand and own construction) minus deductions plus depreciation adjustment for discarded assets during the year. This approach is based on a single year’s sample and helps to avoid potential huge negative investment series, and is also consistent with the published ASI GFCF series. The yearly detailed volumes beginning 1964-65 were used to derive the gross fixed capital formation by asset type directly. For the years 1964-1978, the relevant data are obtained from published detailed volumes. For the period, 1983-84 to 2004-05 ASI has generated detailed tables from Block C of ASI schedule that contain data on fixed assets. Data for missing years are interpolated using the changes in investment using book value method. Table 5.3 provides an overview of the asset categories available in ASI, and the relevant asset categories in our study to which they are attributed. Though ASI provides investment in land, for reasons of NAS consistency we exclude it from our database. Once investment in each of these assets and industries are generated using ASI data, we apply this industry-asset distribution to the published aggregate NAS GFCF series for organised manufacturing sector. It may also be noted that from 1960-61 to 1971-72, ASI data are for the census sector and from 1973-74 onwards they are for the factory sector. In order to make these two series comparable over years, we convert the data prior to 1972 to factory sector using the factory/census ratio in 1973. Thus, after these adjustments, we obtain investment data for 13 manufacturing sectors, by asset types, consistent with the NAS aggregates. The data required for creating the gross investment series for the 13 sectors of the unorganised manufacturing sector are obtained from various rounds of NSSO surveys on unorganised manufacturing. We use 6 rounds of NSSO surveys that cover the period 1989-2016. These are 45th round (1989-90), 51st round (1994-94), 56th round (2000-01), 62nd round (2005-06), 67th round (2010-11) and 73rd round (2015-16). Unit level data has been aggregated to 13 industries using the appropriate concordance tables. NSSO provides net addition to owned assets during the reference year within the block of fixed assets, and we use this as a measure of our investment. Asset classification in NSSO has changed over various rounds, and therefore, we have tried to match these with our classification as shown in Table 5.4. The investment series arrived at for four rounds were interpolated to obtain the annual time series of unorganised gross fixed capital formation by asset type. As in the case of registered sector, once the investment by asset types across industries are constructed, the asset-industry distribution is applied to the published NAS aggregate GFCF in unregistered manufacturing to obtain NAS consistent GFCF by asset type and industries. (c) Investment Prices by Asset Types In order to compute asset-wise capital stock using PIM (equation 5.3) and rental price (equation 5.4), we require asset-wise investment price deflators. Since CSO has provided us with investment data by industries and assets both in current and constant prices, we could derive the price deflators with base 2011-2012. For years before 2011, prices were spliced using 2004-2005 base investment deflators. These deflators are directly used for all the three asset categories we have. (d) Initial Stock, Depreciation Rates and Rate of Return As is evident from equations 5.1 to 5.4, our estimates of capital input require time-series data on asset-wise capital stock. Capital stock has been constructed using perpetual inventory method (PIM), where the capital stock (S) is defined as a weighted sum of past investments with weights given by the relative efficiencies of capital goods at different ages, which requires data on current investment by asset types, investment prices by asset types and depreciation rate. Also, for the practical implementation of PIM to estimate asset-wise capital stock, we require an estimate of initial benchmark stock (see Erumban, 2008b for an in-depth discussion on this issue). NAS provides estimates of net capital stock since 1950 for all the broad sectors in its Statement 17: Net Fixed Capital Stock by industry of use. We take the NAS estimate of real net capital stock in 1950 (in 1999-2000 prices) as our benchmark stock for all non-manufacturing sectors, and for manufacturing sectors the same is taken for the year 1964.27 However, since the NAS estimate is available only for broad sectors and for aggregate capital, we use our industry-asset distribution of GFCF in order to create net fixed capital stock estimates by asset type for all the 27 industries. NAS also provides detailed tables on assumed life of assets used for computing capital stock, for private units, administrative units as well as departmental and non-departmental units by asset types.28 We use these estimates of lifetime to derive appropriate depreciation rates for non-ICT assets, using a double declining balance rate. We assume 80 years of lifetime for buildings, 20 years for transport equipment, and 25 years for machinery and equipment. The final depreciation rates used in the study are given in Table 5.5 by asset type. Subsequently, we build our capital stock series by asset types for all the 27 industries using our GFCF series from 1950 (1964) onwards for the non-manufacturing (manufacturing) sectors. Our measure of capital input is arrived using equation (5.1), for which we also require estimates of rental prices (see equation 5.4). Assuming that the flow of capital services is proportional to the capital stock at individual asset level, aggregate capital flows can be obtained using a translog quantity index by weighting growth in the stock of each asset by the average shares of each asset in the value of capital compensation, as in (5.1). The rate of return (i) in equation (5.4) represents the opportunity cost of capital, and can be measured either as internal (or ex-post) rate of return, or as an external (ex ante) rate of return.29 This issue will be addressed in the further revisions of the data. The present version of the database uses an external rate of return, proxied by average of return on government securities and prime lending rate obtained from the Reserve Bank of India30. Therefore, we use a real rate, which is net of capital gain. Hence, the capital gain component in equation (5.4) is excluded while estimating rental price using external rate of return, obtaining Where i* is the real rate of return, nominal interest rate adjusted for CPI inflation rate. Chapter 6: Intermediate Input Series at the Industry Level In this section we describe the basic approach we have used to derive the volume series of Intermediate Inputs namely – Energy input (E), Material input (M) and Services input (S). This breakdown of intermediate inputs can be used for extending the growth accounting exercises, but also convey interesting information about changing pattern in intermediate consumption ( see e.g. JHS 2005, chapter 4) The methodology for measuring industry output, intermediate inputs and value added was developed by Jorgenson, Gallop and Fraumeni (1987) and extended by Jorgenson (1990 a). The cornerstone of this approach is a time series of input-output (IO) tables which gives the flows of all commodities in the economy, as well as payments to primary factors. Every commodity is accounted for, whether produced by a domestic source or imported, and every use is noted, whether purchased by an industry or by a final demand element. All payments to factors of production i.e., labour and capital is accounted for so that all income elements of GDP are included. The methodology of constructing time series on energy, material and services inputs for the European economy has been elucidated in Timmer et al.,, (2010, Chapter 3). Following a similar approach as explained in Jorgenson et al., (2005, Chapter 4) and Timmer et al., (2010, chapter 3), the time series on intermediate inputs for the India KLEMS project have been constructed. Definition of EMS: As in EU KLEMS, this study identifies three main categories of intermediate inputs. They are classified as follows:
Intermediate inputs are broken down into energy, material and services, based on input-output transaction tables using a standard NIC product classification. The following five energy types (and products) have been classified as the Energy input.
The following fourteen input items have been classified as the Service input
All other intermediate inputs barring the above- mentioned nineteen inputs are classified as material input. The key building block for constructing time series on intermediate inputs at current prices, as explained in Jorgenson et al., (2005, Chapter 4), is the input-output transaction tables, that is, the inter-industry transaction tables that provide a description of which industries produce each product and which industries use them. The IOTT gives the inter-industry transactions in value terms at factor cost presented in the form of commodity x industry matrix where the columns represent the industries and the rows as group of commodities, which are the principal products of the corresponding industries. Each row of the matrix shows in the relevant columns, the deliveries of the total output of the commodities to the different industries for intermediate consumption and final use. The entries read down industry columns give the commodity inputs of raw-materials and services, which are used to produce outputs of particular industries. The column entries at the bottom of the table give net indirect taxes (NIT) (indirect taxes – subsidies) on the inputs and the primary inputs (income from use of labour and capital), i.e., Gross Value Added (GVA). As the IOTT is in the form of commodity x industry matrix, the row totals do not tally with the column totals. The difference between each column and the corresponding row totals is due to the inclusion of the secondary products, which appear particularly in the case of manufacturing industries. This is so because by-products are also manufactured by industries in addition to their main products. Thus, while determining the entries in the rows, a by-product of an industry is transferred to the sector (commodity row), whose principal product is the same as the by-product under reference. The columns, however, show the total of principal products and by-products of each industry. All the entries in the IOTT are at factor cost, i.e., excluding trade and transport charges and NIT. The Input Flow Matrix at factor cost, published by CSO, for 1978 is a 60 x 60 matrix. The absorption matrices for 1983, 1989, 1993 and 1998 have 115 sectors. However a detailed 130 sector absorption (commodity x industry) matrix for the Indian economy has been published from 2003-04 onwards. The scheme of sector classification adopted in supply use table 2012-13, IOTT 2007-08 and IOTT 2003-04 vis-à-vis, IOTT 1983-84, IOTT 1989-90, IOTT 1993-94 and 1998-99 has undergone significant change with the disaggregation of some of the sectors, which have become significant in early 2000s.
The methodology for computation of intermediate input series for 27 Industries from 1980-2016 at current and constant prices is explained in steps. Step 1: Concordance is done between IOTT and study industries
Step 2: Obtaining estimates for Material, Energy and Service Inputs for 27 Industries, in benchmark years
Thus, for each benchmark year, estimates are obtained for Material, Energy and Service Inputs that has been used to produce Gross output in the 27 different India KLEMS Industries. Step 3: Projecting a time series (1980 to 2015) of proportions of Material, Energy and Service Inputs in Total intermediate inputs for each of the 27 industries
There exist some abnormal fluctuation in the proportions of Material Inputs, Energy Inputs, and Service Inputs in total intermediate inputs for benchmark years. In that case we drop that specific benchmark proportion and linearly interpolate the adjacent benchmark proportions to estimate proportion for intervening years. For the industry group ‘Coke, Refined Petroleum and Nuclear Fuel’, we estimated proportions of Material Inputs, Energy Inputs, and Service Inputs in total intermediate inputs from ASI data instead of Input Output Transaction Table. This has been done because the relevant proportions differed significantly between the input-output tables and ASI, and the latter is believed to be making a more correct assessment of energy inputs used in the Coke and Petroleum products industry. For example, coal consumed in the Coke industry is taken primarily as material rather than energy. From IOTT, it is difficult to estimate how much of coal consumption is for energy purposes and how much is for material input, where ASI provides details information about energy consumption by energy inputs like: coal, electricity, petroleum and others including natural gas. Step 4: Consistency with NAS The projection of intermediate input vector, using IOTT in Step 3 needs to be consistent with the estimated output from NAS.
1. For Benchmark years: Ratio of total intermediate inputs from NAS to that from IOTT is adjusted proportionately to the absolute value of Energy Inputs/Material Inputs/Service Inputs obtained from IOTT. 2. For Intervening Years: The interpolated proportions of energy inputs, material inputs and services inputs obtained from IOTT, is applied directly to the total intermediate inputs from NAS to get share of each input. Two examples have been given below for the Construction sector: In the examples above, we can see that the total value of intermediate input exactly matches the gap between gross value of output and value added. Similar adjustments have been made to construct the input series for all the other 27 study industries from 1980-2016. For every benchmark year where IOTT is available, adjustment ‘A’ has been done. For every non-benchmark year, where IOTT is not available and an intermediate input vector has been projected, adjustment ‘B’ has been done to generate the comprehensive time series of intermediate inputs consistent with the official National Accounts. Steps 1-4: This gives a time series of Material, Energy, and Service Inputs for 27 study Industries from 1980 to 2016 at current prices. Steps 5 and 6 below, explain the methodology for computation of intermediate input series for 27 study Industries from 1980-2016 at constant price. The approach followed here is to first form the aggregates of materials, energy and services at current price for each study industry from the benchmark Input-Output tables and then develop deflators of materials, energy and service inputs for each of the 27 study industries separately. Step 5: Constructing Deflators of Materials, Energy and Service Inputs for 27 study Industries separately
Step 6: Computing Time Series on Intermediate Input for 27 study Industries from 1980-2014, Constant prices
Thus we have a time series of material, energy and service inputs for 27 study Industries at constant prices. Firstly, Input-Output transaction tables are generally available on a five year interval period and this necessitates interpolation and assumption of constant shares in some cases, to construct the entire time series of EMS from 1980 to 2016. Secondly, unlike studies using detailed survey data, we have to assume that all buyers pay the same price for each commodity because there is no information about price divergences. Thirdly, the estimated time series of intermediate inputs at constant prices will not be consistent with the intermediate inputs of NAS at constant price i.e., the deflated value of intermediate input cannot match the gap between value added and gross output at constant prices from NAS. This is because; NAS uses a single deflation method to estimate Gross Value Added and Output at constant prices. Fourthly, there has been some confusion in the literature on the price concept to be used for intermediate inputs. It is generally acknowledged that the intermediate input weights should be measured from the user’s point of view, i.e., reflect the marginal cost paid by the user. Most studies maintain that purchaser’s price should be used. These prices include net taxes on commodities paid by the user and include margins on trade and transportation (see, for example OECD 2001). However when trade and transportation services are included as separate intermediate inputs, margins paid on other products should also be allocated to these services. Ideally a distinction should be made between the intermediate product valued at purchaser’s price minus margins and the trade and transportation services valued at margins. This is the approach taken in Jorgenson, Gollop and Fraumeni (1987) and in Jorgenson et al., (2005). However, Timmer et al., (2010, Chapter 3) explains that for the EU KLEMS database, intermediate inputs have been valued at purchaser’s price owing to unavailability of necessary data. Because of the use of purchaser’s price concept the shares of services in intermediate inputs do not include trade and transportation margins. Similarly in practice for the database, the time series of intermediate input constructed is at purchaser’s price i.e., it implicitly takes into account the net indirect taxes. The distribution of this net indirect tax between material, energy and services has not been possible because of unavailability of a time series of tax matrix from 1980 to 2016. Chapter 7: Factor Income Share Series at the Industry Level The distribution of income between capital, labour and intermediate inputs, is an important element in growth accounting because income shares, under conditions of competitive markets, can be used to measure the contribution each factor makes towards output growth. 7.1 Methodology for Measuring Labour Income Share Series Under the assumption of constant returns to scale with two factors of production i.e., labour and capital, the sum of the labour income share and capital income share is 1. The labour income share is defined as the ratio of labour income to GVA. Capital income share is accordingly obtained as one minus labour income share. There are no published data on factor income shares in Indian economy at a detailed disaggregate level. National Accounts Statistics (NAS) of the CSO publishes the Net Domestic Product (NDP) series comprising of compensation of employees (CE), operating surplus (OS) and mixed income (MI) for the NAS industries. The income of the self-employed persons, i.e., MI is not separated into the labour component and capital component of the income. Therefore, to compute the labour income share out of value added, one has to take the sum of the compensation of employees and that part of the MI which are wages for labour. The computation of labour income share for the 27 study industries involves two steps. First, estimates of CE, OS and MI have to be obtained for each of the 27 study industries from the NAS data which are available only for the NAS sectors (see Table 7.1). Second, the estimate of MI has to be split into labour income and capital income for each industry for each year (except for those industries for which the reported MI is zero, for instance, public administration). Basis data sources used for the computation of labour income share are NAS, ASI and unit level data of survey of unorganised manufacturing enterprises. These data sources are used to obtain estimates of CE, OS and MI for each of the 27 study industries. For splitting the labour and non-labour components out of the MI of self-employed, the unit level data of NSS employment-unemployment surveys are used along with the estimates of CE, OS and MI obtained from the NAS. a) Construction of Labour Income Share Series in Gross Value Added The estimation of labour income share for the 27 study industries has been done in two steps, as discussed below. Step 1: Estimation of CE, OS and MI for the 27 study Industries For some industries under study, for instance (i) Agriculture, forestry & logging and fishing; (ii) Mining & Quarrying; (iii) Electricity, gas and water supply and; (iv) Construction, the required data are readily available from NAS. For others, the estimates available in NAS have to be distributed across the study industries. In certain cases, the estimates of CE, OS and MI for a particular NAS sector have been distributed across constituent study industries proportionately in accordance with the gross value added in those industries. Estimate of factor incomes for ‘other services’ in NAS , for instance, has been split into estimates for (i) Education; (ii) Health and Social Work, and ; (iii) Other Community, Social and Personal Services including Renting of machinery and business services, and Private Household with Employed Persons. Before 2011-12, NAS provided estimates of factor incomes for registered manufacturing and unregistered manufacturing, but not for individual manufacturing industries. However, onwards 2011-12 NAS disaggregated the manufacturing sector into corporate and household sector. The NAS estimates of factor incomes for registered manufacturing and corporate sector have to be split into various manufacturing industries considered in the study (13 in number) using ASI data. The reported CE in NAS for registered manufacturing and corporate sector has been distributed into those 13 industries in proportion to the reported ASI data on emoluments for various industries. In a similar way, using ASI data, the estimate of OS for registered manufacturing and corporate sector has been distributed. Emoluments are subtracted from gross value added for various industries yielding capital income. The share of different industries in aggregate capital income of organised manufacturing indicated by the ASI data is used to split the estimate of OS for registered manufacturing and corporate sector reported in the NAS. The methodology applied for unregistered manufacturing and household sector is similar. The published results and unit level data of survey of unorganised manufacturing industries have been used for this purpose. The estimates of wage payments (to hired workers) in different industries have been used to split (proportionately) the estimate of CE in the NAS for aggregate unregistered manufacturing. The estimated wage payment is subtracted from the estimated value added to obtain an estimate of capital income and MI of the self-employed in various unorganised manufacturing industries. The estimate of MI provided in NAS for unregistered manufacturing and household sector has then been proportionately distributed across industries using the estimate of capital income and MI of the self-employed in various industries that could be formed on the basis of published results and unit level data of survey of unorganised manufacturing industries. Unlike the ASI data for organised manufacturing, the data for unorganised manufacturing enterprises are available for only select years. The proportions mentioned above could therefore be computed only for those select years (data for five rounds have been used; these are for 45th round (1989-90), 51st round (1994-95), 56th round (2000-01), 62nd round (2005-06), 67th round (2010-11) and 73rd round (2015-16)). It has accordingly been necessary to resort to interpolation/ extrapolation to obtain the relevant proportions for other years. Step 2: Splitting of MI into Labour Income and Capital Income As explained above, the income share of labour is computed as: The derivation of the GVA series for different industries has been briefly explained in Chapter 2. The derivation of CE and MI series has been explained in step I above. Therefore, only the estimation method of ƞ needs to be described. The estimation of ƞ has been done with the help of NSS survey-based estimates of employment of different categories of workers (number of persons and days of work) and wage rates (which has been described briefly in chapter 4) coupled with estimates of MI, basically obtained from the NAS. Two approaches have been taken to get an estimate of ƞ, and the labour income share series for different industries finally adopted in the study makes use of an average of the estimates of ƞ obtained by the two approaches. In the first approach, an estimate of labour income of self-employed workers has been made for each study industry for five years, 1983-84, 1987-88, 1993-94, 1999-00, 2004-05, 2009-10, 2011-12 and 2015-16 on the basis of the estimated number of self-employed, wage rate of self-employed and the number of days of work per week. The industry-wise estimates of number of self-employed, wage rate of self-employed and the number of days of work per week have been made from unit records of NSS employment-unemployment survey (major rounds), as explained in chapter 4 above. The estimates of the number of self-employed, wage rate of self-employed and the number of days of work per week provide an estimate of the annual labour income of self-employed workers which is divided by the MI of self-employed (derived from NAS) to get an estimate of ƞ. For five industries, the ratio in question has been computed and applied. For the other 22 industries, the ratio in question has been computed after clubbing the industries into 11 industry groups.31 In the latter case, a common ratio computed for group of industries has then been applied to constituent industries. The list of industries or industry groups for which ƞ has been estimated is given in table 7.2. In the second approach, the NSS data are used to compute the following ratio: the ratio of labour income of self-employed workers to the labour income of regular and casual workers. Let this be denoted by θ. Then, the estimate of CE provided in the NAS is multiplied by θ to obtain an estimate of the labour income component out of the MI reported in the NAS. The labour component of MI divided by total MI gives an estimate of ƞ. In case the estimated labour component of MI exceeds the estimate of MI, the estimate of ƞ has been taken as unity. Examining the estimates obtained by the first approach, it was found that the estimated labour income share out of MI varied significantly among the estimates for the seven years for which the ratio in question has been estimated. The estimates of ƞ obtained by the second approach has the problem that in a number of cases, the estimated labour component of MI exceeds the estimate of MI given in the NAS, and therefore ƞ is taken as one. The method finally adopted is as follows: (a) The average value of ƞ has been computed for each industry or industry group by taking the estimates for the years 1983-84, 1987-88, 1993-94, 1999-00, 2004-05, 2009-10, 2011-12 and 2015-16. This has been done separately for the estimates based on approach-1 and those based on approach-2. (b) The estimates of ƞ obtained for each industry or industry group by the two approaches have then been averaged. (c) Having obtained an estimate of ƞ, equation 6.1 given above has been applied to compute the labour income share. b) Construction of Factor Income Share Series in Gross output The income share of labour, capital and intermediate inputs in Gross output has been computed using the following steps:
The splitting of unorganised sector factor incomes into individual industries has been done using unorganised survey results. The proportions computed for 1983-84 has been applied for the period 1980-81 to 1983-84. While there were surveys of unorganised manufacturing enterprises in 1978-79 the survey results have not been used for estimation of factor incomes in different industries of the unorganised manufacturing sector. Chapter 8: Growth Accounting Methodology This chapter deals with the methodology of measurement of total factor productivity (TFP) growth for individual industries in the KLEMS framework and the aggregation from industry level productivity measures to measures for broad sectors and the economy as a whole. The methodology of analysis of sources of gross output growth at the individual industry level and sources of Gross value added (GVA) or GDP growth at the broad sector level and economy level will also be presented in this chapter 8.1 Methodology for Measuring Productivity Growth at the Industry level The production function Our measurement of TFP growth for different industries of the Indian economy is based on a gross output production function for each industry j: Y is industry gross output, L is labour input, K is capital input and E, M and S are intermediate inputs-namely energy, material and services and T is an indicator of technology for industry j. All variables are indexed by time (t subscript is suppressed). There are several things to note about the production function- (1) all variables are aggregates of many components that have been discussed in the earlier report/chapters; (2) we assume that the industry production function is separable in these aggregates; (3) time (indicator of technology) enters the production function symmetrically and directly with inputs. An important feature of the gross output approach is the explicit role of intermediate inputs. In our study, we have considered three intermediate inputs - energy, material and services and; this is important as we may find that intermediate inputs are the primary component of some industries outputs.32 Failure to quantify intermediate inputs leads us to miss both the role of key industries that produce intermediate inputs and the importance of intermediate inputs for the industries that use them. In order to estimate the production function at constant prices, the industry level price data is used for output, capital, labour and intermediate inputs, e.g., PYj; PLj, PKj; PEj. PMj, PSj. We assume that all industries face the same input prices. The industry price of an input varies across industries due to compositional effects as industries expend different shares of their investment on each type of asset. The same is true for all inputs. Total Factor Productivity To estimate TFP growth, we begin with the fundamental accounting identity for each industry where the value of output equals the value of inputs. Under specific assumptions of constant returns to scale and competitive markets, we can define TFP growth as Rearranging equation (8.3), yields the standard growth accounting decomposition of output growth into the contribution of each input and the TFP residual where the contribution of an input is defined as the product of the input’s growth rate and its two period average value share. We can further decompose the contribution of each input into a quantity and quality component. As discussed earlier, the quality component represents substitution between H components, while the quantity component represents the increases in each detailed input. Our extended decomposition of the sources of output growth is Where Z is the stock of industry capital, Qk is the capital quality, H is number of persons employed and Ql is the quality of labour. 8.2 Methodology for Aggregation across Industries In the analysis presented in the above section, gross value of output (GVO) is taken as the measure of output of an industry and TFP growth was estimated using the gross output function framework in which capital, labour, material, energy and services are taken as five inputs (in the case of agriculture, land was included among inputs). The Törnqvist index is applied to estimate TFP growth for each year during 1980-81 to 2016-17, which yielded an index of TFP for each industry for that period, permitting estimation of trend growth rate in TFP for the period under study (1980-81 to 2016-17). The method of estimation of TFP growth for individual industries described in section 8.1 cannot be readily applied to a higher level of aggregation. The main problem is that gross value of output cannot be added across industries to generate a measure of output at a higher level of aggregation, say for the economy or of any of the broad sectors. It becomes necessary therefore to consider appropriate methods of aggregation across industries consistent with the gross output function specification at the individual industry level. It is needless to say that one has to use the concept of value added to define an appropriate measure of output at the economy or broad sector level, but even here there are important issues of aggregation, i.e., how value added in different industries should be combined. A very useful discussion of the issues involved is found in Jorgenson et al., (2005). There are three approaches to estimating TFP growth at an aggregate level. This is discussed in detail in Jorgenson et al., (2005). The first approach is called aggregate production function approach. This approach assumes the existence of an aggregate production function (say, at the level of the economy or at the level of manufacturing or services sector). An essential condition is that the production function be separable in primary inputs. Let the production function for a particular industry be defined as: where Y denotes gross output, X intermediate input (in turn a combination of materials, energy and services), K capital input, L labour input and T time (representing technology). Then, the concept of value added requires the existence of a value added function which in turn requires that the above production function be separable in K, L and T, i.e., the production function should take the following form: In this equation, V (.) is the value added function. V denotes value added. Besides the condition of separability of the production function described above, a number of highly restrictive assumptions have to be made for the aggregate production function approach. These include: (a) the value added function is the same across all industries up to a scalar multiple, (b) the functions that aggregate heterogeneous types of labour and capital must be identical in all industries, and (c) each specific type of capital and labour receives the same price in all industries. With these assumptions made, real value added at the aggregate level becomes a simple addition of real value added in individual industries. In notation, where Vi is value added in the ith industry and V is the aggregate value added. Given aggregate value added and somewhat similarly defined aggregate capital and labour input, TFP growth at the aggregate level can be computed. The second approach to aggregation is the aggregate production possibility frontier approach. It also needs the separability condition described in equations 8.8 and 8.9 above, as in the case of the aggregate production function approach. The main difference between the aggregate production function approach and the aggregate production possibility frontier approach is that the latter relaxes the assumption that all industries must face the same value added function. Thus, the price of value added is no longer assumed to be the same across industries. The implication is that the aggregate value added from the aggregate production possibility frontier is given by the Törnqvist index of the industry value added as: In this equation, wi is the share of industry i in aggregate value added in nominal terms. Thus, defining PV,i as the price of value added in industry i and Vi as the real value added in industry i, the share in question may be defined as: and the two period average is defined as: Since each industry is subject to a production function separable in capital, labour and technology, as defined in equation 8.9, real gross output, real value of intermediate inputs and real value added of industry i in a particular year t should satisfy the following relationship: In this equation, uV and uX are the shares of value added and intermediate inputs in the gross value of output in nominal terms. Given data on growth in gross output and growth in intermediate inputs of a particular industry, the above equation yields the growth in real value added of that industry. TFP growth from the aggregate production possibility frontier may be defined as follows: In this equation K and L are aggregate capital input and labour input, respectively, and vK and vL are value shares (or income shares) of capital and labour, respectively. If one maintains the assumption that each specific type of capital and labour input has the same price in all industries, then each type of capital (labour) can be summed across industries and then a Törnqvist index can be constructed to yield aggregate capital (labour) input. Alternatively, different types of capital input can be combined using a Törnqvist index into total capital input used in an industry and then industry level growth in capital input can be combined with the help of a Törnqvist index to obtain growth in capital input at the aggregate level. In a similar manner, the growth in aggregate labour input can be computed. The third approach to measuring the TFP index at a higher level of aggregation is to apply direct aggregation to industry level estimates. This maintains the industry level production accounts as the fundamental building block and begins with industry level sources of growth. Of the three approaches described here, this is the least restrictive in terms of the assumptions involved. The following equation expresses the relationship between aggregate value added growth and the growth in capital and labour inputs and TFP in individual industries: In this equation, vK is the value share of capital in gross output, vL is the value share of labour in gross output, vV is the share of value added in gross output, and wi is the share of industry i in aggregate value added in nominal terms. The bars indicate that the average of two periods, t and t-1, are to be taken. The last term in equation 8.16 is: This is the Domar aggregation of TFP growth rates at individual industry level. Note that it is weighted average of industry level TFP growth rates. But, the weights add up to more than one. The data constructed for GVA and gross output, labour input, capital inputs, intermediate inputs and factor income shares (Chapters 2,3,4,5,6 and 7 respectively), are used to estimate TFPG during the period 1980 to 2016. Labour input is measured using total person worked (see, Chapter 4 for the detailed discussion). The series on growth rate in capital services is taken from Chapter 5. Chapter 9: Extension to KLEMS Variables A. Energy intensity series – note on data sources and computation methodology 9.1 Time period covered and level of disaggregation
9.2 Concepts and methods
Data on GVA and gross output
Data Sources on energy consumption
Conversion factors
Some adjustments made to official data
9.3 Deriving energy use estimates for manufacturing
B. Estimating aggregate economy ICT investment and Capital39 This section explains the approach used to estimate investment in ICT assets for the total economy. Investment in ICT is defined as investment in three asset types: hardware, communication equipment, and software. We first explain the construction of investment data for each of these asset types and subsequently explain the construction of capital stock and capital services. The primary source of data we rely on for these estimates is official national accounts data, input-output tables and World Information Technology and Services Alliance (WITSA)40’s estimates on ICT spending. Software investment: National Accounts Statistics provides software investment by different undertakings covering administrative departments, autonomous bodies, cooperatives, departmental enterprises, household sector, non-departmental enterprises, private corporate sector, and public administration, since 1999. We have obtained this data disaggregated by industries from the Central Statistics Office (CSO), and this has been the primary series we consider as the benchmark series of software investment for India. For years before 1999, we extrapolate software series by applying software/hardware ratio to measured series of hardware investment (measurement of hardware series is discussed below). We generate a series of software/hardware ratio using the trend in software/hardware ratio in the United States. The U.S data has been obtained from the EU KLEMS database.41 While this is a rough assumption, it is better than assuming a constant or linearly growing software/hardware ratio. The U.S, being the ICT leader, may provide a realistic picture of the required hardware/software ratio, particularly in the early years.42 Hardware investment: National Accounts provides a series on investment in ICT equipment since 2012. But it does not provide a split between hardware and communication equipment, which are often considered separately in the literature. Moreover, this data is not available for years before 2012, during which ICT equipment was part of total machinery investment. Therefore, in the India KLEMS approach, we estimate the hardware and communication equipment investments separately for years before 2012, and then club them into one category, ICT equipment, in order to keep consistency with the National Accounts data. In constructing a series of software investment, once hardware data is available, de Vries et al., (2010) suggest using the elasticity of hardware to software investment, estimated using a fixed effect panel regression of software on hardware and a set of control variables. We follow a similar approach to derive hardware investment, but not using econometric techniques. We use software/hardware ratio from WITSA ICT spending data. While doing this, we exclude consumer spending. This ratio is applied to software series obtained from National Accounts, thus providing us a hardware series for 1999-2011. For years before 1999, hardware series is obtained using the trends in the hardware investment series arrived at using a commodity flow approach (CFM)43. That is the benchmark series for 1999 is extrapolated using the annual changes in the investment series obtained using the CFM. In the CFM approach, total economy investment in hardware and communication equipment can be estimated using the information on the overall domestic availability of these goods and its investment component. This requires the use of input-output tables, in combination with NAS and trade statistics. We define the investment in ICT asset i as: As mentioned before, to extrapolate hardware investment series backward we apply the trend in the obtained hardware series to 1999 hardware investment obtained using software/hardware ratio from WITSA applied to software data from NAS. Alternatively, one could directly take the series generated using commodity flow approach for the entire period (i.e., before and after 1999). However, we opted not to do so, because, the industry concordances are not at the maximum precision for IO tables after 2003. Hence, as discussed earlier, we constructed the hardware series for years after 1999 using NAS based software and WITSA based software/hardware ratio. Since the commodity flow approach produces a different estimate for hardware investment in 1999, to keep consistency, we proportionally adjust the series based on commodity flow approach – i.e., by applying the annual changes in commodity flow-based series to the 1999 benchmark estimate of hardware investment. Communication equipment: For the communication equipment series, we directly take the series generated using the commodity flow approach, using data from the IO table, because the industry description was entirely consistent and clear (see Table 1). Once the hardware and communication equipment investment data are obtained, we club these two to obtain total ICT equipment data. The trend in the estimated ICT equipment series is then applied to the published national accounts series on ICT equipment in 2012, to generate a consistent series for the earlier years. This way, we have a complete series of nominal values of ICT investment for the aggregate economy. This approach allows us to generate investment series only for the total economy, as an industry break-down is not possible with the input-output table. For the two assets, ICT equipment and Software, investment series for years before 1983 (i.e., the first benchmark I-O table), we linearly forecast the domestic availability ratios (see equation 1), until 1970. The investment estimates obtained from this linearly interpolated series will be used to derive initial capital stock (see section 5.2). ICT prices and depreciation To deflate the nominal investment series, in order to construct ICT capital stock, we also require price deflators of ICT assets. Price measurement for ICT assets has been an important research topic in the literature, as the quality of ICT capital goods has been rapidly increasing. The use of a single harmonized deflator across countries was widely advocated and used (Timmer and van Ark, 2005; Schreyer, 2002). We use a harmonization procedure suggested by Schreyer (2002) where the US hedonic deflators44 are adjusted for India’s domestic inflation rates, i.e., For the depreciation rates, following the standard practice in the literature. Jorgenson and Vu, (2005) assumes a 31.5 percent depreciation rate for software and hardware and 11.5 percent for communication equipment. Since we do not have the split between hardware and communication equipment in our data (except for the historical series from CFM), we use an average depreciation rate for hardware and communication, viz. 21.5 percent. ICT Capital stock As is the case with all other assets, we use a standard perpetual inventory method (PIM) to estimate ICT capital stock (See section 5.1). Capital stock for a given ICT asset i can be obtained using the PIM as: 1 Timmer, M.P, Mahony, M. and van Ark, B. (2007). The EU KLEMS growth and productivity accounts: an overview. Mimeo University of Groningen & University of Birmingham, 2 Readers can refer to India KLEMS Data Manual Version 2013 for the data set released by RBI on its website in June 2014. 3 From both aggregated 9 NAS sectors Gross Domestic Product by economic activity statement along with the disaggregated statements of these 9 NAS sectors. 4 It uses NAS Series 2004-05 for more recent years. 5 Kanhaiya Singh and M.R. Saluja (2016), Input-Output Table for India, 2013-14, Working Paper no WP-111, National Council of Applied Economic Research, New Delhi 6 The principal components of Manufacturing n.e.c are manufacturing of jewellery and related articles, manufacture of musical instruments, manufacture of sports goods, manufacture of games and toys and other manufacturing. 7 See Himanshu (2011) for a discussion on many issues associated with the NSSO employment surveys. 8 The interpolation methodology and the use of other sources for few industries is explained in the subsequent section. 9 The coverage and the sample size of thin rounds is different from major rounds. 10 Problems in using UPSS are: The UPSS seeks to place as many persons as possible under the category of employed by assigning priority to work; no single long-term activity status for many as they move between statuses over a long period of one year, and Usual status requires a recall over a whole year of what the person did, which is not easy for those who take whatever work opportunities they can find over the year or have prolonged spells out of the labour force. 11 In EU-KLEMS changes in labour composition has been measured by including employment class, gender, age and education (Timmer, op cit; p64) and only 3 categories of education defined as high skilled, medium skilled and low skilled have been taken. This concept of labour composition is refereed as ‘labor quality’ by Jorgenson. It is recognized (Timmer; op cit; p84) that the categorization may lead to biases in the aggregate composition adjustment if employment trends and wage share differ within categories. These education categories generally correspond to- up to Primary education, above Primary to Higher Secondary education and above Higher Secondary or College education. Due to data limitations in India we have measured changes in labour composition only by changes in education profile of labour; hence named it labour education index. Therefore it became necessary to have a more detailed classification of education to capture the changes in skill composition of labour in India. For comparison, we estimated the changes in labour composition based on three education categories also and found that for the total economy the annual trend growth in labour education index is 1.16 per cent as compared to 1.25 per cent based on five education categories. 12 This procedure has been advocated by NSSO itself and has been consistently adopted by almost all researchers. 13 A very lucid description of the same is given by Sivasubramonian (2004). 14 Appendix Table D provides the details about the Census Population, WFPR by UPSS, and persons employed in each of the EUS rounds used in the study. 15 The 66th round has not been used because of its inconsistency with the 61st and 68th round. Most economists consider it as an outlier. 16 Since 2009-10, the labour Bureau, GOI has started collecting annual data on Employment and Unemployment and has released five reports till 2015-16. Its estimates are not comparable with EUS and the unit level data, which is used to get employment by different characteristics-age, gender, education, etc. is also not provided. So only the growth rates provided by Labour Bureau data are used in the KLEMS employment series. 17 Aggregate input is measured as a translog index of its individual components. Then the corresponding index is a Törnqvist volume index (see Jorgenson, Gollop and Fraumeni 1987). For all aggregation of quantities we use the Törnqvist quantity index, which is a discrete time approximation to a Divisia index. This aggregation approach uses annual moving weights based on averages of adjacent points in time. The advantage of the Törnqvist index is that it belongs to the preferred class of superlative indices (Diewert 1976). Moreover, it exactly replicates a translog model which is highly flexible, that is, a model where the aggregate is a linear and quadratic function of the components and time. 18 Although census population is available only decennially, we used the interpolated population figures for the mid-year survey periods [Visaria, (1996) for 1977-78 (Jan) and 1987-88 (Jan); Sundaram, (2007) for 1983 (July), 1993-94 (Jan), 1999-00 (Jan) and 2004-05 (Jan)] and from NSSO (2010) for Jan 2010 and Jan 2012. 19 In EU KLEMS (Timmer, op cit; p 67) it is assumed that the earnings is equal to the earnings of ‘regular' employees. 20 The details of the function can be obtained from the Stata software and from Appendix. 21 Asset prices are used in the aggregation of capital stock. However, it is the price of the capital service that must be used in aggregating capital services (see Jorgenson and Griliches, 1967; Diewert, 1980) as it is the services delivered by capital goods that are used in the production process. Jorgenson and Griliches (1967) have shown that these two prices are related; the asset prices are the discounted value of all future capital services. They are not proportional though, as there are differences in replacement rates and capital gains among different capital assets. 23 This version of the database does not make a distinction between ICT and non-ICT assets, as the industry level data on ICT assets are weak. An attempt to estimate aggregate economy level ICT capital can be found in Erumban and Das (2015). 24 See O’Mahony and Timmer (2009) for a description of EU KLEMS database 25 This data is not publicly available. However, CSO has been kind to compile this data for the India-KLEMS project. 26 In some years transport equipment was provided as part of the machinery and equipment, categorized as ‘tools, transport equipment and other fixed assets’. In such cases, we use transport/tools, transport and other fixed asset ratio in the nearest year to separate transport equipment. 27 This choice is driven by the fact that the first year of availability of ASI data is 1964-65. 28 National Accounts Statistics-Sources and Methods, Chapter 26, CSO (2007) 29 We do not intend to delve into the controversies over the use of internal vs. external rate of return in the context of productivity measurement. Rather, given that this is the first version of our data, we use the external rate and in a later stage, we will also use internal rates. See Erumban (2008a and b) for a discussion on these issues. 30 Reserve Bank of India, Handbook of Indian Statistics, Annual volumes. 31 The estimation is done at group level rather than individual industries on the consideration that the group level estimates will be more reliable. 32 Consider, the semi-conductor (SC) industry, which is a key input to the computer hardware industry. Much of the output is invisible at the aggregate level because semi-conductor products are intermediate inputs to other industries rather than deliverables to final demand - consumption and investment goods. Moreover, SC plays a role in the improvements in quality and performance of other products like - computers, communication equipment and scientific instruments. Failure to account for them leads us to miss the role of key industries that produce intermediate inputs and importance of intermediate inputs for the industries that use them. (Jorgenson, Ho and Stiroh (2005), Productivity, volume 3). 33 Sectoral break-up of consumption of lubricants is not available. It has been assumed that two-thirds is used in industry and one-third in transport. There is a small part being used by tractors in the agriculture sector. This is ignored. The basis for the above assumption is the estimates of consumption presented in a recent report available on the internet (Sabri Hazarika, India’s Lubricants Sector, The Road to Recovery, Philip Capital India Research, available on line at: http://backoffice.phillipcapital.in/Backoffice/Researchfiles/PC_-_Indian_Lubricants_Sector_-_Sep_2016_20160920081315.pdf, accessed on 31 March 2019). For motor spirit, it is assumed that 75 per cent is consumed by households and 25 per cent by the transport sector. The basis for this assumption is break-up of petrol consumption by type of vehicle available for 2012-13 in a report of the Petroleum Planning and Analysis Cell (All-India Study on Sectoral Demand of Diesel and Petrol, Report, Petroleum Planning and Analysis Cell, Ministry of Petroleum and Natural Gas, Government of India, 2013). 34 These energy intensity series (being based on quantum of energy use in physical units) are different from energy intensity that can be computed by dividing deflated value of energy input by real value of gross output available in the India KLEMS database. The latter series has certain limitations. For instance, if power is supplied to agriculture free of cost, this will be captured properly if energy use is measured in physical unit, but not when the cost incurred on energy is considered. 35 Press note on National Accounts Statistics Back-series 2004-05 to 2011-12, base 2011-12, Central Statistics Office, Ministry of Statistics & Programme Implementation, Government of India, dated 28 November 2018. 36 This is based on calorific value of non-coking coal as given in India’s second Biennial Update Report to the UNFCCC (United Nations Framework Convention on Climate Change), December 2018, available on line at: https://unfccc.int/sites/default/files/resource/INDIA%20SECOND%20BUR%20High%20Res.pdf, accessed on 31 March 2019. The calorific value of coal used for power generation is lower than the calorific value of coal used in manufacturing. 37 All-India Study on Sectoral Demand of Diesel and Petrol, Report, Petroleum Planning and Analysis Cell, Ministry of Petroleum and Natural Gas, Government of India, 2013. According to this survey, 13 per cent of diesel is used by the agriculture sector. A working paper of NIPFP (Mukesh Anand, “Direct and Indirect use of Fossil Fuels in Farming: Cost of Fuel-price Rise for Indian Agriculture,” Working paper no. 2014-132, National Institute of Public Finance and Policy, New Delhi, 2014) reports that the share of agriculture in total diesel consumption was 19.2 per cent in 1998-99, 19.8 per cent in 2000-01, 11.9 per cent in 2008-09 and 12.2 per cent in 2010-11. 38 Sudhir Gota, “Transport Emissions and India’s Diesel Mystery Comparing Top-Down and Bottom-Up Carbon Estimates.” Working Paper. Washington, DC: EMBARQ. 2014. Available online at: http://www.embarq.org/our-work/research; Nan Zhou and Michael A. McNeil, “Assessment of historic trend in mobility and energy use in India transportation sector using bottom-up approach.” Journal of Renewable and Sustainable Energy 1(4), 2009. 39 The discussion in this section on aggregate economy ICT investment heavily draws upon Erumban and Das (2016). 40 WITSA provides ICT spending data in a cross section of countries through their Digital Planet Report. See http://www.witsa.org/ 41 We use the 2011 version of the EU KLEMS data, see www.euklems.net. 42 Note that the software investment data – be it directly from the NAS, or obtained using hardware/software ratio from WITSA or from the United States - does not capture pirated software used by companies, if any. While the use of such pirated software would indeed contribute to firm’s output growth, it will never be reported by firms, and hence is hard to capture. 43 See de Vries et al., (2010) and Timmer and van Ark (2005) for a good description of the commodity flow approach. 44 Our harmonized price deflators are based on the U.S hedonic prices, which are constructed using a hedonic regression where prices of ICT equipment regressed on several characteristics, such as for instance processor speed, hardware size, memory etc. |