Data for independent undergraduate research: cleaned data for "COVID-19 and the Uneven Impact on Pharmaceutical Innovation: Evidence from China and the EU" (doi:10.7910/DVN/QAD1L3)

Document Description

Citation

Title:

Data for independent undergraduate research: cleaned data for "COVID-19 and the Uneven Impact on Pharmaceutical Innovation: Evidence from China and the EU"

Identification Number:

doi:10.7910/DVN/QAD1L3

Distributor:

Harvard Dataverse

Date of Distribution:

2025-07-06

Version:

1

Bibliographic Citation:

Wang, Haoling, 2025, "Data for independent undergraduate research: cleaned data for "COVID-19 and the Uneven Impact on Pharmaceutical Innovation: Evidence from China and the EU"", https://doi.org/10.7910/DVN/QAD1L3, Harvard Dataverse, V1, UNF:6:Vq58VMvR6xYMGFhNOO33Ow== [fileUNF]

Study Description

Citation

Title:

Data for independent undergraduate research: cleaned data for "COVID-19 and the Uneven Impact on Pharmaceutical Innovation: Evidence from China and the EU"

Identification Number:

doi:10.7910/DVN/QAD1L3

Authoring Entity:

Wang, Haoling (University of Nottingham, Ningbo China)

Distributor:

Harvard Dataverse

Access Authority:

Wang, Haoling

Depositor:

Wang, Haoling

Date of Deposit:

2025-07-06

Holdings Information:

https://doi.org/10.7910/DVN/QAD1L3

Study Scope

Keywords:

Arts and Humanities, Social Sciences

Abstract:

The sample is drawn from the Industrial R&D Investment Scoreboard published by the IRI/JRC, which tracks the world’s top 1000 firms by R&D spending. Chinese and EU-based pharmaceutical companies form a significant portion of the dataset, making it well suited to a difference-in-differences (DID) design that compares post-COVID changes in R&D investment across regions. The dataset was restructured into a panel of firm-year observations from 2015 to 2024, covering key variables such as R&D input, capital expenditure, profit, and employment. After entries with missing values in core variables were excluded, standard data-cleaning procedures were applied in Stata. The final analytical sample includes 217 firms (114 Chinese and 103 EU companies) observed in an unbalanced panel. Company identity is tracked via the company variable. Key financial indicators such as R&D input, profits, and employees vary across both time and geography, which motivates a panel-data approach.
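
The unbalanced panel described in the abstract can be checked by counting firm-year rows per company. The following is a minimal sketch in Python using only the standard library. The rows are made-up illustrative values, not records from the dataset, and the helper name `panel_balance` is hypothetical.

```python
from collections import Counter

def panel_balance(rows, first_year=2015, last_year=2024):
    """Count distinct years per firm and report whether the panel is balanced.

    `rows` is an iterable of (company, year) pairs. Duplicate pairs are
    collapsed first, so a firm counts as complete only when it has a row
    for every year in the window.
    """
    n_years = last_year - first_year + 1
    unique_pairs = {(c, y) for c, y in rows if first_year <= y <= last_year}
    counts = Counter(company for company, _ in unique_pairs)
    balanced = all(n == n_years for n in counts.values())
    return counts, balanced

# Illustrative rows only (hypothetical firms, not taken from the data file).
rows = [("FIRM A", y) for y in range(2015, 2025)] + [("FIRM B", 2021), ("FIRM B", 2022)]
counts, balanced = panel_balance(rows)
print(counts["FIRM A"], counts["FIRM B"], balanced)  # 10 2 False
```

Applied to the real file, the same count should reproduce the per-company frequencies listed for the numeric company variable, assuming each company-year pair appears only once.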

Methodology and Processing

Sources Statement

Data Access

Notes:

CC0 1.0 (http://creativecommons.org/publicdomain/zero/1.0)

Other Study Description Materials

File Description--f11704490

File: phar1089_dealed.tab

  • Number of cases: 1089

  • No. of variables per record: 41

  • Type of File: text/tab-separated-values

Notes:

UNF:6:Vq58VMvR6xYMGFhNOO33Ow==
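
A quick way to confirm that a downloaded copy matches the case and variable counts above is to count rows and columns directly. This is a sketch under two assumptions: that the file has a single header row of variable names, and that it sits at the hypothetical local path shown in the commented lines. The function name `describe_tsv` is also made up for illustration.

```python
import csv
import io

def describe_tsv(handle):
    """Return (number of data rows, number of columns) for a tab-separated file."""
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)               # first row is assumed to hold variable names
    n_rows = sum(1 for _ in reader)     # every remaining row is one case
    return n_rows, len(header)

# Stand-in for the real file: three columns and two rows of made-up values.
sample = io.StringIO("company\tyear\ttreat\nFIRM A\t2019\t0\nFIRM B\t2021\t1\n")
print(describe_tsv(sample))  # (2, 3)

# With the downloaded file (the path is an assumption), the codebook implies (1089, 41):
# with open("phar1089_dealed.tab", newline="", encoding="utf-8") as fh:
#     print(describe_tsv(fh))
```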

Variable Description

List of Variables:

Variables

company

f11704490 Location:

Variable Format: character

Notes: UNF:6:m5zza4E9UgjzERpISJGDaQ==

country

f11704490 Location:

Variable Format: character

Notes: UNF:6:IsBU6UnINHhnlxVJra1zpg==

region

f11704490 Location:

Variable Format: character

Notes: UNF:6:Qt+PyBQFJdLRCoOhfW+ylg==

R&D input (€mn)

f11704490 Location:

Summary Statistics: StDev 907.1593725888406; Valid 1089.0; Mean 350.8243847123816; Min. 19.216; Max. 7704.0

Variable Format: numeric

Notes: UNF:6:Fd5fRaiYJpVtgtl48i85Yg==

employees

f11704490 Location:

Summary Statistics: Min. 0.0; Max. 118900.0; Mean 9377.309985968204; StDev 18419.819121413224; Valid 1069.0

Variable Format: numeric

Notes: UNF:6:XUsEjTNihl5zU9lyhzleUQ==

year

f11704490 Location:

Summary Statistics: Valid 1089.0; StDev 2.7387181850337523; Min. 2015.0; Mean 2019.9366391184574; Max. 2024.0

Variable Format: numeric

Notes: UNF:6:/ABJl4l9L5jTMLT0OYruFw==

company

f11704490 Location:

Value | Label | Frequency
133. | LUOXINPHARMACEUTICALS | 1
210. | ZEALAND PHARMA | 9
145. | MORPHOSYS | 10
150. | NOVOZYMES | 10
180. | SHANGHAI PHARMACEUTICALS | 9
157. | PHARMA MAR | 9
200. | UNIQURE | 10
144. | MITHRA PHARMACEUTICALS | 7
105. | INFLARX | 1
88. | GUANGZHOU PHARMACEUTICAL | 4
10. | ALKERMES | 10
115. | JAZZ PHARMACEUTICALS | 10
90. | GUERBET | 10
76. | FLAMEL TECHNOLOGIES | 1
192. | STAIDSON BEIJING BIOPHARMACEUTICALS | 1
7. | AIM | 3
28. | BETTA PHARMACEUTICALS | 6
211. | ZELTIA | 1
97. | HUA MEDICINE | 1
24. | BAYER | 10
117. | JHBP | 1
120. | JIANGSU RECBIO | 2
101. | HUMANWELL HEALTHCARE | 10
68. | ENDO INTERNATIONAL | 10
73. | EVOTEC | 1
166. | RECORDATI | 10
178. | SHANGHAI JUNSHI BIOSCIENCES | 6
40. | BOEHRINGER SOHN | 8
164. | PROTHENA | 6
108. | INNOVENT BIOLOGICS | 6
181. | SHANGHAI SHYNDEC | 5
195. | SWEDISH ORPHAN BIOVITRUM | 10
16. | ARGENX | 8
216. | ZHEJIANG MEDICINE | 10
193. | STALLERGENES | 1
131. | LIVZON PHARMACEUTICAL | 10
52. | CHONGQING GENRIX | 2
132. | LUOXIN PHARMACEUTICALS | 1
23. | BAVARIAN NORDIC | 10
11. | ALLERGAN | 5
204. | WALVAX BIOTECHNOLOGY | 4
214. | ZHEJIANG HISUN PHARMACEUTICAL | 6
156. | PERRIGO | 10
81. | GENOR | 3
64. | Company | 1
158. | PHARMATHEN | 4
187. | SIMCERE PHARMACEUTICAL | 5
5. | ADAGENE | 3
26. | BEIJING AOSAIKANG PHARMACEUTICAL | 5
83. | GRACELL BIOTECHNOLOGIES | 2
100. | HUAPONT NUTRICHEM | 1
155. | PAION | 1
122. | JOINCARE PHARMACEUTICAL GROUP INDUSTRY | 4
111. | IO BIOTECH | 1
8. | AKESO | 5
84. | GRIFOLS | 10
118. | JIANGSU KANION PHARMACEUTICAL | 10
27. | BEIJING WANTAI BIOLOGICAL | 5
6. | AFFIMED | 5
153. | ORION OYJ | 10
154. | ORPHAZYME | 2
113. | ITERUM THERAPEUTICS | 1
167. | REMEGEN | 5
49. | CHINA RESOURCES DOUBLE CRANE PHARMACEUTICAL | 1
25. | BEIGENE | 9
54. | CHR HANSEN | 9
96. | HORIZON PHARMA | 8
93. | HANSOH PHARMACEUTICAL GROUP | 6
124. | KEYMED BIOSCIENCES | 4
22. | B. BRAUN | 2
86. | GRUNENTHAL | 4
14. | ANTENGENE CORPORATION | 4
30. | BIAL | 9
148. | NIDDA GERMAN TOPCO | 6
151. | ONCOPEPTIDES | 3
103. | HUTCHMED | 5
79. | GAN&LEE | 4
161. | PORTON | 2
66. | DIASORIN | 10
43. | CANSINBIOLOGICS | 5
45. | CELYAD | 1
130. | LFB | 5
95. | HINOVA PHARMACEUTICALS | 1
98. | HUADONG MEDICINE | 10
9. | ALK ABELLO | 10
196. | SYMPHOGEN | 2
159. | PHARMING | 1
173. | SHANDONG LUOXIN PHARMACEUTICAL | 1
213. | ZHEJIANG BEIDA PHARMACEUTICAL | 1
104. | I-MAB | 6
51. | CHINA RESOURCES SANJIU MEDICAL & PHARMACEUTICAL | 4
89. | GUANGZHOU WONDFO | 1
112. | IPSEN | 10
12. | ALMIRALL | 10
33. | BIOCARTIS GROUP | 4
183. | SHENZHEN SALUBRIS PHARMACEUTICALS | 8
82. | GENSCRIPT BIOTECH | 6
20. | ATAI | 2
71. | ESS | 3
182. | SHENZHEN KANGTAI BIOLOGICAL | 3
169. | SANOFI | 10
114. | JACOBIO | 2
139. | MERCK DE | 10
188. | SINO BIOPHARMACEUTICAL | 10
162. | POXEL | 1
189. | SINOCELLTECH | 5
37. | BIOTEST | 7
119. | JIANGSU NHWA PHARMACEUTICAL | 3
136. | MALLINCKRODT | 10
217. | ZHEJIANG ORIENT GENE BIOTECH | 3
199. | UCB | 10
129. | LEPU BIOPHARMA | 2
85. | GRUENENTHAL PHARMA | 6
191. | STADA ARZNEIMITTEL | 4
17. | ASCENDIS PHARMA | 9
77. | FORWARD PHARMA | 2
99. | HUAPONT LIFE SCIENCE | 1
78. | FOSUN INTERNATIONAL | 10
72. | EVEREST MEDICINES | 4
38. | BOEHRINGER | 1
67. | DIZAL (JIANGSU) PHARMACEUTICAL | 4
92. | HAISCO PHARMACEUTICAL | 4
34. | BIOCYTOGEN | 2
4. | ABLYNX | 1
197. | TASLY PHARMACEUTICAL | 10
58. | CONNECT BIOPHARMA | 3
140. | MERIEUX ALLIANCE | 7
201. | UNITED LABORATORIES INTERNATIONAL HOLDINGS | 6
206. | WUXI BIOLOGICS | 5
126. | KRKA | 10
48. | CHIESI FARMACEUTICI | 10
143. | MERZ PHARMA | 7
110. | INVENTIVA | 2
149. | NOVO NORDISK | 10
13. | ALPHAMAB | 4
35. | BIOMERIEUX | 2
209. | ZAI LAB | 7
46. | CHANGCHUN HIGH & NEW TECHNOLOGY INDUSTRIES | 9
29. | BGI GENOMICS | 4
21. | AVADEL PHARMACEUTICALS | 2
47. | CHENGDU KANGHONG PHARMACEUTICAL | 6
106. | INNOCARE PHARMA | 5
60. | COSMO PHARMACEUTICALS | 1
63. | CUREVAC | 3
170. | SERVIER | 10
39. | BOEHRINGER INGELHEIM | 1
123. | KEDRION | 4
125. | KINTOR | 5
19. | ASYMCHEM LABORATORIES | 4
91. | H LUNDBECK | 10
15. | APELOA PHARMACEUTICAL | 6
128. | LEO PHARMA | 7
194. | SUZHOU ZELGEN | 3
185. | SICHUAN KELUN PHARMACEUTICAL | 10
135. | MABWELL (SHANGHAI) BIOSCIENCE | 4
127. | LABORATORIOS FARMACEUTICOS ROVI | 1
36. | BIONTECH | 9
137. | MEDA | 2
87. | GUANGZHOU BAIYUNSHAN PHARMACEUTICAL | 6
75. | FERRING PHARMACEUTICALS | 3
74. | FAR EAST SMARTER ENERGY | 6
168. | RICHTER GEDEON | 10
109. | INVENTISBIO | 2
55. | CISEN PHARMACEUTICAL | 1
70. | ERYTECH PHARMA | 2
215. | ZHEJIANG HUAHAI PHARMACEUTICAL | 5
184. | SHIJIAZHUANG YILING PHARMACEUTICAL | 10
165. | QIAGEN | 10
172. | SHANDONG DONG E E JIAO | 1
198. | TIGENIX | 1
175. | SHANDONG XINHUA PHARMACEUTICAL | 1
152. | OREXO | 1
61. | CSPC PHARMACEUTICAL | 10
121. | JOINCARE PHARMACEUTICAL | 6
53. | CHONGQING ZHIFEI BIOLOGICAL PRODUCTS | 5
174. | SHANDONG LUOXIN PHARMACY STOCK | 3
134. | LUYE PHARMA | 10
212. | ZHE JIANG HUA HAI PHARMACEUTICAL | 5
94. | HARBIN PHARMACEUTICAL | 3
179. | SHANGHAI MODERN PHARMACEUTICAL | 2
18. | ASCENTAGE PHARMA | 6
59. | COSMO PHARMA | 1
116. | JD HEALTH INTERNATIONAL | 5
163. | PROQR THERAPEUTICS | 3
176. | SHANDONGJINCHENG PHARMACEUTICAL | 1
202. | VALNEVA | 5
147. | NABRIVA THERAPEUTICS | 2
142. | MERZ | 3
203. | VETOQUINOL | 1
42. | CANBRIDGE PHARMACEUTICALS | 1
56. | CK LIFE SCIENCES | 2
1. | 3SBIO | 8
32. | BIOALLIANCE PHARMA | 1
208. | YIFAN PHARMACEUTICAL | 4
207. | XIZANG HAISCO PHARMACEUTICAL | 3
3. | ABIVAX | 1
171. | SHANDONG BUCHANG PHARMACEUTICALS | 7
50. | CHINA RESOURCES PHARMACEUTICAL GROUP | 8
57. | CLOVER BIOPHARMACEUTICALS | 4
69. | ENN NATURAL GAS | 4
31. | BIO-THERA | 6
102. | HUTCHISON CHINA MEDITECH | 3
160. | PHARVARIS | 2
205. | WUXI APPTEC | 6
107. | INNOCOLL | 2
2. | AB SCIENCE | 3
44. | CELLECTIS | 8
190. | SINOVAC BIOTECH | 5
62. | CSTONE PHARMACEUTICALS | 5
80. | GENMAB | 4
65. | DBV TECHNOLOGIES | 6
146. | MYLAN | 5
177. | SHANGHAI FOSUN PHARMACEUTICAL | 6
141. | MERUS | 5
41. | BRII BIOSCIENCES | 1
186. | SIHUAN PHARMACEUTICAL | 7
138. | MEDIVIR | 4

Summary Statistics: StDev 63.59346678273796; Max. 217.0; Mean 110.93480257116633; Min. 1.0; Valid 1089.0

Variable Format: numeric

Notes: UNF:6:VumDZGdrkkQ4d3XRoCQeng==
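
The value codes in the listing above appear to follow the alphabetical order of the labels (1 is 3SBIO, 2 is AB SCIENCE, 3 is ABIVAX, 4 is ABLYNX), which is the order Stata's `encode` command assigns. This is an inference from the listing, not a documented processing step. A minimal Python sketch of the same mapping, with the hypothetical helper name `encode_labels`:

```python
def encode_labels(labels):
    """Assign 1-based integer codes to the sorted unique labels."""
    return {label: code for code, label in enumerate(sorted(set(labels)), start=1)}

# A handful of labels copied from the listing above, in arbitrary order.
labels = ["AB SCIENCE", "3SBIO", "ABLYNX", "ABIVAX", "3SBIO"]
codes = encode_labels(labels)
print(codes)  # {'3SBIO': 1, 'AB SCIENCE': 2, 'ABIVAX': 3, 'ABLYNX': 4}
```

Because the codes are derived from the raw strings, spelling variants such as LUOXIN PHARMACEUTICALS (132) and LUOXINPHARMACEUTICALS (133) receive separate codes, so firm counts based on this variable may overstate the number of distinct firms.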

Interpolation of sales on year

f11704490 Location:

Summary Statistics: Max. 50739.0; Mean 3199.934958402927; Min. 0.0; Valid 1072.0; StDev 7332.040682259772

Variable Format: numeric

Notes: UNF:6:A3BEOtaG7M5guSbX8l6jgg==

Interpolation of profits on year

f11704490 Location:

Summary Statistics: Min. -10821.0; Max. 15500.811518; Mean 408.6775282043409; Valid 1086.0; StDev 1637.2936859434528

Variable Format: numeric

Notes: UNF:6:i7Iy2/K8vxG0+XorxPKDFw==

Interpolation of capex on year

f11704490 Location:

Summary Statistics: StDev 400.16082831386393; Min. 0.0; Mean 169.93673078822812; Max. 3462.54610317413; Valid 1064.0

Variable Format: numeric

Notes: UNF:6:J41DOUy5Ci+pC8c1F7t7Yw==

square of capital expenditure

f11704490 Location:

Summary Statistics: Min. 0.0; Max. 1.1989226E7; StDev 954256.1476087391; Valid 1064.0; Mean 188856.68429590674

Variable Format: numeric

Notes: UNF:6:98W5W8h/Iz6ogijtnPbk7A==

log of RDinput

f11704490 Location:

Summary Statistics: StDev 1.150875161605438; Valid 1089.0; Min. 2.9557433128356934; Max. 8.949495315551758; Mean 4.852476467223644

Variable Format: numeric

Notes: UNF:6:PjsyYZs2GlZz0pWJ4mP/8Q==

log of profits

f11704490 Location:

Summary Statistics: Max. 9.64864730834961; Min. -1.5425739288330078; Mean 5.433076086517235; Valid 708.0; StDev 1.5050936652559175

Variable Format: numeric

Notes: UNF:6:8t+jb1Ex8DwNFKRZ660tAg==
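
The reported minimum and maximum of "log of RDinput" match the natural logarithm of the R&D input range (19.216 to 7704.0). "log of profits" has far fewer valid cases (708) than the profits variable (1086), which is consistent with the log being left missing for zero or negative profits. The sketch below illustrates that behaviour in Python. The helper name `safe_ln` is hypothetical, and the missing-value rule is an assumption about the original Stata processing.

```python
import math

def safe_ln(x):
    """Natural log, returning None (missing) when x is missing or not positive."""
    if x is None or x <= 0:
        return None
    return math.log(x)

print(round(safe_ln(19.216), 4))  # 2.9557, the reported minimum of "log of RDinput"
print(round(safe_ln(7704.0), 4))  # 8.9495, the reported maximum
print(safe_ln(-10821.0))          # None: a loss has no logarithm
```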

log of capital expenditure

f11704490 Location:

Summary Statistics: Min. -6.214608192443848; Max. 8.149759292602539; Mean 3.7246941765995603; StDev 1.971775182206435; Valid 1049.0

Variable Format: numeric

Notes: UNF:6:TBNcL9woYP3zJ+EeVL85LA==

log of number of employees

f11704490 Location:

Summary Statistics: Mean 7.445515789331479; Valid 1056.0; Min. -4.422848701477051; Max. 11.68603801727295; StDev 2.5955276592932974

Variable Format: numeric

Notes: UNF:6:ZR48pHe0cDYiA/lRpuDI7w==

log of sales

f11704490 Location:

Summary Statistics: Max. 10.834449768066406; Valid 1055.0; StDev 2.3770927645273545; Mean 6.334825346008011; Min. -4.894988059997559

Variable Format: numeric

Notes: UNF:6:nct3i0pL61WPTOfeTYceeg==

L1_sales

f11704490 Location:

Summary Statistics: Valid 847.0; Max. 50739.0; Min. 0.0; StDev 7532.456954711602; Mean 3367.4203939954673

Variable Format: numeric

Notes: UNF:6:XlWSdhVK66moUOlXvqsxfA==

L2_sales

f11704490 Location:

Summary Statistics: StDev 7601.033199929781; Mean 3449.5300958861676; Max. 50739.0; Min. 0.0; Valid 680.0

Variable Format: numeric

Notes: UNF:6:stuL11UiUah/tko3Gd8NNg==

L1_profits

f11704490 Location:

Summary Statistics: Valid 855.0; StDev 1742.6588165586843; Mean 451.8327187181222; Max. 15500.8115234375; Min. -10821.0

Variable Format: numeric

Notes: UNF:6:nAHBAWF4ZwnnmJ3+2rqcNg==

L2_profits

f11704490 Location:

Summary Statistics: Max. 15500.8115234375; Mean 470.79026414671876; Valid 684.0; StDev 1725.1513190407584; Min. -10821.0

Variable Format: numeric

Notes: UNF:6:6fFPOyZ7qM1WUlHOdx6B+Q==

L1_Rdinput

f11704490 Location:

Summary Statistics: Mean 361.7688675727972; Valid 857.0; StDev 928.6887965509104; Max. 7704.0; Min. 20.265914916992188

Variable Format: numeric

Notes: UNF:6:W1N4hZZlWnpVM2F9xhsXWA==

L2_Rdinput

f11704490 Location:

Summary Statistics: StDev 938.2556881708622; Valid 686.0; Mean 364.7290735745218; Max. 7704.0; Min. 21.143442153930664

Variable Format: numeric

Notes: UNF:6:76UOxWSNJDcfWqe6jajZyA==

L1_capex

f11704490 Location:

Summary Statistics: Min. 0.0; StDev 399.4461689327324; Max. 2949.0; Valid 839.0; Mean 177.37917393518785

Variable Format: numeric

Notes: UNF:6:F7pL2ijbgdzUaZJ/zZtjsA==

L1_capex2

f11704490 Location:

Summary Statistics: StDev 896707.6113246479; Max. 8696601.0; Mean 190830.43742711; Valid 839.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:yD52JWEBQcvw4nsApHvJYQ==

L2_capex

f11704490 Location:

Summary Statistics: Valid 671.0; Min. 0.0; Mean 180.16774050768143; Max. 2949.0; StDev 403.3094175029634

Variable Format: numeric

Notes: UNF:6:QdquEsOSA8AvF1o5WLX/6w==

L2_capex2

f11704490 Location:

Summary Statistics: Min. 0.0; StDev 907595.3447988441; Valid 671.0; Mean 194876.48903256212; Max. 8696601.0

Variable Format: numeric

Notes: UNF:6:bpvdBw0MGtQDZfp1KuRx7A==
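
The `L1_` and `L2_` prefixes appear to denote one-year and two-year lags within a firm. The codebook does not define them, so this is an interpretation. It fits the valid counts, which fall from 1089 for R&D input to 857 and 686 for its first and second lags, as expected when a firm's earliest years, or years that follow a gap, have no predecessor. A minimal Python sketch of a gap-aware lag follows. The helper name `add_lag` and the sample values are made up for illustration.

```python
def add_lag(records, value_key, k=1):
    """Attach the value observed k years earlier for the same company.

    `records` is a list of dicts with "company" and "year" keys. The lag is
    None when the firm has no row exactly k years back, so gaps in an
    unbalanced panel are not bridged.
    """
    lookup = {(r["company"], r["year"]): r[value_key] for r in records}
    lag_key = f"L{k}_{value_key}"
    return [
        {**r, lag_key: lookup.get((r["company"], r["year"] - k))}
        for r in records
    ]

# Made-up values for one hypothetical firm with a missing 2018 row.
records = [
    {"company": "FIRM A", "year": 2016, "sales": 100.0},
    {"company": "FIRM A", "year": 2017, "sales": 120.0},
    {"company": "FIRM A", "year": 2019, "sales": 150.0},
]
lagged = add_lag(records, "sales", k=1)
print([r["L1_sales"] for r in lagged])  # [None, 100.0, None]
```

Looking up the exact prior year, not the previous row, mirrors how Stata's lag operator treats gaps after `xtset`. Whether the deposited variables were built that way is not stated.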

regionbinary

f11704490 Location:

Summary Statistics: Min. 0.0; Mean 0.4811753902662993; StDev 0.49987507216919497; Valid 1089.0; Max. 1.0

Variable Format: numeric

Notes: UNF:6:tFZxVqk01JlNbdByOnnn2Q==

number of emloyees in different region

f11704490 Location:

Summary Statistics: Min. 0.0; Mean 4442.662597757535; Max. 108000.0; Valid 1069.0; StDev 11444.215389928877

Variable Format: numeric

Notes: UNF:6:quPKkVl+XMDVlCbb4db1pQ==

profits in different region

f11704490 Location:

Summary Statistics: Mean 85.22566935941475; Max. 15500.8115234375; Valid 1086.0; Min. -1677.91650390625; StDev 551.220601067174

Variable Format: numeric

Notes: UNF:6:FmTpGppyWP8SoD31V1QmtA==

capital expenditure in different region

f11704490 Location:

Summary Statistics: Min. 0.0; StDev 148.7297764846374; Max. 1314.9805908203125; Mean 64.66149715167502; Valid 1064.0

Variable Format: numeric

Notes: UNF:6:iBUkuGHkhPk3s0pPH4/AdQ==

L1_lnsales

f11704490 Location:

Summary Statistics: Valid 835.0; StDev 2.252970765681021; Mean 6.498641828175431; Min. -4.894988059997559; Max. 10.834449768066406

Variable Format: numeric

Notes: UNF:6:37G7enBy82YhAVcz50BV+Q==

L2_lnsales

f11704490 Location:

Summary Statistics: Min. -4.894988059997559; StDev 2.2378100642699694; Mean 6.5434374619744435; Valid 675.0; Max. 10.834449768066406

Variable Format: numeric

Notes: UNF:6:2kNcKwONTpU9wvQMYWLXeA==

L1_lnprofits

f11704490 Location:

Summary Statistics: Max. 9.64864730834961; StDev 1.5145204368222853; Min. -1.5425739288330078; Valid 580.0; Mean 5.464717162162837

Variable Format: numeric

Notes: UNF:6:mJdOYqIecsXt0EEBgc2w1Q==

L2_lnprofits

f11704490 Location:

Summary Statistics: Valid 490.0; StDev 1.5214965462493069; Max. 9.64864730834961; Min. -1.5425739288330078; Mean 5.432381509388892

Variable Format: numeric

Notes: UNF:6:ruaA6ddEP732ne2J6tpS5Q==

L1_lnRdinput

f11704490 Location:

Summary Statistics: Min. 3.0089404582977295; Mean 4.892809818895445; Max. 8.949495315551758; Valid 857.0; StDev 1.143088964111449

Variable Format: numeric

Notes: UNF:6:MyfxoHxv14KKgLzUUYyziw==

L2_lnRdinput

f11704490 Location:

Summary Statistics: StDev 1.1566375002522662; Mean 4.8780648673588605; Valid 686.0; Min. 3.0513298511505127; Max. 8.949495315551758

Variable Format: numeric

Notes: UNF:6:0yTZh1UPi08Q4FQZvl21oQ==

treat

f11704490 Location:

Summary Statistics: Valid 1089.0; Mean 0.4811753902662993; Max. 1.0; StDev 0.49987507216919497; Min. 0.0

Variable Format: numeric

Notes: UNF:6:tFZxVqk01JlNbdByOnnn2Q==

post

f11704490 Location:

Summary Statistics: Max. 1.0; StDev 0.499104556577882; Mean 0.46648301193755765; Min. 0.0; Valid 1089.0

Variable Format: numeric

Notes: UNF:6:rWknOMgI8Jvg9NY0ZTRJWg==
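
The `treat` and `post` indicators are the two ingredients of a difference-in-differences (DID) comparison. `treat` has the same mean and UNF as `regionbinary`, so it appears to be a copy of the region indicator. The codebook does not say which region is coded 1. The sketch below computes the simplest 2x2 DID estimate from the four cell means. The numbers are made up, the outcome name `y` is a placeholder (for example, log of RDinput), and a regression with firm and year effects would be the more usual specification.

```python
from statistics import mean

def did_2x2(rows, outcome="y"):
    """Difference-in-differences from the four treat-by-post cell means."""
    def cell(treat, post):
        return mean(r[outcome] for r in rows if r["treat"] == treat and r["post"] == post)
    # (treated after - treated before) minus (control after - control before)
    return (cell(1, 1) - cell(1, 0)) - (cell(0, 1) - cell(0, 0))

# Made-up outcomes: two observations in each of the four cells.
rows = [
    {"treat": 0, "post": 0, "y": 4.0}, {"treat": 0, "post": 0, "y": 5.0},
    {"treat": 0, "post": 1, "y": 5.0}, {"treat": 0, "post": 1, "y": 6.0},
    {"treat": 1, "post": 0, "y": 4.0}, {"treat": 1, "post": 0, "y": 4.0},
    {"treat": 1, "post": 1, "y": 6.0}, {"treat": 1, "post": 1, "y": 7.0},
]
print(did_2x2(rows))  # 1.5
```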

high_profit

f11704490 Location:

Summary Statistics: Max. 0.0; StDev 0.0; Valid 1089.0; Min. 0.0; Mean 0.0

Variable Format: numeric

Notes: UNF:6:Ifi7Wxx5xj0lVB6/aOBNlA==

high_capex

f11704490 Location:

Summary Statistics: Valid 1089.0; Mean 0.0; StDev 0.0; Min. 0.0; Max. 0.0

Variable Format: numeric

Notes: UNF:6:Ifi7Wxx5xj0lVB6/aOBNlA==

high_emp

f11704490 Location:

Summary Statistics: Mean 0.0; StDev 0.0; Min. 0.0; Max. 0.0; Valid 1089.0

Variable Format: numeric

Notes: UNF:6:Ifi7Wxx5xj0lVB6/aOBNlA==