Sample Data and Replication Code for: Mega or Micro? Influencer Selection Using Follower Elasticity (doi:10.7910/DVN/XBKRZL)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Sample Data and Replication Code for: Mega or Micro? Influencer Selection Using Follower Elasticity

Identification Number:

doi:10.7910/DVN/XBKRZL

Distributor:

Harvard Dataverse

Date of Distribution:

2023-09-21

Version:

4

Bibliographic Citation:

Tian, Zijun; Dew, Ryan; Iyengar, Raghu, 2023, "Sample Data and Replication Code for: Mega or Micro? Influencer Selection Using Follower Elasticity", https://doi.org/10.7910/DVN/XBKRZL, Harvard Dataverse, V4, UNF:6:3O8ufwM/M+usLsXMguvYkQ== [fileUNF]

Study Description

Citation

Title:

Sample Data and Replication Code for: Mega or Micro? Influencer Selection Using Follower Elasticity

Identification Number:

doi:10.7910/DVN/XBKRZL

Identification Number:

JMR-22-048

Authoring Entity:

Tian, Zijun (Washington University in St. Louis)

Dew, Ryan (University of Pennsylvania)

Iyengar, Raghu (University of Pennsylvania)

Distributor:

Harvard Dataverse

Access Authority:

Tian, Zijun

Depositor:

Tian, Zijun

Date of Deposit:

2023-09-21

Holdings Information:

https://doi.org/10.7910/DVN/XBKRZL

Study Scope

Keywords:

Business and Management

Abstract:

In the sample data folder, we provide a small sample of hashtags we collected from TikTok Discover page and some videos under them. In the code folder, we show how we 1) Extracted multi-modal video features from the original videos and save them into a local database (under database/generate) from which we generated the training and test data for the SVAE model (under database/output) 2) Train the SVAE model to get a 256-D latent vector representation for each video based on the learned feature weights (under SVAE) 3) Combine the content representation in the above step with other video covariates (under video_info) as the input for our causal inference (under DeepIV) 4) Estimate the DeepIV model to obtain the average and heterogeneous treatment effects (under DeepIV/treatment_effects) Finally, supplementary plots and tests are provided under DeepIV/distribution_plots and mis.

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

File Description--f7593948

File: ab_d3.tab

  • Number of cases: 518305

  • No. of variables per record: 6

  • Type of File: text/tab-separated-values

Notes:

UNF:6:QeOGfxYPXCemoW70rI34PA==

File Description--f7593952

File: covariates.tab

  • Number of cases: 518303

  • No. of variables per record: 18

  • Type of File: text/tab-separated-values

Notes:

UNF:6:QzWzNSu/MVEZknwjv+0+gg==

Variable Description

List of Variables:

Variables

hashtag id

f7593948 Location:

Summary Statistics: Mean 114.0622085455345; Valid 518305.0; StDev 62.52055026119633; Max. 216.0; Min. 1.0

Variable Format: numeric

Notes: UNF:6:Odu1iLxCMEzMSugtkZK8aQ==

id

f7593948 Location:

Summary Statistics: Mean 6.8948718260483963E18; Valid 518305.0; Max. 6.95E18; StDev 2.22298624151671968E17; Min. 6522621.0;

Variable Format: numeric

Notes: UNF:6:+yduu9t4kc8Vndc0mazRcQ==

a

f7593948 Location:

Summary Statistics: Mean 96200.15289066482; StDev 366399.2074602975; Valid 518305.0; Max. 6.143086573E7; Min. 2.09E-18

Variable Format: numeric

Notes: UNF:6:HYOJWD2k3RzJjPS1N2qQuA==

b

f7593948 Location:

Summary Statistics: Valid 518305.0; Max. 5.381870649E8; StDev 1805541.7346174207; Min. 1.18E-15; Mean 363923.0031050988

Variable Format: numeric

Notes: UNF:6:6rA5nrrBkrrhbuebfa/l/g==

loga

f7593948 Location:

Summary Statistics: Min. -40.71107202; Valid 518305.0; StDev 6.459787149767857; Max. 17.93342297; Mean 8.110315577498085

Variable Format: numeric

Notes: UNF:6:5UP5iTLKtGgEwTY3k7qFcA==

logb

f7593948 Location:

Summary Statistics: StDev 2.1358854811251873; Max. 20.10371676; Min. -34.37182692; Valid 518305.0; Mean 11.129220954396148;

Variable Format: numeric

Notes: UNF:6:PWA6PuUHJHOun+kfPQylrg==

hashtag id

f7593952 Location:

Summary Statistics: Max. 216.0; Min. 1.0; StDev 62.52052794828853; Valid 518303.0; Mean 114.06242101617521

Variable Format: numeric

Notes: UNF:6:jcWEXOdjRCxppZpo+vIHHQ==

id

f7593952 Location:

Summary Statistics: StDev 2.22299050261723968E17; Max. 6.95E18; Min. 6522621.0; Valid 518303.0; Mean 6.8948718834349445E18

Variable Format: numeric

Notes: UNF:6:UDIadGO6P4TyN+yR9ahmoA==

INIT_IMPRESSION

f7593952 Location:

Summary Statistics: Max. 20.11003828; StDev 2.151534985294515; Valid 518303.0; Mean 11.018817044879627; Min. 0.0

Variable Format: numeric

Notes: UNF:6:lt8voxHc5Qi29jQFdQivig==

DAY

f7593952 Location:

Summary Statistics: Max. 432.0; Valid 518303.0; StDev 33.87110738945055; Mean 10.125254918449977; Min. 1.0

Variable Format: numeric

Notes: UNF:6:m9hbx2XC5OYvY7aaoEwssw==

IF_TRENDING

f7593952 Location:

Summary Statistics: Mean 0.767086048122396; Valid 518303.0; Max. 1.0; StDev 0.4226882865790715; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:atKBx7Rzw/4zMygXcgztWw==

RANKING

f7593952 Location:

Summary Statistics: Mean 1029.0532545635117; Max. 2000.0; StDev 586.4950496721762; Min. 1.0; Valid 518303.0

Variable Format: numeric

Notes: UNF:6:jkC5FeYwSwMhILgniqkaaQ==

IF_TRENDING*RANKING

f7593952 Location:

Summary Statistics: Max. 2000.0; Valid 518303.0; Min. 0.0; StDev 611.2356540680988; Mean 693.6287229670666

Variable Format: numeric

Notes: UNF:6:L0Xpwyrx28BDazO5GQr1HA==

AGE

f7593952 Location:

Summary Statistics: StDev 102.94677282888703; Max. 2015.1474; Mean 35.21227749598238; Min. 0.0; Valid 518303.0

Variable Format: numeric

Notes: UNF:6:6dAERpz0XoXn38F+xSGbwg==

IF_TRENDING*AGE

f7593952 Location:

Summary Statistics: Valid 518303.0; Min. 0.0; StDev 101.42012039007119; Mean 32.32023150860192; Max. 2015.1474;

Variable Format: numeric

Notes: UNF:6:1+bvuwlNPCvx2rXAjy1smQ==

NUM_HASHTAG

f7593952 Location:

Summary Statistics: Mean 6.662498577087404; Min. 0.0; StDev 3.2748146161790417; Valid 518303.0; Max. 47.0

Variable Format: numeric

Notes: UNF:6:lcGByHVgTzCuT+QVe7w9Rg==

NUM_TRENDING

f7593952 Location:

Summary Statistics: Mean 1.0782457365672395; Valid 518303.0; Min. 0.0; StDev 0.8531250093101734; Max. 10.0;

Variable Format: numeric

Notes: UNF:6:r1dZc2N4FoSfsFIn5c5V8w==

IF_FYP

f7593952 Location:

Summary Statistics: Mean 0.6042160666635852; StDev 0.4890188880175061; Max. 1.0; Min. 0.0; Valid 518303.0;

Variable Format: numeric

Notes: UNF:6:22EmT3Lbm/W8OYuJZ/sP8Q==

FOLLOWING

f7593952 Location:

Summary Statistics: Min. 0.0; Valid 518303.0; Max. 4.0; Mean 2.168902776956292; StDev 0.7700576310594075

Variable Format: numeric

Notes: UNF:6:utYB0thFg6U5N3PohT8zbg==

FOLLOWER

f7593952 Location:

Summary Statistics: Min. 0.0; StDev 2.5554803178228345; Valid 518303.0; Mean 10.514357280779725; Max. 18.0922;

Variable Format: numeric

Notes: UNF:6:LjUIzLEZdCXhvi5FczTx0Q==

AVG_HEART

f7593952 Location:

Summary Statistics: Mean 8.697028061191993; Min. -1.8718; StDev 1.9439172472391835; Max. 17.0755; Valid 518303.0

Variable Format: numeric

Notes: UNF:6:Qk7+dUa1/BcuHNsw42R4jg==

VIDEO

f7593952 Location:

Summary Statistics: Min. 0.0; StDev 0.6563351457266645; Valid 518303.0; Mean 2.144580248184566; Max. 4.527629901;

Variable Format: numeric

Notes: UNF:6:cne7r1J2OxIfMhYzd2Zz3w==

loga

f7593952 Location:

Summary Statistics: StDev 6.459791557762852; Valid 518303.0; Max. 17.93342297; Mean 8.110295722221656; Min. -40.71107202

Variable Format: numeric

Notes: UNF:6:0jtNUnszERpO3gTYBa7tWA==

logb

f7593952 Location:

Summary Statistics: Min. -34.37182692; Mean 11.129197194986286; Max. 20.10371676; StDev 2.135853706959072; Valid 518303.0

Variable Format: numeric

Notes: UNF:6:RfwylOjy7oS5mi2ul30eqA==

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

database_addnewht.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_af.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_checkspeech.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_check_data.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_createhtpidgy.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_create_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_create_p2_1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_datagen_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_datagen_text_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_formater.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img_p2_1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img_toplist.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_label.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_list.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_list_all.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_list_toplist.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_musicid.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_numofscences.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_scenes.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_smile_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_smile_p2_1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_sticker.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_test.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_text.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_text1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_text_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_text_p2_1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_text_toplist.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_trainset_top.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_by_ht.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_by_ht_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_by_ht_partial.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_by_ht_partial_imgembed.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_v1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_v1_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_v2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_variance.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_variance_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_var_yamnet.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_videolen.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check_missing.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_addnewht.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_img.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_label.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_sticker.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_variance.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_var_yamnet.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_videolen.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_yamnet.py

Notes:

text/x-python

Other Study-Related Materials

Label:

combine_traintest_ht_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

database_traintest_by_ht_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

data_htdt100.py

Notes:

text/x-python

Other Study-Related Materials

Label:

comparedays_new.py

Notes:

text/x-python

Other Study-Related Materials

Label:

download_tt.py

Notes:

text/x-python

Other Study-Related Materials

Label:

download_tt1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

scrape_tt1.py

Notes:

text/x-python

Other Study-Related Materials

Label:

scrape_tt2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

density.py

Notes:

text/x-python

Other Study-Related Materials

Label:

joint.py

Notes:

text/x-python

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

deepiv.py

Notes:

text/x-python

Other Study-Related Materials

Label:

find_iv.py

Notes:

text/x-python

Other Study-Related Materials

Label:

infos_feature_new.csv

Notes:

text/csv

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

requirements (with Python 3.6).txt

Notes:

text/plain

Other Study-Related Materials

Label:

age.py

Notes:

text/x-python

Other Study-Related Materials

Label:

authorcount.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check.py

Notes:

text/x-python

Other Study-Related Materials

Label:

checkuser.py

Notes:

text/x-python

Other Study-Related Materials

Label:

checkusername.py

Notes:

text/x-python

Other Study-Related Materials

Label:

checkusers.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check_download_hts.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check_missing.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check_missing_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

check_rank.py

Notes:

text/x-python

Other Study-Related Materials

Label:

combine_newlyaddeddata.py

Notes:

text/x-python

Other Study-Related Materials

Label:

compare.py

Notes:

text/x-python

Other Study-Related Materials

Label:

comparedays.py

Notes:

text/x-python

Other Study-Related Materials

Label:

createtimeandvideolen.py

Notes:

text/x-python

Other Study-Related Materials

Label:

first100ids.py

Notes:

text/x-python

Other Study-Related Materials

Label:

generate_random_p2.py

Notes:

text/x-python

Other Study-Related Materials

Label:

generate_topids.py

Notes:

text/x-python

Other Study-Related Materials

Label:

getallht.py

Notes:

text/x-python

Other Study-Related Materials

Label:

gettop100list.py

Notes:

text/x-python

Other Study-Related Materials

Label:

get_top100_video.py

Notes:

text/x-python

Other Study-Related Materials

Label:

newlyadded.py

Notes:

text/x-python

Other Study-Related Materials

Label:

newlyaddeddata.py

Notes:

text/x-python

Other Study-Related Materials

Label:

pct.py

Notes:

text/x-python

Other Study-Related Materials

Label:

tsneforht.py

Notes:

text/x-python

Other Study-Related Materials

Label:

verified.py

Notes:

text/x-python

Other Study-Related Materials

Label:

wordseg.py

Notes:

text/x-python

Other Study-Related Materials

Label:

6858616085319634178.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6860229816164142341.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6879428180785024261.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6885018462654090502.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6911679265746603270.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6914380541106212101.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6918069687955819781.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6919990947543616769.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6921306091712269570.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6922160309104823558.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6924931055279492357.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6925424277386841349.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6925821712277835013.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6927659175610731778.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6928025573897899269.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6928049316053454085.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6930095438179650821.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6930627608098966790.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6931187713378946310.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

6932558850792951045.mp4

Notes:

video/mp4

Other Study-Related Materials

Label:

dtforvideo.py

Notes:

text/x-python

Other Study-Related Materials

Label:

dttomatrix.py

Notes:

text/x-python

Other Study-Related Materials

Label:

featuresweights.py

Notes:

text/x-python

Other Study-Related Materials

Label:

mave_new_LSTM.py

Notes:

text/x-python

Other Study-Related Materials

Label:

nn_predictor.py

Notes:

text/x-python

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

infos.py

Notes:

text/x-python

Other Study-Related Materials

Label:

playcount.py

Notes:

text/x-python

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain