Replication Data for: "Data-driven decentralized breeding increases genetic gain in challenging crop production environments" (doi:10.7910/DVN/OEZGVP)

Document Description

Citation

Title:

Replication Data for: "Data-driven decentralized breeding increases genetic gain in challenging crop production environments"

Identification Number:

doi:10.7910/DVN/OEZGVP

Distributor:

Harvard Dataverse

Date of Distribution:

2020-05-12

Version:

1

Bibliographic Citation:

de Sousa, Kauê; van Etten, Jacob; Poland, Jesse; Fadda, Carlo; Jannink, Jean-Luc; Gebrehawaryat, Yosef; Lakew, Basazen Fantahun; Mengistu, Dejene K.; Pè, Mario Enrico; Solberg, Svein Øivind; Dell'Acqua, Matteo, 2020, "Replication Data for: "Data-driven decentralized breeding increases genetic gain in challenging crop production environments"", https://doi.org/10.7910/DVN/OEZGVP, Harvard Dataverse, V1, UNF:6:QUZ55x4U3JRGMDb7PNfHpQ== [fileUNF]

Study Description

Citation

Title:

Replication Data for: "Data-driven decentralized breeding increases genetic gain in challenging crop production environments"

Identification Number:

doi:10.7910/DVN/OEZGVP

Authoring Entity:

de Sousa, Kauê (Bioversity International)

van Etten, Jacob (Bioversity International)

Poland, Jesse (Kansas State University)

Fadda, Carlo (Bioversity International)

Jannink, Jean-Luc (Cornell University)

Gebrehawaryat, Yosef (Bioversity International)

Lakew, Basazen Fantahun (Scuola Superiore Sant'Anna)

Mengistu, Dejene K. (Bioversity International)

Pè, Mario Enrico (Scuola Superiore Sant'Anna)

Solberg, Svein Øivind (Inland Norway University)

Dell'Acqua, Matteo (Scuola Superiore Sant'Anna)

Other identifications and acknowledgements:

Kauê de Sousa

Producer:

Bioversity International

Grant Number:

202100-2817

Distributor:

Harvard Dataverse

Access Authority:

Alliance Data Management

Depositor:

Bioversity International

Date of Deposit:

2020-04-24

Holdings Information:

https://doi.org/10.7910/DVN/OEZGVP

Study Scope

Keywords:

Agricultural Sciences, ABIOTIC STRESS, BREEDING, CLIMATE CHANGE, BIODIVERSITY, PARTICIPATORY RESEARCH, PLANT BREEDING, TRITICUM DURUM, WHEAT, CGIAR Research Program on Climate Change, Agriculture and Food Security, Biodiversity for Food and Agriculture, Africa

Abstract:

We evaluated a panel of 400 fully genotyped wheat lines derived from genebank accessions in two managed fields in the Ethiopian highlands in 2012 and 2013, collecting phenotypic data and farmer evaluation data in this centralized trial. For the decentralized trial, we distributed a subset of 41 genotypes as packaged sets, each containing an incomplete block of three genotypes plus one commercial variety, following the “tricot” citizen science approach. We distributed these packages to 1,165 farmers, who planted them on their farms across three regions of Ethiopia. Analyzing data from the centralized and decentralized trials side by side, we evaluated whether our approach can increase genetic gain in marginal crop production environments, unlocking the full potential of genomics-assisted breeding. For the full replication workflow, please visit the GitHub repository (https://github.com/agrobioinfoservices/tricot-genomic).
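The package design described in the abstract (an incomplete block of three genotypes plus one commercial variety per farmer) can be illustrated with a minimal sketch. This is not the authors' allocation procedure: the function name `make_packages`, the placeholder genotype labels, and the use of simple random sampling are all assumptions made for illustration. Tricot allocations in practice typically also balance how often each genotype and each pair of genotypes appears, which this sketch does not attempt.

```python
import random


def make_packages(genotypes, check, n_farmers, block_size=3, seed=0):
    """Give each farmer `block_size` distinct genotypes plus the check variety.

    Hypothetical helper: uses plain random sampling without replacement,
    not a balanced incomplete block allocation.
    """
    rng = random.Random(seed)
    packages = []
    for _ in range(n_farmers):
        block = rng.sample(genotypes, block_size)  # three distinct genotypes
        packages.append(block + [check])           # commercial variety added last
    return packages


# Placeholder labels standing in for the 41 genotypes (not the real names).
genotypes = [f"G{i:02d}" for i in range(1, 42)]
packages = make_packages(genotypes, check="CHECK", n_farmers=1165)
```

Each element of `packages` is a list of four entries, one per plot, which is in line with the `plotid` variable ranging from 1 to 4 in the data file.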

Time Period:

2013-2016

Country:

Ethiopia

Geographic Coverage:

(Tigray, Oromya, Amhara)

Kind of Data:

Crop/Field data

Methodology and Processing

Sources Statement

Data Access

Notes:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (http://creativecommons.org/licenses/by-nc-sa/4.0/).

Other Study Description Materials

File Description--f3814233

File: 1_ReadmeFile_DecentralizedBreeding_DurumWheat.tab

  • Number of cases: 22

  • No. of variables per record: 2

  • Type of File: text/tab-separated-values

Notes:

UNF:6:ZuyH2yJpCF8ud3G6D+vWAQ==

File Description--f3814237

File: 2_DataDictionary_DecentralizedBreeding_DurumWheat.tab

  • Number of cases: 12

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:y254wVVusIk0Ow32A2+YGQ==

File Description--f3814236

File: 3_CodeList_DecentralizedBreeding_DurumWheat.tab

  • Number of cases: 3

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:RYc9qLyqHPgINQY75of98w==

File Description--f3814235

File: 4_DataFile_DecentralizedBreeding_DurumWheat.tab

  • Number of cases: 4655

  • No. of variables per record: 13

  • Type of File: text/tab-separated-values

Notes:

UNF:6:L/h4Lvs6xQlvC3ABwvJzbw==

File Description--f3814234

File: 5_AllFiles_DecentralizedBreeding_DurumWheat.tab

  • Number of cases: 13

  • No. of variables per record: 2

  • Type of File: text/tab-separated-values

Notes:

UNF:6:2lWsgqa471NeuuizR+VKeg==

Variable Description

List of Variables:

Variables

Datasetname

f3814233 Location:

Variable Format: character

Notes: UNF:6:nYA4wRjwAyX1VpcOq+1omw==

Replicationdatafor:"Data-drivendecentralizedbreedingincreasesgeneticgaininchallengingcropproductionenvironments"

f3814233 Location:

Variable Format: character

Notes: UNF:6:QdV/LlYggPJgxekHXQxVYQ==

DataLabel

f3814237 Location:

Variable Format: character

Notes: UNF:6:5m2Q42z9tWDrqUh8ffrmKg==

Description

f3814237 Location:

Variable Format: character

Notes: UNF:6:rh+dJg7HYeAUkndRyNpOaA==

Unitsorcodeused

f3814237 Location:

Variable Format: character

Notes: UNF:6:AYdL3VSmc7/K7zWxSyh85Q==

CodeList

f3814236 Location:

Variable Format: character

Notes: UNF:6:RMRz3OS7PZ7pc63aQrUbtQ==

Code

f3814236 Location:

Variable Format: character

Notes: UNF:6:hZZSl57L76B+d40nON37Qg==

CodeDescription

f3814236 Location:

Summary Statistics: Valid 0.0; Min. NaN; Max. NaN; Mean NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:PnB3/S9m1ongzuanz1s3vw==

id

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 1.0; Max. 11193.0; Mean 1263.9245972072576; StDev 2255.795526304691

Variable Format: numeric

Notes: UNF:6:XoatZqu76Rn3+NQfkaopvA==

genotype

f3814235 Location:

Variable Format: character

Notes: UNF:6:IO7Evx5Cmlf6xROOfX3Uww==

accession

f3814235 Location:

Variable Format: character

Notes: UNF:6:BEffMZMmZ0X8+Gl7bg2PPg==

plotid

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 1.0; Max. 4.0; Mean 2.5007518796992483; StDev 1.1182499227130824

Variable Format: numeric

Notes: UNF:6:zcCWH4ac5u8woEreTKM4Cw==

year

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 2013.0; Max. 2015.0; Mean 2014.1334049409238; StDev 0.6506344679879326

Variable Format: numeric

Notes: UNF:6:LHHpSiOEsotJh/jivnCURQ==

region

f3814235 Location:

Variable Format: character

Notes: UNF:6:vMU466nXALeHUTs6w4EMHw==

lon

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 38.582; Max. 39.28; Mean 39.023715359828145; StDev 0.19576063557314852

Variable Format: numeric

Notes: UNF:6:/M4Cur0pggz62It9A0yEvQ==

lat

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 8.721; Max. 13.663; Mean 11.343350375939842; StDev 1.732802352060311

Variable Format: numeric

Notes: UNF:6:bj/sD0x+/8oqiegu2UYMxg==

gygm

f3814235 Location:

Summary Statistics: Valid 4260.0; Min. 0.0; Max. 883.333333333333; Mean 249.9001814358391; StDev 173.44176171781268

Variable Format: numeric

Notes: UNF:6:RfOjVwbveCJptMDAWyKYNQ==

farmerrank

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 1.0; Max. 4.0; Mean 2.5001074113856068; StDev 1.1182501703790377

Variable Format: numeric

Notes: UNF:6:awAThiqww7MdYWg5vxojTg==
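The data file stores one row per plot, with `farmerrank` taking values from 1 to 4. Ranking models such as the Plackett-Luce model named in the script `02_PL_model.R` generally work on one ordering per ranker, so a reshaping step along the following lines may be useful. This is a sketch under two assumptions that the codebook does not confirm: that `id` identifies the farmer, and that a lower `farmerrank` means a more preferred genotype. The function name and the example rows are hypothetical.

```python
from collections import defaultdict


def orderings_by_farmer(rows):
    """Group long-format rows by `id` and order genotypes by ascending rank.

    Assumes `id` is the farmer identifier and rank 1 is the top choice;
    ties in rank are broken alphabetically by genotype label.
    """
    by_farmer = defaultdict(list)
    for row in rows:
        by_farmer[row["id"]].append((row["farmerrank"], row["genotype"]))
    return {fid: [g for _, g in sorted(pairs)] for fid, pairs in by_farmer.items()}


# Made-up rows for a single farmer, using the column names of the data file.
rows = [
    {"id": 1, "genotype": "A", "farmerrank": 2},
    {"id": 1, "genotype": "B", "farmerrank": 1},
    {"id": 1, "genotype": "C", "farmerrank": 4},
    {"id": 1, "genotype": "D", "farmerrank": 3},
]
orderings = orderings_by_farmer(rows)
```

With these example rows, `orderings[1]` lists the genotypes from rank 1 to rank 4.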

plantingdate

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 41487.0; Max. 42217.0; Mean 41901.37250268528; StDev 237.3538027763315

Variable Format: numeric

Notes: UNF:6:Rws5RwkHfQMfqKh3OrqHjw==
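The `plantingdate` values are stored as numbers between 41487 and 42217 rather than as calendar dates. One plausible reading, which is an assumption and not stated in this codebook, is that they are spreadsheet serial day numbers counted from 1899-12-30. Under that assumption the minimum and maximum map to 2013-08-01 and 2015-08-01, which agrees with the 2013 to 2015 range of the `year` variable. A minimal conversion sketch (the function name is hypothetical):

```python
from datetime import date, timedelta

# Assumed epoch: the day-zero convention used by common spreadsheet software.
SPREADSHEET_EPOCH = date(1899, 12, 30)


def serial_to_date(serial):
    """Convert a spreadsheet serial day number to a calendar date."""
    return SPREADSHEET_EPOCH + timedelta(days=int(serial))


earliest = serial_to_date(41487.0)  # minimum reported for plantingdate
latest = serial_to_date(42217.0)    # maximum reported for plantingdate
```

Checking a few converted dates against the `year` column of the same rows is a simple way to confirm or reject the assumed epoch before relying on it.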

plotsize

f3814235 Location:

Summary Statistics: Valid 4655.0; Min. 0.4; Max. 1.6; Mean 1.4242749731471527; StDev 0.29681627076232375

Variable Format: numeric

Notes: UNF:6:voQMFO2Pl9DjBEC6f1CSvA==

M

f3814235 Location:

Summary Statistics: Valid 0.0; Min. NaN; Max. NaN; Mean NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:Slm2EXboX4oPnru96c+Asw==

Datasetname

f3814234 Location:

Variable Format: character

Notes: UNF:6:EoX+DYc8vpch3giDeyRXCg==

Replicationdatafor:"Data-drivendecentralizedbreedingincreasesgeneticgaininchallengingcropproductionenvironments"

f3814234 Location:

Variable Format: character

Notes: UNF:6:nl0agI4pJfG6LzKQ41c7Vg==

Other Study-Related Materials

Label:

diversity.panel.data.gp.rda

Notes:

application/gzip

Other Study-Related Materials

Label:

genotypic.data.durum.wheat.rda

Notes:

application/x-rlang-transport

Other Study-Related Materials

Label:

genoytpic.data.rrBLUP.rda

Notes:

application/gzip

Other Study-Related Materials

Label:

01_add_climate_indices.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

01_add_climate_indices_session_info.txt

Notes:

text/plain

Other Study-Related Materials

Label:

02_PL_model.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

02_PL_model_session_info.txt

Notes:

text/plain

Other Study-Related Materials

Label:

GP_01_get_SNPs_BLUPs_H2.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

GP_02_characterize_environmental_diversity.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

GP_03_derive_log_abilities_by_groups.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

GP_04_perform_genomic_selection.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

GP_05_summarize_outputs_produce_plots.R

Notes:

type/x-r-syntax

Other Study-Related Materials

Label:

helper_00_functions.R

Notes:

type/x-r-syntax