Professional Documents
Culture Documents
Quincunx
Quincunx
of published polygenic scores (PGS): https://www.pgscatalog.org/. table with the same name as the class. Combination of variables indicated • pss_id • pss_id • pss_id
• pgs_name
in bold renders each row unique in each table. • sample_id • sample_id
• scoring_file • stage • variable
The PGS Catalog data provided by the REST API is organised around five
S4 class scores • matches_publication • sample_size • estimate_type
core entities: • reported_trait • sample_cases • estimate
• trait_additional_description • sample_controls • unit
scores samples demographics
PGS Polygenic Scores • pgs_method_name • sample_percent_male • variability_type
• pgs_id • pgs_id • pgs_id • pgs_method_params • phenotype_description • variability
PGP PGS Publications • pgs_name • sample_id • sample_id • n_variants • ancestry_category • interval_type
PSS PGS Sample Sets • scoring_file • stage • variable • n_variants_interactions • ancestry • interval_lower
PPM PGS Performance Metrics • matches_publication • sample_size • estimate_type • assembly • country • interval_upper
EFO EFO traits • reported_trait • sample_cases • estimate • license • ancestry_additional_description
• trait_additional_description • sample_controls • unit • beta_unit • study_id
• pgs_method_name • sample_percent_male • variability_type • pubmed_id
CC BY SA Ramiro Magno • Learn more at https://maialab.org/quincunx • quincunx version 0.0.1 • Updated: 2021-05-05
Other S4 Entities Cohorts, Samples and Sample Sets
Besides the five PGS Catalog entities, there are three other objects that Cohorts
quincunx
can be retrieved from the REST API: trait_categories, cohorts and A cohort is a group of individuals with a shared characteristic. Cohorts are
releases. identified in quincunx by the cohort_symbol variable. Polygenic scoring file
S4 class trait_categories
PGS scoring files are provided by the PGS Catalog to allow computation
trait_categories trait of polygenic scores by users. These files are hosted at the PGS Catalog
• trait_category • trait_category FTP server: http://ftp.ebi.ac.uk/pub/databases/spot/pgs/
• efo_id
(e.g., PROMIS / N=30,000) (e.g., LOLIPOP / N=30,000) (e.g., MHI / N=30,000) (e.g., UKB / N=500,000)
scores/. They are labelled by their respective PGS Score ID (e.g.
• trait Samples PGS000001.txt.gz). For more details please visit: https://
• description
• url
A sample is a group of participants associated with none, one or more maialab.org/quincunx/articles/pgs-scoring-file.html.
catalogued cohorts. The selection from a cohort can be either a subset or
S4 class cohorts its totality. Samples are not identified in PGS Catalog with a global unique File Format
identifier, but quincunx assigns a surrogate identifier (sample_id) to allow Each scoring file contains variant identification, effect alleles and
cohorts pgs_ids relations between tables. respective score weights. The file is formatted as a gzipped tab-delimited
• cohort_symbol • cohort_symbol Possible compositions of samples: text file, with a header containing brief metada about the score. You can
• cohort_name • pgs_id read PGS scoring files into R with read_scoring_file().
• stage
PGS000117.txt.gz
S4 class releases
1 ### PGS CATALOG SCORING FILE - see www.pgscatalog.org/downloads/#dl_ftp for...
Admixture of cohorts. A cohort subset. Another but disjoint subset. Not associated with 2 ## POLYGENIC SCORE (PGS) INFORMATION
releases 3x {pgs_ids, ppm_ids,pgp_ids}
(e.g., PGS000011 / S2 / (e.g., PSS000020 / S1 / MHI / N=862) (e.g., PSS000020 / S2 / MHI / N=2,333)
any cohort. 3 # PGS ID = PGS000117
PROMIS & LOLIPOP / N=8,653)
• date • date 4 # Reported Trait = Cardiovascular Disease
• n_pgs • {pgs_id, ppm_id, pgp_id} Sample Sets 5
6
# Original Genome Build = GRCh37
# Number of Variants = 267863
• n_ppm
• n_pgp A sample set is a group of samples used in a polygenic score evaluation. 7
8
## SOURCE INFORMATION
# PGP ID = PGP000054
• notes Each sample set is identified in the PGS Catalog by a unique sample set 9 # Citation = Elliott J et al. JAMA (2020). doi:10.1001/jama.2019.22241
identifier (PSS ID). 10
11
rsID chr_name chr_position effect_allele reference_allele effect_weight
rs11240779 1 808631 A G 0.00077622
CC BY SA Ramiro Magno • Learn more at https://maialab.org/quincunx • quincunx version 0.0.1 • Updated: 2021-05-05