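There is no consensus across journals on how many principal components (PCs) to use when correcting for population stratification, so I put together the following summary.
In practice the number of PCs is largely the authors' choice, and I am not joking: the published papers below show it.
For example, the following paper simply uses 10 PCs to correct for population stratification.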
Largest GWAS of PTSD (N=20070) yields genetic overlap with schizophrenia and sex differences in heritability
Another example uses the first 5 principal components to correct for population stratification.
Accounting for Population Stratification in Practice: A Comparison of the Main Strategies Dedicated to Genome-Wide Association Studies
Another example uses the first 3 principal components to correct for population stratification.
GWAS identifies novel SLE susceptibility genes and explains the association of the HLA region
Another example uses the first 2 principal components to correct for population stratification.
GWAS analysis of suicide attempt in schizophrenia: Main genetic effect and interaction with early life trauma
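A more data-driven alternative is to select principal components by statistical significance:
Compute the principal components with the EIGENSTRAT software.
Test whether each principal component is statistically significant.
Include the principal components with P < 0.05 as covariates in the population stratification correction.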
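As shown in the figure below, principal components 1 and 2 are significantly associated with population structure (P < 0.05), so they need to be included as covariates in the association analysis.
Reference for this approach: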
GWAS Identifies Novel Susceptibility Loci on 6p21.32 and 21q21.3 for Hepatocellular Carcinoma in Chronic Hepatitis B Virus Carriers
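
The significance-based selection described above can be sketched in a few lines of Python. This is only a rough illustration under stated assumptions, not the method of the cited paper: the text does not say which test is used to decide whether a principal component is significant, so the sketch assumes each component's scores are compared between cases and controls, and it swaps in a simple permutation test so that it runs with the standard library alone. The function names `permutation_p` and `select_pcs`, the toy scores, and the number of permutations are all hypothetical.

```python
import random
from statistics import mean


def permutation_p(scores, is_case, n_perm=2000, seed=0):
    """Two-sided permutation p-value for the case/control difference
    in mean PC score (an assumed stand-in for the unspecified test)."""
    rng = random.Random(seed)

    def abs_diff(labels):
        cases = [s for s, c in zip(scores, labels) if c]
        controls = [s for s, c in zip(scores, labels) if not c]
        return abs(mean(cases) - mean(controls))

    observed = abs_diff(is_case)
    labels = list(is_case)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # break any link between score and status
        if abs_diff(labels) >= observed - 1e-12:
            hits += 1
    # The +1 terms count the observed labelling itself, so p is never 0.
    return (hits + 1) / (n_perm + 1)


def select_pcs(pc_scores, is_case, alpha=0.05):
    """Return the names of PCs whose p-value is below alpha, in input order."""
    return [name for name, scores in pc_scores.items()
            if permutation_p(scores, is_case) < alpha]


# Hypothetical toy data: 10 cases followed by 10 controls.
is_case = [True] * 10 + [False] * 10
pc_scores = {
    # PC1 separates cases from controls completely.
    "PC1": [0.30 + 0.01 * i for i in range(10)]
           + [-0.30 - 0.01 * i for i in range(10)],
    # PC2 has the same distribution in both groups.
    "PC2": [0.1, -0.1] * 5 + [0.1, -0.1] * 5,
}

covariates = select_pcs(pc_scores, is_case)
print(covariates)
```

With this toy data only PC1 passes the threshold, so it alone would be carried forward as a covariate. In a real analysis the scores would come from the EIGENSTRAT output rather than being typed in by hand.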