Introduction

There is increasing epidemiological1,2,3 and genetic evidence4,5,6,7,8 of a shared genetic etiology between psychiatric disorders and, in particular, between schizophrenia (SCZ) and bipolar disorder (BP)4. Numerous genetic variants have been associated with susceptibility to schizophrenia. Many demonstrate overlapping susceptibility with bipolar disorder, including rare highly penetrant familial mutations (e.g. DISC1 gene9), numerous recurrent and de novo copy number variants (CNVs)10 and an increasing number of common single nucleotide polymorphisms (SNPs)4,5,6,11,12. To date, 12 genome-wide significant loci have been associated with SCZ and/or BP, all with odds ratios of less than 1.26,10. Taken together, evidence sugests genetic succeptibility to SCZ is highly heterogenous with multiple, common, polygenic variants contributing small effects.

The largest study to date, with genome-wide genotyping data on all samples, remains the phase I study of the PGC meta-analysis with 9,394 cases9. We carried out the largest single-phase meta-analysis of schizophrenia and bipolar cases, including data from 16 cohorts (Caucasian, African American and Asian cases and controls), to identify novel susceptibility loci.

Results

Following the meta-analysis of the combined mixed-ancestry schizophrenia, schizoaffective, and bipolar cohorts, 40 variants surpassed genome wide significance (P-values < 5 × 10−8) (Table 1). The 40 SNPs mapped to 6 loci, 5 of which had been previously associated with susceptibility to SCZ and/or BP (ITIH1, SDCCAG8, MHC, MAD1L1 and CSMD1; Figure 1). Two SNPs mapped to a novel locus containing a gene of unknown function, TSNARE1 (t-SNARE domain containing 1) on chr8q24.3 (top SNP rs10098073: fixed effects P 1.28 × 10−9 random effects P 1.28 × 10−9 OR 1.108; rs4129585 fixed effects P 2.38 × 10−8 random effects P 1.28 × 10−9 OR 1.108). In addition to the two genome-wide significant SNPs, multiple other SNPs in LD showed a trend towards association at the locus (Figure 2). Odds ratios for the most significantly associated SNP (rs10098073) across the 16 cohorts ranged from 1.003 to 1.258 (SD 0.06) with one outlier, the Dublin cohort, crossing 1 at OR 0.97. The genomic inflation factor of the meta-analysis was 1.08 indicating that population stratification had been adequately controlled. Analyzing the Caucasian cases and controls alone (n = 11,681 cases and 24,498 controls), the top TSNARE1 SNP, rs10098073, remains genome-wide significant (rs10098073 P-val 3.947 × 10−8 OR 1.12; rs4129585 P-val 1.061 × 10−7 OR 1.17).

Table 1 Genome wide significant variants following SCZ meta analysis
Figure 1: Manhattan plot of the SCZ-BP meta-analysis.
figure 1

The red dotted line indicates genome wide significance threshold of 5 × 10−8. Loci highlighted in red surpassed the genome-wide significance threshold in this study.

Figure 2
figure 2

TSNARE1 regional association plot.

While variants at TSNARE1 have not reached genome-wide significance in previous analyses, rs10098073 showed a consistent trend towards association with schizophrenia12. For the PGC discovery and replication analyses, the P-value for rs10098073 was 2.09 × 10−5; OR 1.077; Analysis of the same SNP in the CLOZUK10 study produced a P-value of 0.0165; OR 1.101. After combining the PGC and CLOZUK studies, analysis of rs10098073 resulted in P 1.19 × 10−6; OR 1.101. The CLOZUK sample (n = 2652) has not been included in any other GWAS apart from the primary study12. There are no overlapping samples between the CLOZUK study and our sample set and therefore the CLOZUK sample can serve as an interdependent replication of the association.

Discussion

The function of the TSNARE1 gene remains unknown. A recent publication suggests it may have evolved within the vertebrate lineage from the harbinger transposon superfamily13. Bioinformatic predictions based on phylogenetic ancestry indicate it may bind SNARE (soluble N-ethylmaleimide-sensitive factor attached protein receptor) proteins and have SNAP receptor activity. TSNARE1 may therefore have a vertebrate-specific function in intracellular protein transport and synaptic vesicle exocytosis.

Taken together, our meta-analysis confirmed 5 genome-wide significant loci previously reported and identified a novel sixth locus that has not previously been shown to associate with schizophrenia at genome-wide significance. The locus spans TSNARE1, a gene of unknown function that is predicted to have a vertebrate-specific function in synaptic vesicle exocytosis.

Methods

The study included 13,394 schizophrenia and bipolar cases and 34,676 controls (detailed in Supplementary table 1). Of these, 3,182 schizophrenia (of which 377 were classified as schizoaffective) cases and 1,032 bipolar I cases were collected from 28 clinical trials conducted by Janssen Research & Development, LLC (detailed description in Supplementary methods). These samples were matched to 15,277 and 8,000 controls, respectively, from the biorepository at the Center for Applied Genomics (CAG) of the Children's Hospital of Philadelphia (CHOP). All controls were recruited at CHOP and had no diagnosis or family history of psychiatric disease based on their medical record. All Janssen cases were genotyped on the Illumina 1M-DuoV3, while CHOP controls were analyzed using either the Illumina HumanHap550 or 610 Quad arrays.

In addition, 1,157 cases meeting DSM-IV-TR criteria for schizophrenia or schizoaffective disorder from the Center for Applied Genomics (CAG) at The Children's Hospital of Philadelphia and the Department of Psychiatry at the University of Pennsylvania, School of Medicine14 and 2,107 controls from the biorepository at CAG were also included in the analysis. Samples were genotyped on the Affymetrix 6.0 array at The Children's Hospital of Philadelphia (CHOP) as previously described14.

The remaining 8,023 schizophrenia cases and 9,292 controls were part of the Schizophrenia Psychiatric Genome-Wide Association Study Consortium (PGC), as previously described11, and were obtained from the NIMH as schizophrenia distribution 9 (https://www.nimhgenetics.org/).

As different cohorts were genotyped on different platforms, we imputed genotypes based on HapMap 3 reference panel for all samples prior to meta-analysis to allow for direct comparison of variants across the entire cohort. Both fixed and random effect meta-analyses were then carried out. Detailed methods are presented in the supplementary materials.