Introduction

DYX1C1 is a recently identified candidate gene for dyslexia. Disruption of the gene by a translocation was detected in dyslexia patients (Taipale et al. 2003; McGrath et al. 2006). The DYX1C1 gene maps to chromosome 15q21 and consists of 10 exons dispersed over about 78 kb of genomic DNA. The protein sequence encoded by DYX1C1 is 420 aa in length and the protein sequences of the nonhuman primate homolog are 98.6–99.5% similar to that of humans (Taipale et al. 2003). DYX1C1 is expressed broadly in several adult tissues including lung, kidney, and brain (Taipale et al. 2003; Wang et al. 2006), where its protein is localized in white matter including the nuclei of cortical neurons and glial cells (Taipale et al. 2003; Fisher and Francks 2006). Eight single nucleotide polymorphisms (SNPs) in the DYXICI genes of 20 patients with reading disabilities were detected (Taipale et al. 2003; McGrath et al. 2006). The 1249G-to-T and -3G-to-A mutations were among the SNPs found more frequently in patients with dyslexia than in controls (Fisher and Francks 2006). Developmental dyslexia is associated with migration anomalies in the neocortex as well genetic susceptibility (Pennington and Smith 1983; Galaburda et al. 1985; Smith et al. 1998; Chang et al. 2005; Sokol et al. 2006). DYX1C1 plays a role in the migration of neocortical neurons. Specifically it is required for the transition of the multipolar stage of migration (Wang et al. 2006).

The identification of cancer biomarkers has been a major focus of investigation in the field of cancer research. However, effective and reproducible cancer biomarker developments are still far off and enormous efforts are needed in this area (Brinkman 2004). In cancerous cells, splicing mechanisms are significantly altered by defects in splice sites caused by frequent point mutations (Venables 2004; Skotheim and Nees 2007). Aberrant neuronal-specific splicing of amphiphysin 2 in non-neuronal cells may contribute to the progression human melanoma (Ge et al. 1999). Expression levels of alternatively spliced variants of survivin are not equal to each other in malignant breast tissues. In the case of breast cancer, different survivin transcripts show different functions in the progression of apoptosis (Ryan et al. 2005; Pajares et al. 2007). Our study focused on the application of cancer biomarkers using transcription variants. The DYX1C1 gene gives rise to three alternatively spliced transcripts according to the GenBank database. Among them, one is associated with an HERV-H LTR element that could affect the protein-coding region. The other transcripts are not associated with any repeat sequences. Thus, their expression levels were investigated by RT-PCR amplification in various human tissues including matched pairs of normal/tumor tissue samples derived from tumor patients.

Materials and methods

Sample preparation

RNA samples from normal human tissues (adrenal gland, bone marrow, cerebellum, adult brain, fetal brain, fetal liver, heart, kidney, liver, lung, placenta, prostate, salivary gland, skeletal muscle, spinal cord, testis, thymus, thyroid, trachea, and uterus) were purchased from Roche. Matched pairs of adjacent normal/tumor tissues (2 colon, liver, uterus, breast, and stomach) were purchased from Ambion, and normal/tumor frozen tissue samples of colorectal adenoma patients are obtained from the Department of Internal Medicine, College of Medicine Pusan National University. Pure mRNA was extracted using the PolyA Tract mRNA isolation system (Promega).

RT-PCR amplification and sequencing

The DYX1C1 gene transcripts were analyzed by RT-PCR amplification. M-MLV reverse transcriptase with an annealing temperature of 42°C was used for reverse transcription reactions with an RNase inhibiter (Promega). The transcript associated with an HERV-H LTR was amplified by the primer pairs S1 (5′-AGG CAC GAA GAG CAA TGA AT-3′) and AS2 (5′-GTG GGG CCG TTT TAT AGG AT-3′), designed using sequence information obtained from GenBank (accession no. NM_001033560). The other two transcripts were amplified by the primer pairs S2 (5′-GAC ATA GCT GAA CTT TGC GAT TT-3′) and AS2 (5′-GAT TTA TAA TAT TTT GCC CTC AAC AG-3′), designed using sequence information from GenBank (accession no. NM_001033559 and NM_130810). For the semi-quantitative RT-PCR analysis, three independent experiments were performed. PCR conditions were as follows: after the initial step of 94°C for 4 min, 30/32/34 cycles of 95°C for 40 s, 55°C for 40 s, and 72°C for 90 s were performed. The primer locations are indicated in Fig. 1. As a standard control, G3PDH was amplified by the primer pairs GPH-S (5′-GAG CCC CAG CCT TCT CCA TG-3′) and GPH-AS (5′-GAA ATC CCA TCA CCA TCT TCC AGG-3′) from human G3PDH (GenBank accession no. NM_002046). These control PCR experiments were also conducted three times using the following conditions: after the initial step of 94°C for 4 min, 30/32/35 cycles of 95°C for 40 s, 55°C for 40 s, and 72°C for 40 s were performed.

Fig. 1
figure 1

Spliced variants of the DYX1C1 gene and their structural analysis The DYX1C1 gene is shown to have three alternatively spliced transcripts in the GenBank database. V1, V2, and V3 are the originally reported transcripts, and V1-1, V1-2, V1-3, and V1-4 are newly identified alternative transcripts

Analysis of RT-PCR products

Differentially expressed PCR products were analyzed using the Image J (1.36b) [http://rsb.info.nih.gov/ij/] program to compare the expression levels of each transcript. Image J program can calculate area and pixel value statistics of user-defined selections. The intensity of three alternative transcript including new transcripts were evaluated numerically to compare easily. G3PDH was used as the control.

Results and discussion

Identification and characterization of DYX1C1 alternative transcripts

From the GenBank database analysis, three alternatively spliced isoforms of the DYX1C1 gene were identified (Fig. 1). One of the transcripts is associated with an HERV-H LTR element (NM_001033560). The antisense-oriented HERV-H LTR element is located in the last exon of the DYX1C1 gene. An additional coding region and poly-A signal are provided by the HERV-H LTR element. The alternatively spliced transcripts were ubiquitously expressed in several human tissues, and four additional alternative transcripts (V1-1, V1-2, V1-3, and V1-4) were identified using a RT-PCR approach (Fig. 2). The four newly identified transcripts are associated with the HERV-H LTR located in the last exon. New transcripts (V1-1, V1-2, V1-3, and V1-4) acquired the novel exons or lost the original exon from intronic sequences. V1-1 acquired one exon (9′) from intronic sequences compared to V1-2. V1-3 was generated by the deletion of the original eighth exon and the acquisition of a new exon (7′) from intronic sequences. The shortest alternative transcript, V1-4 was generated by excision of the eighth exon due to the selection of alternative splice sites.

Fig. 2
figure 2

Expression patterns of HERV-H LTR-associated (a) and -unassociated (b) DYX1C1 transcripts in normal human tissues. G3PDH indicates the positive control

One of the most abundant HERV families, HERV-H is distributed throughout the human genome with 100 copies of both the full-length element and solitary LTRs (Sin et al. 2006, 2007). HERV elements have commonly been shown to be responsible for tissue-specific expression by providing promoter or enhancer elements, and other HERVs have appeared to harbor different polyA signals (AATAA) that may alter transcript polyadenylation (Wentzensen et al. 2007). The solitary LTR from HERV-H in DYX1C1 has the potential to regulate its expression pattern by providing a polyA signal (Nelson et al. 1996; de Parseval et al. 1999). NADSYN1 (NAD synthetase) has been shown to have several new alternative transcripts resulting from HERV-H LTR-derived polyA signals and these transcripts are expressed in normal human tissues (Sin et al. 2007).

Application of DYX1C1 alternatively spliced transcripts as cancer biomarkers

HERV-H elements are reported to be strongly expressed in testicular, lung and gastrointestinal cancers (Wentzensen et al. 2007; Wilkinson et al. 1990; Hirose et al. 1993). The accumulation of mutations in HERV sequences that result in cryptic splice sites changes the function of proteins by modifying the expression of transcripts in cancerous tissues. Therefore, we performed semi-quantitative RT-PCR analysis using five normal and adjacent tumor tissue sets (Figs. 3, 4). Among five splice variants (V1, V1-1, V1-2, V1-3 and V1-4), V1 was the predominant transcript in the five normal and tumor tissue samples. However, we could not compare the expression pattern of V1-1 with V1-2 because there is difference in size of only 20 base pairs between them. Therefore, we could not discriminate between the individual transcripts. Intriguingly, RT-PCR analysis of V2 and V3 transcripts show specific expression patterns. Although V3 was generally more strongly expressed than V2 in normal and adjacent tumor tissue sets, their expression levels were different between normal and tumor tissue samples (Fig. 4a, b). Moreover, additional five adjacent normal/tumor colon cancer samples were applied for the confirmation of differential expression pattern. Although, two samples did not show the positive result, three samples showed differential expression pattern in tumor tissues (the line 10 showed weak band of V2 and V3 transcript) (Fig. 4c). For the scarcity of patient adjacent normal/tumor samples, we could not analyze many cases of patient samples. However, V2 and V3 transcript of DYX1C1 could be a good biomarker for colon tumor diagnosis.

Fig. 3
figure 3

Comparative analysis of HERV-H LTR-associated DYX1C1 transcripts (V1, V1-1, V1-2, V1-3, V1-4) in normal and tumor tissue samples. G3PDH indicates the positive control. a Lanes: M size marker, 1 colon (T), 2 colon (N), 3 liver (T), 4 liver (N), 5 uterus (T), 6 uterus (N), 7 breast (T), 8 breast (N), 9 stomach (T), 10 stomach (N). b RT-PCR was conducted three times and G3PDH was used as a control. PCR products were analyzed quantitatively using the Image-J program. The X-axis of the bar graph indicates normal (N)/tumor (T) tissue samples. Lanes: M size marker, 1 colon (T), 2 colon (N), 3 liver (T), 4 liver (N), 5 uterus (T), 6 uterus (N), 7 breast (T), 8 breast (N), 9 stomach (T), 10 stomach (N) and the Y-axis of the bar graph indicates the relative expression levels of the 5 transcripts (V1, V1-1, V1-2, V1-3, and V1-4)

Fig. 4
figure 4

Comparative analysis of the other transcripts (V2, V3) in normal (N) and tumor (T) samples. G3PDH indicates the positive control. a Lanes: M size marker, 1 colon (T), 2 colon (N), 3 liver (T), 4 liver (N), 5 uterus (T), 6 uterus (N), 7 breast (T), 8 breast (N), 9 stomach (T), 10 stomach (N). b RT-PCR was conducted three times and G3PDH was used as a control. PCR products were analyzed quantitatively using the Image-J program. The X-axis of the bar graph indicates normal (N)/tumor (T) tissue samples. Lanes: M size marker, 1 colon (T), 2 colon (N), 3 liver (T), 4 liver (N), 5 uterus (T), 6 uterus (N), 7 breast (T), 8 breast (N), 9 stomach (T), 10 stomach (N) and the Y-axis of the bar graph indicates the relative expression levels of 2 transcripts (V2 and V3). P < 0.05 in Student’s t-test is indicated by *. c Lanes: M size marker, 1 colon (N), 2 colon (T), 3 colon (N), 4 colon (T), 5 colon (N), 6 colon (T), 7 colon (N), 8 colon (T), 9 colon (N), 10 colon (T). Lane 10 showed the weak expression pattern of V2 and V3 transcript. Their expression pattern showed only presence/absence status of transcript, therefore, we did not analyze quantitatively using the Image-J program

The expression levels of DYX1C1 transcripts seem to be altered by the progression of tumors specifically in colorectal tissue. The DYX1C1 transcript could be useful as a molecular marker to detect colorectal cancer (Fig. 4). A typical model of splice variant-derived variations in expression patterns between normal and tumor tissues is found with the Survivin gene. It is strongly expressed in cancer tissues. However, it could not be detected in normal tissues, indicating a potential role in tumorigenesis (Li 2005). Alternative splice variants of survivin show different expression patterns during tumor progression (Pajares et al. 2007; Li 2005; Mahotka et al. 1999; Badran et al. 2004; Krieg et al. 2002; Meng et al. 2004).

Taken together, alternative transcripts of DYX1C1 could be applied as cancer biomarkers to detect colorectal cancer. The different proteins derived from the V2 and V3 transcripts need to be analyzed further in studies using various colorectal normal and tumor tissue samples.

Conclusion

Our results indicated that alternatively spliced transcript variants of the DYX1C1 gene could be used as cancer biomarkers to detect colorectal cancer using a quantitative expression approach.