Introduction

Rice (Oryza sativa L.) is the main food crop and source of calorie intake for one third of the world’s population. Although rice is often considered as an aquatic crop, more than 45% of the rice-growing area is not permanently flooded (International Rice Research Institute 2002). Drought is a major factor involved in yield instability in both the upland and rainfed lowland ecosystems (Widawski and O’Toole 1990). Drought resistance is therefore an important selection criterion in such ecosystems. Many traits are said to contribute to drought resistance (Ludlow and Muchow 1990; Fukai and Cooper 1995), but none is effective in all agronomic situations. Avoidance strategies that enable a plant to maintain its water status during water stress appear to be a reasonable option in a broad range of agronomic situations. A deep root system able to extract water at depth and respond to evaporative demand, provided there is water in the profile, is the most consensual of the traits contributing to drought avoidance at least in upland conditions (Yoshida and Hasegawa 1982; Nguyen et al. 1997; Price et al. 2002). In addition to its role in buffering drought effects, the root system plays an important role in anchoring the plant, capturing mineral resources, and in early vigor and competitiveness with weeds. It is also the site of major hormone biosynthesis.

The development of the root system is controlled by constitutive intrinsic pathways that determine its global architecture as a function of the genetic potential of the accessions for root development and by response pathways that modulate its development in response to stress (water stress, submergence, nutrient deficiencies, etc.) and contribute to root developmental plasticity (Malamy 2005).

Descriptors of root development generally include maximum root length (MRL), which determines the baseline for access to nutrient and water resources, and root number (RN), root thickness (RTHK), and root length density, which determine the intensity of colonization of the soil profile and the ability to exploit soil resources within the baseline. Root length density is very strongly correlated with root dry weight (RDW), a parameter that is easier to measure (Yadav et al. 1997). Root dry weight has frequently been subdivided according to the layers where the roots were collected, with special attention paid to the deepest roots through parameters such as deep root weight (DRW) or the deep root weight to total root weight ratio (DRR). The thickness of the roots is strongly correlated with the diameter of the xylem vessels, and thick roots contribute to better water flux with less risk of cavitation (Yambao et al. 1992). In rice, root emission is synchronized with tiller emission following the phyllochron model (Nemoto et al. 1995). Allometric ratios such as root-to-shoot weight ratio (R/S), root dry weight per tiller, (R/T), deep root to shoot weight (DR/S), or deep root weight per tiller (DR/T) are used to describe the coordination between growth and development of the roots and shoots. A more precise description of the morphology and anatomy of the rice root system at tissue level has recently been provided (Rebouillat et al. 2008).

In O. sativa, significant genetic variation has been observed in root number, diameter, depth, branching, vertical density distribution, root-to-shoot ratio, water extraction, and root penetration (O’Toole and Bland 1987; Lafitte et al. 2001). The pattern of variation reflects the organization of O. sativa in six isozymic groups as described by Glaszmann (1987). The major differentiation is between isozymic group 1 (indicas), characterized by a superficial, thin, highly branched root system with a low R/S ratio, and isozymic group 6 (japonicas) characterized by a deep, thick, less branched root system with a high R/S ratio. Most of the diversity is distributed between groups rather than within groups, with the exception of isozymic groups 2 and 6 which have a substructure linked to adaptation to the ecosystem, e.g., temperate irrigated versus tropical upland accessions in group 6 and irrigated boro versus upland aus accessions in group 2 (Courtois et al. 1996; Lafitte et al. 2001).

Following the development of molecular genetic approaches, the genetic architecture of the rice root system has been extensively studied by teams interested in improving drought resistance in rice. At least 24 papers have been published in the last 10 years on quantitative trait locus (QTL) detection for a large number of root parameters in several populations and different growth conditions (Champoux et al. 1995; Ray et al. 1996; Redona and Mackill 1996; Yadav et al. 1997; Price et al. 1999, 2000, 2002; Ali et al. 2000; Hemamalini et al. 2000; Tripathy et al. 2000; Zheng et al. 2000, 2003; Shen et al. 2001; Zhang et al. 2001a, b; Kamoshita et al. 2002a, b; Courtois et al. 2003; Nguyen et al. 2004; Xu et al. 2004; Li et al. 2005b; Horii et al. 2006; Yue et al. 2006; MacMillan et al. 2006).

These studies led to marker-aided selection, enabling validation of the position and effect of some of the QTLs (Shen et al. 2001; Steele et al. 2006) and to on-going QTL cloning work (Ahmadi et al., unpublished results).

Most papers mentioned above included a comparison of QTL positions across studies. Such comparisons are informative, but tedious to undertake at a large scale. To enable an easier shift from QTLs to the underlying candidate genes, progress is needed in combining the large body of QTL information. There are several possible ways to combine data from different studies to detect consensus QTLs and narrow QTL confidence intervals. One way would be to reanalyze the initial datasets in a combined way as was done in mouse by assuming a common biallelic mode of inheritance in several mapping populations (Li et al. 2005a). The rice mapping populations have characteristics that would fit such a design well. Most populations are recombinant inbred lines (RIL) or doubled haploid (DH) indica × japonica populations where two major alleles segregate, and they share a large corpus of markers, notably restriction fragment length polymorphism and microsatellite markers from Cornell University or from the Japanese Rice Genome Project (Harushima et al. 1998). However, this approach is limited by the difficulty involved in accessing raw phenotypic data. Another option is to perform meta-QTL analysis using the method proposed by Goffinet and Gerber (2000) and improved by Veyrieras et al. (2007). This statistical approach enables QTL results from independent studies to be combined into a single result using only published QTL data. Software packages based on this methodology are available such as BioMercator (Arcade et al. 2004) and Meta-QTL (Veyrieras et al. 2007), a new software that removes some of the limitations of BioMercator in the number of QTLs it can handle at once. Meta-QTL analyses for flowering time in maize (Chardon et al. 2004), resistance to a nematode in soybean (Guo et al. 2006), earliness in bread wheat (Hanocq et al. 2007), and N uptake in maize (Coque et al. 2008) have already shown the usefulness of this approach in determining the number of “true” QTLs underlying the observed QTLs and in positioning consensus QTLs with more precision. In rice, Norton et al. (2008) conducted a study on root QTLs, but it was limited to the Azucena × Bala population and only three QTL clusters.

The main objective of this study was therefore to combine the results of all QTL detection studies conducted from 1995 to 2007 on root traits in rice through a meta-QTL analysis involving the whole genome.

Such an approach requires easy access to the entire QTL dataset for the traits and species under study. Many reported rice QTLs have already been assembled in the Gramene database (http://www.gramene.org) which is the broadest and so far most extensive source of public information on QTLs in cereals. However, in Gramene, specific information on the QTLs includes flanking markers, logarithm of the odds (LOD) score and phenotypic variation explained by the QTLs, but not the experimental conditions under which the QTLs were detected, which can be important when dealing with abiotic stresses. Information such as the mapping population size, which can be needed to compute the QTL confidence interval (Darvasi and Soller 1997), is not directly linked to the QTL. The link between the QTLs and the genome sequence is also indirect. The secondary objective of this study was therefore to develop a database on rice QTLs for drought resistance from which it will be easy to extract the elements necessary for meta-QTL analyses on drought-related traits.

Results

Database development

We extracted the relevant information on 1,467 QTLs controlling drought traits in rice from 42 published papers. An additional 23 papers were also analyzed, bringing the total number of QTLs in the database to 2,137, but their topic (QTLs for flowering time, QTL for tolerance to salinity, and QTLs detected in the IR64xAzucena population) was not relevant for the present paper. Basic information on QTLs is available to the scientific community in the rice module of the Tropgene database (http://tropgenedb.cirad.fr/en/rice.html), which is devoted to genetic, genomic, and phenotypic information on tropical crops (Ruiz et al. 2004). In comparison with the original papers, the names of the traits have sometimes been modified for the purpose of homogenization. A definition of the traits largely inspired by Gramene and connected to the Trait Ontology was established and is available in Tropgene DB (http://tropgenedb.cirad.fr/html/rice_QTL.html).

The list of parameters recorded for each QTL is given in Supplementary Table 1. The list includes the “experimental conditions” [field or greenhouse conditions, aerobic or anaerobic conditions, and controlled (well watered, normal impedance) or stressed conditions (water stress or compaction)], which are particularly important for QTLs involved in response to abiotic stresses. The QTL set can therefore be split according to this parameter and meta-QTL analyses run on a specific subset of conditions.

The parameter list also includes the QTL positions with reference to the sequence of the Nipponbare pseudo-chromosomes v5.0 to facilitate the shift from the QTL and meta-QTL confidence intervals to the underlying genes. The positions of the QTLs were determined by the physical position of the most significant markers for QTLs mapped through single marker analysis. For QTLs detected through interval or composite interval mapping, their positions were defined as the mid-values of the physical positions of the two markers flanking the QTL peak because the names of flanking markers were always indicated in the papers, while this was seldom the case for estimated QTL positions. To determine the physical positions of the markers, Gramene data or BLAST results based on Gramene sequence information were used when available. Otherwise, for example in the case of amplified fragment length polymorphisms (AFLPs), the nearest marker with a known sequence in the original genetic map was used as a proxy. This solution led to a loss of precision in positioning the marker, but in our dataset, AFLPs represented only 5.9% of the markers linked to QTLs. Information on the physical position of each marker or of the proxy we used is also available in Tropgene DB (http://tropgenedb.cirad.fr/html/rice_Marker.html). For original data or additional details on a particular experiment, the reader is referred to the initial publications in the list of references. These QTLs can be visualized on a synthetic map using the Comparative Map Viewer Cmap v 0.13 from the GMOD project (http://www.gmod.org) integrated into Tropgene DB (http://tropgenedb.cirad.fr/en/rice.html).

Root QTLs

From our QTL database, we extracted all the QTLs related to root traits. They come from the 24 papers listed in Table 1. For the Bala × Azucena population, a new more saturated genetic map was recently developed (Khowaja et al. 2009) and used to reanalyze the initial phenotypic data (Price et al. 1999, 2000, 2002; MacMillan et al. 2006), so the new QTLs were used in this study. The list of the traits with their acronym is given in Table 2. A few QTLs with a LOD score below 2.0 were discarded from the analysis to reduce the risk of including false positives.

Table 1 Rice Root QTL Studies Included in the Analysis
Table 2 Root Parameters for Which QTLs Were Detected

The root QTL set was composed of 675 QTLs for 29 root traits detected in 12 mapping populations (Tables 1 and 2). In several cases, the same populations had been used by different teams in independent experiments. These populations were mostly derived from indica/japonica or japonica/indica crosses (eight populations representing 560 QTLs) for polymorphism reasons. The set also included two indica × indica populations (91 QTLs), two temperate japonica/tropical japonica populations (21 QTLs), and one temperate japonica/aus population (three QTLs). Some traits such as MRL or RTHK were measured in a large number of studies (13 and 12, respectively), while other traits such as lateral root length or deep root number were measured in just one study. For each trait, Table 3 presents an overview of the number of QTLs in segments of 5.0 Mb covering the whole genome. QTLs were found in almost all segments with an average of 8.4 QTLs per segment, but a concentration of above 20 QTLs was observed in six segments, namely, the 30- to 40-Mb segment of chromosome 1, the 25- to 35-Mb interval of chromosome 2; the 0- to 5-Mb interval of chromosome 3, the 30- to 35-Mb interval of chromosome 4, the 15- to 20-Mb interval of chromosome 9, and the 15- to 20-Mb interval of chromosome 11. These segments, which represented 10% of the genome in length, carried 30% of the total number of QTLs.

Table 3 Number of QTLs and Meta-QTL for Diverse Root Traits in 5-Mb Windows

Supplementary Figure 1 presents the chromosomal location of the QTLs. Traits for which less than 20 QTLs were detected, of least interest for the meta-QTL analysis, are not included in the figure for reasons of space. The last letter of the QTL name indicates under which conditions the QTL was detected, with “w” standing for well-watered conditions (368 among the 675 QTLs), “s” standing for water stress conditions (147 QTLs), and “c” for compaction stress using the wax petrolatum layer system (160 QTLs). For the traits for which the number of QTLs detected in both control and stressed conditions permitted binary comparisons [MRL, RTHK for well-watered and water stress conditions; RN versus PRN, RTHK versus penetrated root thickness (PRTHK) for standard and compacted conditions], the QTLs observed under stressed conditions were generally co-localized with QTLs detected under favorable conditions notably in the six QTL hot spot areas mentioned above. Several isolated QTLs specific to one condition or another were also found (Supplementary Figure 2). Some clusters grouping only QTLs detected under well-watered conditions were also observed.

Meta-QTL analysis

Meta-QTL analysis was possible for ten root traits (RN, PRN, MRL RTHK, PRTHK, RDW, DRW, DRR, DR/T, and RPI) with sufficient QTL density (more than five) on at least one of the 12 chromosomes. Therefore, only 306 QTLs out of the 675 originally identified, representing a total of 32 root trait × chromosome combinations, were submitted to the meta-analysis. A total of 119 meta-QTLs were detected. The complete list of the 119 meta-QTLs, with their physical position as well as the list of the 306 QTLs, is given in Supplementary Table 2. Graphical results of the meta-QTL analysis are presented in Supplementary Figure 3. In each trait × chromosome figure, the lengths of the colored segments in each QTL bar are proportional to their probability of belonging to the meta-QTL of the same color. The probability for a QTL to belong to a cluster is indicated in Supplementary Table 2. Most of the QTLs (93.0%) were attributed to just one cluster with a probability of more than 90%. The QTLs with a probability of attribution to several clusters were those with the longest confidence intervals, which were generally found in the doubled haploid or backcross mapping populations.

Extracts of Supplementary Table 2 and Supplementary Figure 3 focusing on MRL and chromosomes 1 and 9 are given as examples in Table 4 and Figs. 1 and 2.

Table 4 QTLs Reported and Meta-QTLs Detected for Maximum Root Length on Chromosomes 1 and 9
Fig. 1
figure 1

QTL and meta-QTLs for maximum root length on chromosome 1. The meta-QTLs on the body of the chromosome are represented by bars of different intensities of gray. The lengths of the gray segments in each QTL bar are proportional to their probability of belonging to the meta-QTL of the same intensity of gray. Distances on the map are in megabases. A color version of the figure can be found in Supplementary Figure 3.

Fig. 2
figure 2

QTL and meta-QTLs for maximum root length on chromosome 9. The meta-QTLs are represented by bars of different intensities of gray on the body of the chromosome. Distances on the map are in megabases. A color version of the figure can be found in Supplementary Figure 3.

The 119 meta-QTLs detected corresponded to a reduction of 61% in the number of QTLs. The reduction was not evenly distributed. On some chromosomes such as chromosome 3 for RTHK, the QTLs were distributed in such a way that little was gained by meta-analysis (ten QTLs resolved into six meta-QTLs). Some QTLs remained isolated. On some others, QTLs were concentrated on a short segment and the effect of the meta-analysis was striking. Good examples are the 30- to 40-Mb segment of chromosome 1 carrying 15 QTLs for MRL resolved into three QTLs (Fig. 1) or the 10- to 23-Mb segment of chromosome 9 carrying 14 QTLs for MRL resolved into just one (Fig. 2).

As for resolution power, the increase in precision of the QTL location was also strong. The average confidence interval of the QTLs involved in the meta-analysis was 4.14 Mb and went down to 1.98 for the meta-QTLs. The meta-QTL confidence interval was generally smaller than the smallest confidence interval of the QTLs of the cluster. This was all the more true when the cluster encompassed a large number of QTLs.

Twenty-five meta-QTLs had a confidence interval below 250 kb, small enough to envision browsing the underlying genes. Six of the meta-QTLs on chromosome 2 corresponded almost exactly to the same physical interval, which reduced the number of different meta-QTLs to 20. The genes located in these segments were retrieved from the Nipponbare sequence. Those with an indication of biological function are listed in Supplementary Table 3. The splice variants of a given gene were not individualized. The number of genes with a known function underlying the meta-QTLs varied from 0 to 32. Among those that could be of particular interest in the context of root development (in bold in Supplementary Table 3), we observed genes involved in auxin signaling and transport, transcription factors, and sugar metabolism genes.

Comparison between the positions of the meta-QTLs and those of three cloned genes controlling flowering time showed that for Hd3a and Hd1 on chromosome 6, the confidence interval of the meta-QTLs mqDUR6-2 and mqDUR6-3 included the gene positions (Supplementary Figure 3). This remained true when the QTL with the longest confidence interval was removed from the analysis, which resulted in a sharp decrease in the confidence interval of the meta-QTLs. On chromosome 3, the two meta-QTLs detected in the region clearly did not overlap with Hd6, but nor did most of the QTLs in this zone. For the analysis of plant height QTLs on chromosome 1, one of the meta-QTLs, mqHGT1-4, was close to Sd1, but still 340 kb away, while some of the QTL confidence intervals included the gene position (Supplementary Figure 3). The QTLs for MRL on the distal part of chromosome 1, which have a major effect, have often been said to correspond to Sd1, assuming a pleiotropic effect of the semi-dwarfism gene. Sd1 was indeed within the confidence interval of the meta-QTL mqMRL_1-4 for MRL.

Discussion

We constructed a database of drought QTLs in rice that assembles a large body of information on each QTL (http://tropgenedb.cirad.fr/en/rice.html). These QTLs can be visualized on an integrative map using a map viewer. Complex queries on various fields including the experimental conditions under which the QTLs were detected are possible directly on the web site. Another convenient option is to download a subsample of QTLs in an Excel format file and filter it on any of the available fields. This procedure is particularly useful for immediate assessment of the presence of a QTL in a given physical zone by filtering the locus position field for a range of physical positions. Conversely, existing bioinformatics tools such as those available in OrygenesDB (http://orygenesdb.cirad.fr/) can be used to list the genes underlying any QTL confidence interval (Droc et al. 2006).

The compilation presented here focuses on root traits, but the same work is possible for any other drought resistance trait such as leaf rolling score, leaf drying score, or osmotic adjustment for which most QTLs from the 1995–2007 period have been compiled in the database. The database module will be yearly updated with integration of the newly published QTLs.

The projection of all QTLs on a virtual map allows the issue of whether the QTLs detected under stress conditions are or are not a subset of those detected under favorable conditions to be addressed. In the case of root traits, QTLs detected under stress conditions co-localized with QTLs detected under favorable conditions in the majority of the cases. The possibility that different QTLs (and presumably the underlying genes) expressed in the different situations can be clustered in the same region cannot be excluded, but more likely, the same QTL (and underlying gene) works differently depending on the growth conditions, as demonstrated by MacMillan et al. (2006). For the few clusters grouping only QTLs detected in one condition, the more frequent presence of clusters grouping only QTLs expressed under favorable conditions can be explained by the larger number of QTLs of this type, associated with the higher heritability of traits measured under favorable conditions. The detection of QTLs in some populations and not in others may be due to the segregation of the underlying genes in various mapping populations and fixation of the same genes in other populations.

We ran the meta-analysis separately for each trait because the method normally requires the experiments to be independent. Nevertheless, it is possible to pool different correlated traits measured in the same individuals in a single meta-analysis since Goffinet and Gerber (2000) showed through simulation that the procedure was robust even in such cases. Assuming that several of the traits studied are pleiotropically related, analysis of pooled traits may be more powerful than analysis of an individual trait. It may also increase the number of chromosomes with more than six QTLs and therefore ensure more complete coverage of the genome than the 32 trait × chromosome combinations we studied. The pooled trait approach is presently being applied to the root QTLs of the Azucena × Bala population (Khowaja et al. 2009). Time will tell which approach was best when the genes are cloned. Meanwhile, one would expect that the analysis of pooled traits works best for pleiotropy situations but less well for tight linkage.

The global QTL dataset we submitted to meta-analysis was not as precise as each of the individual QTL mapping studies we compiled. This was unavoidable because of (1) the intrinsic need of the meta-QTL analysis tool for homogeneous data, (2) the heterogeneity of the source data, and (3) some of our methodological choices. Information on the precise positions or confidence intervals of the putative QTLs, for example, was often absent from the papers, with just indication of the flanking markers. Our decision to use a proxy for non-sequenced markers led to reduced precision concerning the position of QTLs linked to these markers. Also, the ordering of the markers on some genetic maps can be locally different from their order on physical maps. If the problem is inconsequential with single marker analysis, the impact of map errors on QTL position is likely to have increased with the shift to simple or composite interval mapping analyses. The information set that we were finally able to use was therefore reduced to the minimum common denominator when all papers were taken into account. Another option would be to exclude the least informative papers from the analysis.

Offsetting its need for a homogeneous QTL dataset, meta-QTL analysis allows an advance over browsing the database by reducing the number of observed QTLs to a more limited number of meta-QTLs with a narrower confidence interval. The best example is probably the meta-QTL for MRL on chromosome 9 whose confidence interval was reduced to a length of 20 kb, thereby reducing resolution to a few genes (two genes in this interval). At first glance, however, neither of the two genes seemed to be a good candidate for root system development, but six genes away (a distance of 40 kb), there was an auxin efflux carrier that was a better candidate based on its function. It thus appears that the reduction in the confidence interval may be very important, but these results should be considered with caution in such cases. For other QTL clusters, the gain was more modest since the average confidence interval of the meta-QTLs was close to 2 Mb, which, in rice, represents an average of 200 genes. Meta-QTL analysis works better with a large number of QTLs and gives less reliable results when the number of QTLs per chromosome is below six. But this is also the situation in which meta-QTL analysis is the least interesting because the QTLs are less widely distributed among the mapping populations, or have smaller effects.

We evaluated the efficiency of meta-QTL analysis in improving the prediction of the meta-QTL positions by applying the method to QTLs surrounding four cloned genes. On chromosome 6, which carries the Hd1 and Hd3 flowering time genes, precision was considerably improved since the confidence intervals of the two meta-QTLs co-localizing with the genes were smaller than that of each individual QTL mapped in the individual studies we analyzed. But a third meta-QTL was also detected. On chromosome 3, however, the meta-QTLs did not cover the Hd6 gene, but nor did most of the QTLs, so it is likely that another gene located closer to the top of chromosome 3 is responsible for the meta-QTL for flowering time detected there. On chromosome 1, the Sd1 gene controlling plant height was very close to the meta-QTL with the narrowest confidence interval of the three meta-QTLs detected in this area. The number of cloned QTLs used for this validation was low, and this result may be a coincidence. Moreover, both flowering time and plant height have higher broad sense heritability than root traits, and QTLs for root traits will have been located with less precision than those used in the evaluation of the efficiency of the method. In addition, meta-QTL analysis frequently identified several meta-QTLs under a QTL cluster. It is possible that each meta-QTL corresponds to a different gene, but the situation may also reflect a bias in the method. Nevertheless, all things considered, these results make us confident that the method described here is valuable, probably all the more so when the QTL dataset is of high quality.

There are several possible ways to go further in the identification of candidate genes, the highest efficiency being expected from a combination of options producing a range of evidence. For the meta-QTLs with the longest confidence intervals, the identified regions can be explored by association mapping of appropriate genotype panels and a denser set of markers for the region concerned, enabling the QTL position to be narrowed down. The method can be used for segments longer than 500 kb, which is the upper limit of linkage disequilibrium span in rice (Mather et al. 2007), and shorter than 4 Mb, which is the average QTL confidence interval in our dataset and the point at which association mapping will not provide added value compared to bi-parental population mapping. For meta-QTL with a confidence interval below 500 kb, the underlying candidate genes can easily be listed. Under our meta-QTLs, we found, for example, genes involved in auxin signaling and transport or in cell division, transcription factors expressed under abiotic stresses, and sugar metabolism genes that could act on root growth or serve as osmoprotectants. However, all genes within the confidence interval could be candidates for further work, and those with no known function could not be eliminated either. Links between the gene function and the QTL can be reinforced by the involvement of the gene in a relevant pathway such as those described by Malamy (2005), Nibau et al. (2008), or Rebouillat et al. (2008) for root development or by the expression of the gene in root tissues, a method explored by Norton et al. (2008). Mutants with t-DNA insertion in the candidate genes can also be searched in OrygenesDB, their expression in root tissues checked by using Oryza-Tag Line GUS and GFP data (Larmande et al. 2008) and the mutants analyzed to check for the mutation effect on root morphology or development.

However, it is important to note that even without knowing the genes involved, the reduction both in the number of QTLs and size of their confidence interval should greatly facilitate the manipulation of the QTL carrying segments through marker-assisted selection. The scheme to build improved genotypes will be all the more complex if the number of segments to manipulate is large (Hospital and Charcosset 1997). For a limited number of candidate genes, one possible approach is to use the genotype building strategy proposed by Servin et al. (2004), focusing for example on the six zones already identified as potentially interesting in this paper. But the chances are that the interesting alleles with large effects will already be present in material with good drought resistance and that breeders will have to manipulate a larger number of QTLs with smaller effects. Additional progress leading to the accumulation of these QTLs can then be envisioned through marker-aided recurrent selection (Hospital et al. 2000; Bernardo and Charcosset 2006).

Methods

Statistical methods

The first element needed to run a meta-analysis is a single integrative map where all markers mentioned in the different studies are positioned. From the data available in the papers, it was not possible to establish a consensus genetic map since the genetic distance between markers was not indicated for all the mapping populations. Consequently, we chose to use the marker order of the physical map and the physical distance between markers as explained above.

The second element required is the confidence intervals of the QTL. In some papers, the confidence interval was computed using a support interval that was either LODmax-1 or LODmax-2, but in most papers, the confidence interval was not given. To homogenize the data and estimate missing confidence intervals, a simpler solution was to use the empirical formula proposed by Darvasi and Soller (1997) for F2 and BC and extended by Guo et al. (2006) to other population types. For a BC, an F2 or a DH population, the QTL confidence interval at the 95% level is 530/Nr 2, where N is the population size and r 2 is the proportion of the total variance explained by the QTL. For a RIL population, the formula becomes 163/Nr 2. These formulas generated confidence interval limits in centimorgan (cM). We converted these values into physical positions using a ratio of 1 cM per 244 kb (Chen et al. 2002) assuming that the ratio of genetic to physical distance was invariant throughout the genome. It is known that the correspondence between physical and genetic maps is not homogeneous along the chromosomes. But, in rice, unlike in maize, barley, and wheat, the reduced frequency of recombination mostly affects the areas surrounding the centromeres, and recombinationally inactive regions are limited to a few megabases (Chen et al. 2002). The bias the assumption is likely to introduce in the analyses will be concentrated in the centromeric regions, but in our dataset, the QTLs located in the 5-Mb window around the centromeres (2.5 Mb each side) represented only 9.1% of the total number of QTLs.

The meta-QTL analysis was run using the software package MetaQTL (Veyrieras et al. 2007). The method relies on a clustering algorithm based on a Gaussian mixture model and enables determination of the likely number of clusters considered as the “true” QTLs underlying the QTLs observed in a given region. The optimal number of clusters is chosen by means of an information-based criterion, and the position and confidence interval of the meta-QTLs can then be estimated. Meta-QTL proposes five different criteria for model choice, but here, we present only the results based on the Akaike information content criterion, which performed well in simulations (Veyrieras et al. 2007).

Meta-analysis was not attempted for chromosomes carrying less than six QTLs, as in this case, the model performed poorly because of over-parameterization.

To facilitate the visualization of the QTLs and meta-QTLs and the shift back to markers, the QTLs and meta-QTLs were projected on a virtual physical map composed of microsatellite markers corresponding to those used to build the IR64xAzucena SSD genetic map (Ahmadi et al., unpublished data). The distances between markers mentioned on the map are physical distances expressed in megabases.

The genes underlying the meta-QTLs were extracted with the tool “Get sequence” of OrygenesDB (http://orygenesdb.cirad.fr) applied to the Nipponbare sequence (Droc et al. 2006).

Evaluation of efficiency of meta-QTL analysis

A comparison of the position of consensus QTLs obtained by meta-analysis with that of cloned genes for the same trait was performed to evaluate the efficiency of the method. No QTL for root trait has yet been cloned, but the approach was possible for flowering time and plant height. A total of 80 QTLs for flowering time detected under well-watered conditions were collected from ten papers (Li et al. 1995; Yano et al. 1997; Lafitte and Courtois 1999; Lin et al. 2000; Maheswaran et al. 2000; Yamamoto et al. 2000; Yu et al. 2002; Hittalmani et al. 2003; Lanceras et al. 2004; Dong et al. 2004). A total of 59 QTLs for plant height were collected from ten papers (Li et al. 1995; Lafitte and Courtois 1999; Hemamalini et al. 2000; Yu et al. 2002; Courtois et al. 2003; Hittalmani et al. 2003; Lanceras et al. 2004; Xu et al. 2004; MacMillan et al. 2006; Gomez et al. 2006). All the QTLs are compiled in TropgeneDB. These QTL were submitted to a meta-QTL analysis in the same way as the root QTLs. The meta-QTL positions for flowering time were then compared to the physical position of three cloned flowering time QTLs: Hd1 cloned by Yano et al. (2000) and Hd3a cloned by Kojima et al. (2002), both on chromosome 6, and Hd6 cloned by Takahashi et al. (2001) on chromosome 3. These QTLs corresponded to genes Os06g16370 located between base pairs 9,335,361 and 9,337,634 on chromosome 6, Os06g06320 located between bp 2,939,005 and 2,941,453 on chromosome 6 and Os03g10940 located between base pairs 5,595,162 and 5,599,842 on chromosome 3, respectively. The same comparison was undertaken for the meta-QTLs for plant height on chromosome 1 whose positions were compared to that of Sd1 (Os01g66100) cloned by Monna et al. (2000) located between base pairs 38,709,231 and 38,712,353.