Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
Authors: Norio Tanaka aff001; Akihisa Takahara aff001; Taichi Hagio aff001; Rika Nishiko aff001; Junko Kanayama aff001; Osamu Gotoh aff001; Seiichi Mori aff001
Authors place of work: Project for Development of Innovative Research on Cancer Therapeutics, Cancer Precision Medicine Center, Japanese Foundation for Cancer Research, Ariake, Koto-ku, Tokyo, Japan aff001; Data4C’s Co. Ltd., Minami-azabu, Minato-ku, Tokyo, Japan aff002
Published in the journal: PLoS ONE 15(1)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0227427
Summary
DNA fragmentation is a fundamental step during library preparation in hybridization capture-based, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.
Keywords:
Cancer treatment – Substitution mutation – Mutation databases – DNA sequencing – DNA libraries – Nucleotide sequencing – DNA fragmentation – DNA fragmentation techniques
Introduction
Next-generation sequencing (NGS) technologies have facilitated the delivery of precision medical care to patients with cancer. Short-read sequencing technology has been widely exploited for this purpose, encompassing amplicon- or hybridization capture-based library preparations [1]. This diagnostic strategy relies on accurate sequencing and interpretation to support the right clinical decision for each patient [1, 2]. Genome sequencing using hybridization capture comprises multiple steps, including tissue processing, tissue storage, DNA isolation, DNA fragmentation, probe hybridization, library amplification, sequencing, and informatics analysis [2–5]. Sequencing errors can be introduced at any of these steps, and nucleotides can be further modified through oxidation during tissue processing, tissue storage, DNA isolation, and DNA fragmentation [2–4, 6]. Nucleotide incorporation errors can in turn create polymerase reaction biases, affect precise library amplification, and generate sequencing noise [2–4]. Although substantial efforts have been made to minimize such sequencing noise experimentally, stochastic errors remain persistent [2–4].
Ultra-sonication has long been a standard method for DNA fragmentation during hybridization capture-based, short-read sequencing. Ultra-sonication creates even cuts in the DNA across the entire genome, thereby providing a simple means of controlling fragment size in a non-biased manner [2, 3]. However, the physical scattering of DNA solution during the process often leads to a loss of DNA sample, which can be critical when the sample amount is limited to nano- or picogram quantities, as is the case with biopsied tissue fragments. Several commercial library preparation kits are available, including the HyperPlus (KAPA Biosystems), SureSelect QXT (Agilent Technologies), Fragmentase (New England Biolabs), and Nextera Tagmentation (Illumina) kits, each of which uses endonucleases or transposases for DNA fragmentation. Although these kits minimize DNA loss, it remains largely unknown to what degree the enzymatic fragmentation process causes sequencing errors.
In the current study, we identified numerous artifactual SNVs/indels among libraries constructed using the HyperPlus kit for DNA fragmentation. These sequencing errors—characterized by variants located at the center of palindromic structures and near the 5’ or 3’ ends of the read, with multi-nucleotide substitutions—were deemed to have been introduced by the endonuclease treatment step and the following fill-in process for end repair, but not as a result of the whole sequencing process per se. Taking advantage of these noise properties, we developed an algorithm to efficiently distinguish sequencing errors from genuine mutations. This algorithm could be used in future studies to improve datasets that rely on enzymatic fragmentation using the same or a similar enzyme during library preparation.
Results
Distinct features of somatic SNVs/indels derived from different library preparation kits
In our sequencing facility, the SureSelect kit (Agilent, Santa Clara, California, United States) is the default protocol for library preparation for exome analysis and requires 200 ng of DNA as the starting amount. However, in some cases, the starting amount is less than 200 ng; this typically occurs when samples are extracted from small tissue fragments. For such samples, we use the HyperPlus kit (KAPA Biosystems, Cape Town, South Africa) for library preparation, which requires a minimum of only 20 ng of DNA. Typically, there is a sufficient amount of matched normal DNA for the standard processing. Consequently, although this is not ideal, somatic mutation calling is often performed on tumor and matched normal samples prepared using different DNA fragmentation methods.
Using exome sequencing of tumor samples—with the exception of hypermutators—we will typically detect several tens or hundreds of somatic SNVs/indels per sample. However, in our experience, we noted that some tumor samples exhibited an extraordinarily large number of SNVs/indels, exceeding a few thousand, regardless of tissue origin, histological type, or method of tissue preservation. We also noted that these tumor samples were prepared using the HyperPlus kit and that the paired normal DNA samples were prepared with the SureSelect kit. Further inspection of data pertaining to 31 tumors (16 gastric, 13 lung and 2 rectal cancers) prepared using the HyperPlus kit revealed a higher median number of SNVs (median: 2,308, range: 1,119–3,996) and indels (median: 89, range: 33–437) as compared with data from tumor samples prepared using the SureSelect kit.
This obvious discrepancy prompted us to perform pairwise comparisons of sequenced reads from the same tumor DNA libraries prepared with the SureSelect kit versus those prepared with the HyperPlus kit (Fig 1). Six tumor tissues preserved as fresh-frozen samples were used for this analysis. For somatic calls, normal DNA samples were prepared using the SureSelect kit. In our comparisons of the exome sequencing process, we noted one major difference between the two library preparation kits: the SureSelect kit uses ultra-sonication for DNA fragmentation whereas the HyperPlus kit relies on endonuclease treatment. Quality assessment of the sequencing data from the six tumor samples generated by both methods showed no difference in the percentage of Q30 bases (bases with a Phred quality score of at least 30). However, the percentage of on-target reads, which is dependent on the library preparation method, differed between the SureSelect and HyperPlus kits (Table 1). Therefore, we obtained and analyzed somatic SNVs and indels from two combinations: tumor DNA–SureSelect with normal DNA–SureSelect (“Somatic SNVs/indels with SureSelect”) and tumor DNA–HyperPlus with normal DNA–SureSelect (“Somatic SNVs/indels with HyperPlus”) (Fig 1).
Fig. 1. Experimental procedure to prepare sequencing libraries using Agilent SureSelect or KAPA HyperPlus kits, and the analytical pipeline to call somatic SNVs and indels. A major difference between the two kits is the use of ultra-sonication versus enzymatic treatment with an endonuclease for DNA fragmentation. The same tumor DNA samples were used for library preparation, and the SureSelect kit was used for all of the paired normal DNA in the somatic mutation detection.
Tab. 1. Quality assessment of sequencing.
The features of the somatic SNV/indel call results derived from the SureSelect and HyperPlus treatments are shown in Fig 2. Despite starting with the same sample of DNA, the HyperPlus libraries resulted in 2.3- to 9.9-times more SNV/indel detections than the SureSelect libraries (Fig 2A). Importantly, most of the SNVs/indels derived from the SureSelect treatment were nested within those from the HyperPlus libraries, but most of the SNVs/indels from the HyperPlus treatment were not shared with the SureSelect-treated libraries (Fig 3A). Given that the numbers of SNVs/indels from the SureSelect libraries for the six tumor samples were comparable with those described in previous literature [7, 8], we concluded that the HyperPlus libraries generated a substantial number of somatic SNVs/indels as non-biological sequencing artifacts.
Fig. 2. Differences in the sequenced results using the HyperPlus kit versus the SureSelect kit.
Fig. 3. Features of SNV/indel noise caused by the HyperPlus kit. (A) Venn diagram of somatic SNVs/indels detected in the HyperPlus- and SureSelect-treated samples. Magenta and green lines indicate HyperPlus and SureSelect treatments, respectively. The regions [a] (red) and [b] (blue) indicate somatic SNVs/indels detected in the HyperPlus-treated but not the SureSelect-treated samples (red), and those shared by both HyperPlus- and SureSelect-treated samples (blue), respectively. Note: features were extracted from BAM files from libraries prepared with the HyperPlus kit for tumor samples. (B) Histogram depicting recurrence in somatic SNV/indel detection across 28-sample libraries prepared with the HyperPlus kit. X- and y-axes indicate the number of recurrent detections of identical SNVs/indels and the number of SNVs/indels, respectively. Red and blue colors indicate somatic SNVs/indels in [a] and [b] in (A), respectively. (C) Histogram for the distribution of SNV/indel position in the read. X- and y-axes indicate the positions of SNVs/indels within 5-bases and the number of reads, respectively. Left panel: position of somatic SNVs/indels detected in [a] (red). Right panel: positions of somatic SNVs/indels detected in [b] (blue). The number of wildtype nucleotide reads that mapped to the same genomic coordinate as the detected SNV/indel is indicated in grey. (D) Histogram for the ratio of the soft-clipped reads. X- and y-axes indicate the ratio of soft-clipped reads at 0.05 intervals and the number of SNVs/indels, respectively.
Closer inspection of the data uncovered that many of these somatic SNVs were coincidently located at the center of palindromic sequences, herein designated as “SNV-centered palindromes” (SCPs). HyperPlus libraries also more frequently generated longer SCPs, whereas no SCP over 15 bases in length was detected among the SureSelect libraries (Fig 2B).
COSMIC mutational signature analyses [10] were performed to assess the overall pattern of the SNV artifacts generated by the HyperPlus kit; the scores are depicted as heatmaps in Fig 2C. Consistent with previous reports [11, 12], the SureSelect libraries produced tumor-type–associated signature profiles, with higher scores for signatures 1, 2, and 4 for lung cancers (A001, A004, A005 and B012) and signatures 1 and 6 for rectal cancers (C742 and C772). However, the HyperPlus libraries showed constant peaks associated with signatures 3, 4, and 22, even across tissues of origin, indicating that the HyperPlus treatment generated a specific set of nucleotide substitutions in the genome as the “HyperPlus signature” (Fig 2C). From these observations, we concluded that the HyperPlus treatment method led to non-biological sequencing artifacts, with biased nucleotide substitutions at characteristic palindromic parts of the genome.
Attributes of sequencing artifacts by HyperPlus
To obtain a more detailed characterization of these sequencing artifacts, we divided the somatic SNV/indel calls from the HyperPlus libraries into two categories of variants: HyperPlus-specific SNVs/indels (category [a]), and commonly detected SNVs/indels, which were found with both libraries (category [b]). This categorization was based on the premise that most of the SNVs/indels in category [a] were likely to be noise generated by the HyperPlus method of preparation, and that genuine somatic SNVs/indels would predominantly be found in category [b] (Fig 3A). We noted three distinctions in the SNVs/indels between the two categories: 1) Most SNVs/indels in category [a] were detected recurrently, that is, at least once elsewhere in the pooled data from the HyperPlus libraries, but this was not the case for those in category [b] (Fig 3B); 2) SNVs/indels in category [a] were frequently located 10 to 15 bases away from the 5’ or 3’ edge of the read (defined as “positional bias”), whereas SNVs/indels in category [b] were more uniformly distributed (Fig 3C); 3) Reads with SNVs/indels from category [a] were soft-clipped far more often than those from category [b] (50.8% and 5.0% on average, respectively), which implies multi-nucleotide substitutions at the 5’ or 3’ end of the read (Fig 3D).
Designing a filtering algorithm to remove sequencing artifacts derived from HyperPlus
Despite these shortcomings, enzymatic fragmentation for library preparation is often unavoidable, particularly when only nanograms of DNA sample are available. Taking advantage of the salient properties of the sequencing noise generated by the HyperPlus method, we sought to develop a filtering algorithm to remove these artifacts from somatic SNV/indel call results to optimize the sequencing data (Fig 4A). The algorithm comprised two filtering steps. First, we excluded recurrently detected SNVs/indels across the pooled HyperPlus data, unless the SNVs/indels were already registered in the COSMIC database. Second, we developed and utilized a predictive model to remove SNVs/indels that showed a positional bias in a read and/or those on frequently soft-clipped reads. Positional bias was quantified using the Kolmogorov-Smirnov (KS) test to compare the in-read positions of variant and wildtype alleles. The extent of soft clipping was measured using the ratio of soft-clipped reads per total reads with SNVs/indels. Overall, the predictive model was based on logistic regression to classify the SNV/indel as noise or signal (Fig 4A).
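The positional-bias measure can be illustrated with a small standard-library sketch. This is not the authors' code (they report using R for the KS comparison); the function names are hypothetical, and the p-value uses the large-sample Kolmogorov approximation, which may differ from the exact p-value R reports for small samples without ties.

```python
import math
from bisect import bisect_right

def ks_statistic(a, b):
    """Two-sample KS statistic D: largest gap between the two empirical CDFs."""
    a, b = sorted(a), sorted(b)
    d = 0.0
    for x in set(a) | set(b):
        fa = bisect_right(a, x) / len(a)  # fraction of a that is <= x
        fb = bisect_right(b, x) / len(b)  # fraction of b that is <= x
        d = max(d, abs(fa - fb))
    return d

def ks_pvalue_asymptotic(d, n, m):
    """Approximate two-sided p-value from the Kolmogorov distribution.

    Assumption: large-sample formula P(K > x) = 2 * sum((-1)^(j-1) * exp(-2 j^2 x^2)).
    """
    lam = math.sqrt(n * m / (n + m)) * d
    if lam < 0.2:  # the series converges poorly here; the true value is ~1
        return 1.0
    total = 0.0
    for j in range(1, 101):
        total += (-1) ** (j - 1) * math.exp(-2.0 * j * j * lam * lam)
    return max(0.0, min(1.0, 2.0 * total))

# Hypothetical in-read positions: variant reads cluster 10 to 15 bases from
# the read edge, while wildtype reads are spread along the read.
variant_positions = [10, 11, 12, 12, 13, 14, 15, 11, 13, 12]
wildtype_positions = list(range(1, 101, 5))
d = ks_statistic(variant_positions, wildtype_positions)
p = ks_pvalue_asymptotic(d, len(variant_positions), len(wildtype_positions))
```

A low p-value here indicates that the variant allele sits at unusual positions in the read compared with the wildtype allele at the same coordinate, which is the pattern reported for category [a].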
Fig. 4. Filtering process for the removal of somatic SNV/indel noise generated by treatment with the HyperPlus kit.
2-D scatter plots show the relationship between the KS p-value for positional bias and the ratio of soft-clipped reads (Fig 4B). Whereas category [a] SNVs/indels (mostly sequencing artifacts) were characterized by lower KS p-values and/or a higher ratio of soft-clipped reads, SNVs/indels in category [b] (mostly genuine SNVs/indels) had higher KS p-values and a lower ratio of soft-clipped reads. A threshold was then estimated to distinguish SNVs/indels between the two categories using a generalized linear model with the logit link function. Using receiver operating characteristic (ROC) curve analysis for the six-sample training data, the final model was established and shown to be capable of distinguishing SNVs/indels between the two categories with a specificity of 0.914 and a sensitivity of 0.979 (Fig 4B).
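Choosing a cut-off along an ROC curve can be sketched as follows. This is an illustration only: it assumes each variant already has a model score where higher means more noise-like, treats noise as the positive class (the source does not state which class sensitivity refers to), and picks the threshold that maximizes sensitivity plus specificity. The function name and the toy scores are hypothetical.

```python
def best_threshold(scores, labels):
    """Return (threshold, sensitivity, specificity) maximizing sens + spec.

    scores: model outputs, higher = more likely noise (assumed convention).
    labels: True for noise, False for genuine variants.
    A variant is called noise when its score is >= threshold.
    """
    n_pos = sum(1 for y in labels if y)
    n_neg = len(labels) - n_pos
    best = None
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if not y and s < t)
        sens, spec = tp / n_pos, tn / n_neg
        # Keep the first threshold that achieves the highest sum.
        if best is None or sens + spec > best[1] + best[2]:
            best = (t, sens, spec)
    return best

# Toy data, not taken from the study.
toy_scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.1]
toy_labels = [True, True, True, False, True, False, False]
threshold, sensitivity, specificity = best_threshold(toy_scores, toy_labels)
```

Scanning every observed score as a candidate threshold is quadratic in the number of variants, which is acceptable for a few thousand calls; a sorted single pass would be preferable for larger sets.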
Noise reduction in the training data
We next applied our noise filtering algorithm to the six-sample training data to assess how filtering affects the data derived using the HyperPlus and SureSelect treatments (Fig 5). The total numbers of SNVs/indels in category [a] (likely noise from HyperPlus treatment) and category [b] (likely genuine mutations) were 11,731 and 2,984, respectively. Of these, 10,928 and 16 SNVs/indels, respectively, were recurrently detected across the in-house pooled data prepared with the HyperPlus kit; among these recurrent calls, 389 (3.6%) and 11 (68.8%) were registered in the COSMIC database (ver. 82). Because these 389 and 11 SNVs/indels were considered probably genuine, they were returned to the filtering process. This left 1,192 and 2,979 SNVs/indels in categories [a] and [b], respectively.
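The bookkeeping for this first filtering step can be checked directly from the figures quoted in the paragraph above. The variable names are only illustrative.

```python
# Counts quoted in the text for categories [a] and [b].
total = {"a": 11731, "b": 2984}
recurrent = {"a": 10928, "b": 16}   # recurrent in the pooled HyperPlus data
in_cosmic = {"a": 389, "b": 11}     # recurrent but registered in COSMIC

# Recurrent calls are removed, except those rescued by COSMIC registration.
remaining = {k: total[k] - recurrent[k] + in_cosmic[k] for k in total}

# Share of recurrent calls that were rescued, in percent.
rescued_pct = {k: 100.0 * in_cosmic[k] / recurrent[k] for k in total}

for k in ("a", "b"):
    print(f"category [{k}]: {remaining[k]} remain, {rescued_pct[k]:.1f}% of recurrent calls rescued")
```

The results agree with the 1,192 and 2,979 SNVs/indels reported as entering the second, model-based filtering step.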
Fig. 5. Noise reduction in the six-sample training data.
We then proceeded to the next step of the logistic regression based on positional bias and soft clipping. The predictive model classified 1,090 and 62 SNVs/indels as HyperPlus noise in categories [a] and [b], respectively. As anticipated, after filtering, most of the SNVs/indels in category [a] (99.1%; 11,628/11,731) were removed, but far fewer SNVs/indels were removed for category [b] (2.2%; 67/2,984) (Fig 5A and 5B). The resultant number of SNVs/indels in the HyperPlus data after filtering was similar to that in the unfiltered SureSelect data (Fig 5E, left panel). Filtering efficiently removed SNVs with SCPs longer than 13 nucleotides from category [a], whereas most of the SNVs in category [b] remained in the group (Fig 5C). Among 11,695 filtered SNVs, 3,407 SNVs were located at the center of odd-length palindromes (length ≥ 5 bases) and 66.6% of such SCPs were recurrently observed across the samples (Fig 5D). An inspection of the substrings of the palindromes revealed substantial diversity in the length and nucleotide sequence among the samples (371–655 [median 568] different palindromes per sample; Fig 5D and S1 Table). Furthermore, consistent with the presence of positional bias of the artifactual SNVs (Fig 3C), we found that, in 90.4% of SCPs, the entire palindrome sequence was nested within 30 bases from the edge of the read (S1 Fig).
Consequently, the frequency of SNVs per length of SCP among the HyperPlus data after filtering was normalized to that of the SureSelect data (Fig 5E middle panel). Similarly, filtering rendered the mutational signature profiles of the six tumors mostly indistinguishable between the HyperPlus and SureSelect treatments (Fig 5E right panel). These observations confirmed the validity of the filtering algorithm in the six-tumor training samples.
Noise reduction in test data
We next assessed the effects of the filtering algorithm on the remaining samples not used to develop the predictive model for filtering (Fig 6). For this, we used data from 39 tumors derived from three independent genomic cohorts: gastric cancer cohort 1 (n = 3), a lung cancer cohort (n = 9), and gastric cancer cohort 2 (n = 27). Among the 39 samples, 25, 9, and 5 samples were sequenced with the KAPA HyperPlus, KAPA Hyper, and Agilent SureSelect library preparation kits, respectively. There were nine formalin-fixed paraffin-embedded (FFPE) and 30 fresh-frozen tumor samples. We show the number of SNVs and indels, and the pattern of the mutational signatures before and after filtering the data (Fig 6).
Fig. 6. Noise reduction in the test data.
The experimental procedure for the Hyper kit is similar to that for the HyperPlus kit, except that the Hyper kit, like the SureSelect kit, uses ultra-sonication for DNA fragmentation instead of endonuclease treatment. Notably, there was no significant difference in the number of SNVs/indels or the pattern of the mutational signature between the SureSelect-treated and Hyper-treated samples before filtering, suggesting that the Hyper kit per se does not produce the sequencing errors recorded for the HyperPlus kit.
Filtering substantially reduced the number of SNVs/indels in the HyperPlus data but had little effect on the Hyper and SureSelect data. The median (range) proportions of the remaining SNVs/indels were 10.8% (0.01%–46.9%), 85.2% (47.6%–98.8%), and 94.3% (86.5%–98.6%) for the HyperPlus, Hyper, and SureSelect datasets, respectively (Fig 6A). In the mutational signatures (Fig 6B), filtering removed cancer type-independent peaks for signatures 3, 4, and 22, with a more uniform distribution of signatures in the HyperPlus data. Subsequently, the noise reduction rendered more signals for signatures 1 and 6 for both gastric cancer cohorts (#1 and #2) and signatures 1, 4, 7, 13, 20, 22, and 24 for the lung cancer cohort (Fig 6B). On the other hand, the filtering algorithm did not change the profiles of the Hyper and SureSelect data (Fig 6B). These observations demonstrate that the artifacts introduced by sample preparation with the HyperPlus kit were removed selectively and efficiently by the noise reduction algorithm.
Reduced but persistent artifactual SNVs/indels by somatic mutation calling with normal–HyperPlus/tumor–HyperPlus libraries
For a more controlled analysis, we sought to examine the error production rates with the HyperPlus and SureSelect kits using the same DNA fragmentation method in paired normal–tumor samples, namely the HH combination (normal–HyperPlus versus tumor–HyperPlus) and the SS combination (normal–SureSelect versus tumor–SureSelect). Normal and tumor samples from two rectal cancer cases (C742 and C772) were analyzed. We found a substantially reduced number of somatic SNVs/indels for the HH combination (190 and 168 for C742 and C772 tumors), which was similar to that found for the SS combination (194 and 179 for C742 and C772 tumors). Whereas the SS and HH combinations detected both common and combination-specific SNVs/indels, it is important to note that the HyperPlus-associated sequencing errors and the production of SNV-centered palindromes were persistent in the HH combination, as detected by the filtering algorithm (Table 2). These findings clearly indicate that it is experimentally difficult to completely cancel the sequencing noise produced by the HyperPlus treatment, even when the same fragmentation method is used for paired normal and tumor samples, and suggest the necessity of using informatics to filter the noise.
Tab. 2. Somatic mutation calling with the same DNA fragmentation method in pairs of normal and tumor DNA.
Discussion
The advent of NGS has meant that DNA analysis can be achieved in an efficient and highly sensitive high-throughput manner, offering a means to generate large amounts of data, decipher the subtle yet potentially informative distinctions between samples, and help to facilitate an understanding of genetic disease. In hybridization capture-based short-read sequencing, DNA fragmentation is a necessary step in the preparation of nucleic acids, as the quality of the sequencing is contingent on both the randomness of the DNA fragmentation as well as the overlap of the resultant fragments. Furthermore, because fragment size tends to differ across NGS platforms and sequencing runs, efficient control of DNA fragment size is imperative.
Ultra-sonication is one such method that can control DNA fragment size by evenly cleaving DNA throughout the entire genome and, as such, has remained a gold standard in sequencing. However, studies have reported that ultra-sonication produces sequencing noise in the form of oxidative nucleotide modifications, such as guanine to 8-oxo-guanine (8-oxo-G) and cytosine deamination [2, 6, 13]. Nebulization is another commonly used mechanical method of DNA shearing. In this method, the DNA solution is forced through a small hole by compressed nitrogen or air, generating randomly sheared fragments with both overhangs and blunt ends.
In addition to these mechanical modes of fragmentation, several kits have been developed recently using enzymatic treatment to shear the DNA; however, it remains largely unknown whether sequencing errors occur with these alternative modes of cutting. One previous report showed that Fragmentase (New England Biolabs) causes more artifactual indels than sonication or nebulization, although the number of indels generated by Fragmentase appeared to be within the two-fold range of that produced by the physical methods [14].
We consider the sequencing noise in the HyperPlus-treated samples to be a consequence of endonuclease treatment. There are three major reasons for this proposition. First, we note positional biases in the mutations, with errors frequently located 10 to 15 bases from the 5’ or 3’ end of the read. This implies that the positions are associated with the cutting sites of the HyperPlus endonuclease. Second, the Hyper kit, manufactured by the same company as the HyperPlus kit, uses ultra-sonication for DNA fragmentation instead of endonuclease treatment, and did not produce the same amount of noise as that generated by the HyperPlus kit. Third, artifactual SNVs were often observed at the centers of palindromic sequences, suggestive of another bias in sequence recognition by the endonuclease(s) in the fragmentation step.
Previous studies have highlighted biases in the cleavage sites targeted by “non-specific” endonucleases, such as DNase I [15–18]. The HyperPlus endonuclease, whose type and composition have not been disclosed (KAPA Biosystems), seemingly has preferential recognition sites in genomic DNA, and these include palindromic sequences. Importantly, the SCPs were not only substantially diverse in length and sequence, but 66.6% of them also appeared recurrently across a range of samples. In addition, in almost all (90.4%) of the SCPs, the entire palindromic sequence was nested within 30 bases from the edge of the read. Based on these properties, we consider the HyperPlus endonuclease to be one or more endonucleases that prefer DNA sequences with diverse palindromic structure (over 1,000 palindromes with different lengths and sequences) without specificity for any particular sequence. Since a restriction enzyme is defined as an endonuclease with a specific recognition site [19], we speculate that the HyperPlus endonuclease is not a mixture of restriction enzymes. Nevertheless, limited information prevented us from further inferring the exact enzyme(s) responsible for the sequencing noise measured in our study. Other endonucleases for DNA fragmentation, such as Fragmentase [14], may also generate sequencing noise that could be misinterpreted as genuine mutations. Fragmentase is a mix of two enzymes: one randomly creates nicks in the dsDNA while the other cuts the strand opposite to the nicks. It is possible that the noise created by Fragmentase could similarly be removed from the data through a specific algorithm, like the one employed in this study.
Given that endonucleases themselves are incapable of incorporating nucleotides into the DNA or causing mutations [19], we speculate that mutations arise after enzymatic fragmentation during the “fill-in process” orchestrated by the DNA polymerase for end repair (“End repair & A-tailing enzyme” prior to adaptor ligation in the HyperPlus kit). Ultra-sonication randomly cleaves DNA molecules at different genomic positions and, therefore, in the subsequent fill-in process, nucleotides are incorporated at different genomic positions in different DNA molecules. Even if an erroneous nucleotide is incorporated into the cleaved sites, the resultant artifact would not be recognized as a mutation, because it would not consistently appear at the same position on different molecules. However, because the HyperPlus endonuclease preferentially cleaves specific sites on the DNA, when an erroneous nucleotide is incorporated, the resultant artifact could be mistakenly recognized as a mutation because it appears repeatedly at the same position on different molecules. For instance, hairpin structures formed in palindromes may result in nucleotide mis-incorporation at the center of a palindromic sequence, which would then be called, incorrectly, as a mutation. Moreover, multi-nucleotide substitutions near the end of the read, another feature of the artifactual noise, can arise from more than one mis-incorporation during the fill-in process. By filtering the data using our algorithm, these positional biases and other artifacts are identified and excluded, thereby minimizing the number of non-genuine mutations. In particular, the algorithm designed in this study identifies and excludes sequencing artifacts at the center of palindromic sequences, as well as multi-nucleotide substitutions near the ends of the read.
We found a substantial number of somatic SNVs/indels in the paired analysis of the six tumor samples using the SureSelect treatment for normal samples and the HyperPlus treatment for tumor samples (SH). We considered that such noise could be avoided by using the same DNA fragmentation method for paired samples (i.e., HH combination), and tested this using samples from two rectal cancer cases. Even though we confirmed a substantial reduction in the number of SNVs/indels using just one fragmentation method, upon careful examination, we detected the persistence of HyperPlus noise among the resultant SNVs/indels from the HH combination; this noise was frequently classified by the algorithm in other pairwise comparisons and characterized by palindromic structure. This finding reinforces our proposal of the risk that persistent errors may be confused with genuine mutations due to their recurrent appearance in a cohort. In such situations, the algorithm developed in this study can be used to distinguish true mutations from sequencing errors. The current study hence provides the technical basis to remove sequencing noise derived from HyperPlus endonuclease treatment.
Materials and methods
Starting amount of DNA
In our sequencing facility, the default protocol for library preparation in exome sequencing is the use of the SureSelect kit (Agilent Technologies). In cases where there is less than 200 ng of DNA, we use the HyperPlus (KAPA Biosystems) kit. Thus, for the purposes of this comparative study, the starting amounts of DNA were 40 and 200 ng for preparation with the HyperPlus and SureSelect kits, respectively.
DNA fragmentation by ultra-sonication
DNA shearing by ultra-sonication was performed with the E220 Focused-ultrasonicator (Covaris) for 360 s at 4°C according to the manufacturer’s recommendations. After shearing, the median peak in fragment length was 177 bp (range, 160–185 bp), as measured using the 2200 TapeStation (Agilent Technologies).
DNA fragmentation using HyperPlus endonuclease
DNA was incubated with the HyperPlus “Frag Enzyme” (KAPA Biosystems) at 37°C for 30 min, according to the manufacturer’s recommendations.
Library preparation
After enzymatic fragmentation (HyperPlus) or ultrasonic shearing (SureSelect), we performed end repair, phosphorylation, and the ligation of barcoded adaptors according to each of the manufacturer’s protocols. DNA samples were then captured by hybrid capture using the SureSelect Human All Exon V5 kit (Agilent Technologies). The captured libraries were amplified with the addition of index sequences, and were multiplexed before sequencing.
Sequencing
Libraries were sequenced using the HiSeq2500 (Illumina), according to the manufacturer’s recommendations, with a median depth of coverage of 260 (124–271) per tumor with the HyperPlus kit, 294 (257–334) per tumor with the SureSelect kit, and 172 (148–225) per normal tissue sample with the SureSelect kit.
Bioinformatics tools for somatic SNV/indel calls
Sequenced reads were aligned with BWA (Burrows-Wheeler Aligner; ver. 0.7.12) to the human genome reference (hg19) [20]. GATK (GenomeAnalysisTK; ver. 3.4–46) was used to recalibrate the variant quality score and to perform local realignment [21]. Somatic SNVs were called with VarScan (ver. 2.3.7), MuTect (ver. 1.1.5), and Karkinos (ver. 3.0.22) [22–24]. VarScan (ver. 2.3.7), SomaticIndelDetector (ver. 2.3–9), and Karkinos2 (ver. 0.1) were used to detect somatic indels [21, 22, 24]. SNVs and indels were considered genuine, and were used for subsequent analyses, only when they were detected by at least two of the three callers. SNVs/indels were annotated with ANNOVAR [25] (version released 22 March 2015). The COSMIC (Catalogue Of Somatic Mutations In Cancer; v82) database [12] was integrated into the ANNOVAR database and used to identify the SNVs/indels registered in COSMIC.
Informatics methods to characterize and filter sequencing artifacts by HyperPlus
We designated a somatic SNV located at the center of a palindromic sequence as an “SNV-centered palindrome” (SCP). Sequences centered on each SNV were extracted from the reference FASTA file (hg19.fa; http://hgdownload.cse.ucsc.edu/goldenPath/hg19/), and an in-house script determined whether they had palindromic structure. Reads with and without somatic SNVs/indels were extracted from the BAM file of each tumor sample. The position of a variant within a read was then assigned from the mapped position and the CIGAR string of the read, using an in-house Python (version 3.7.2) script built on the Pysam module (https://github.com/pysam-developers/pysam).
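The in-house scripts are not shown in the text, so the following Python sketch only illustrates the two ideas under stated assumptions. First, "palindromic structure" is assumed to mean that the flank on one side of the SNV is the reverse complement of the flank on the other side. Second, the within-read position is derived by walking a list of `(operation, length)` pairs that follows the BAM operation codes (the same layout Pysam exposes as `cigartuples`). The function names are hypothetical, and the sketch is written in plain Python so that it runs without a BAM file.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# CIGAR operation codes as defined for BAM: M I D N S H P = X
M, I, D, N, S, H, P, EQ, X = range(9)


def palindrome_arm_length(seq, center):
    """Length of the longest arm around seq[center] whose left flank is the
    reverse complement of the right flank (the center base is ignored)."""
    arm = 0
    while center - arm - 1 >= 0 and center + arm + 1 < len(seq):
        left = seq[center - arm - 1].upper()
        right = seq[center + arm + 1].upper()
        if COMPLEMENT.get(left) != right:  # also stops on N or other symbols
            break
        arm += 1
    return arm


def read_offset(ref_pos, read_start, cigartuples):
    """0-based offset in the read sequence of reference position `ref_pos`.

    `read_start` is the 0-based leftmost aligned reference position.
    Soft-clipped bases count toward the offset because they are part of the
    stored read sequence. Returns None if `ref_pos` lies in a deletion or
    skipped region, or outside the alignment.
    """
    ref, query = read_start, 0
    for op, length in cigartuples:
        if op in (M, EQ, X):      # consumes reference and read
            if ref <= ref_pos < ref + length:
                return query + (ref_pos - ref)
            ref += length
            query += length
        elif op in (I, S):        # consumes read only
            query += length
        elif op in (D, N):        # consumes reference only
            if ref <= ref_pos < ref + length:
                return None
            ref += length
        # H and P consume neither reference nor read
    return None
```

For example, `palindrome_arm_length("GAACTTC", 3)` reports an arm of three bases, because GAA on the left is the reverse complement of TTC on the right. A minimum arm length for calling an SCP would have to be chosen separately; the text does not state one.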
COSMIC mutational signatures [9] were computed from the trinucleotide frequencies of nucleotide substitutions in each sample. Probabilities for the signatures were downloaded from the COSMIC website (https://cancer.sanger.ac.uk/cosmic/signatures). The contribution of each signature to a tumor’s signature profile was computed by minimizing the difference between the observed trinucleotide frequencies and the linear combination of signature probabilities, using the Rsolnp (version 1.1.6) library.
A variant was counted as recurrent when a variant with the identical genomic coordinate and altered nucleotides was found in at least one other sample among the 28 tumors whose libraries were prepared with the HyperPlus kit. We used this variant recurrence as part of the filtering algorithm for data containing artifacts generated by preparation with the HyperPlus kit. Because genuine driver mutations are also recurrent across various types of cancer, and because most of them are already registered in the COSMIC database, these “COSMIC database-recurrent” variants were retained and passed on to further filtering.
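A minimal Python sketch of the recurrence step follows. The exact rule is an assumption: here a variant seen in two or more samples is removed unless its key is present in a set of COSMIC-registered variants, in which case it is kept for the later filters. The function name, the `(chrom, pos, ref, alt)` key, and the toy data are hypothetical.

```python
from collections import defaultdict


def split_by_recurrence(variants_by_sample, cosmic_keys, min_samples=2):
    """Partition variants into (kept, removed) using cross-sample recurrence.

    A variant is recurrent when the same (chrom, pos, ref, alt) key occurs
    in at least `min_samples` samples. Recurrent variants are removed unless
    they are registered in COSMIC (assumed rule, for illustration only).
    """
    samples_with = defaultdict(set)
    for sample, variants in variants_by_sample.items():
        for key in variants:
            samples_with[key].add(sample)

    kept, removed = set(), set()
    for key, samples in samples_with.items():
        recurrent = len(samples) >= min_samples
        if recurrent and key not in cosmic_keys:
            removed.add(key)
        else:
            kept.add(key)
    return kept, removed


# Toy data: made-up coordinates, three samples.
toy_variants = {
    "tumor01": [("chr1", 100, "A", "T"), ("chr7", 500, "G", "A")],
    "tumor02": [("chr1", 100, "A", "T"), ("chr7", 500, "G", "A")],
    "tumor03": [("chr9", 900, "C", "G")],
}
toy_cosmic = {("chr7", 500, "G", "A")}
kept, removed = split_by_recurrence(toy_variants, toy_cosmic)
```

In the toy data the chr1 variant is recurrent and not in the COSMIC set, so it is removed; the chr7 variant is recurrent but COSMIC-registered, and the chr9 variant is private to one sample, so both are kept.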
To remove SNVs/indels with positional bias in a read, we used p-values from Kolmogorov-Smirnov (KS) tests, computed in R (version 3.5.2), that compared the within-read positions of the variant and wild-type alleles. The ratio of soft-clipped reads to total reads carrying the SNV/indel was computed from the BAM file and was also used for filtering. A predictive model based on logistic regression was developed to classify each SNV/indel as noise or signal. Threshold lines were estimated by generalized linear models with the logit link function and then optimized to maximize the sum of sensitivity and specificity.
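Two pieces of this step lend themselves to a short Python sketch: the two-sample KS comparison of read positions and the choice of a cutoff that maximizes sensitivity plus specificity. The study used R; the code below is an independent illustration with hypothetical function names. Its p-value uses the standard asymptotic series with a small-sample correction, so it will not match R's exact small-sample p-values digit for digit. The `scores` passed to `best_threshold` are assumed to be model outputs in which higher values indicate noise.

```python
import math


def ks_statistic(a, b):
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    a, b = sorted(a), sorted(b)
    n, m = len(a), len(b)
    i = j = 0
    d = 0.0
    while i < n and j < m:
        x = min(a[i], b[j])
        # Step both CDFs past every value equal to x so ties are handled.
        while i < n and a[i] <= x:
            i += 1
        while j < m and b[j] <= x:
            j += 1
        d = max(d, abs(i / n - j / m))
    return d


def ks_pvalue(d, n, m, terms=100):
    """Asymptotic two-sided p-value for KS statistic d (approximation)."""
    if d <= 0:
        return 1.0
    ne = n * m / (n + m)
    lam = (math.sqrt(ne) + 0.12 + 0.11 / math.sqrt(ne)) * d
    total = sum((-1) ** (k - 1) * math.exp(-2.0 * (k * lam) ** 2)
                for k in range(1, terms + 1))
    return min(1.0, max(0.0, 2.0 * total))


def best_threshold(scores, is_noise):
    """Cutoff (predict noise when score >= cutoff) that maximizes
    sensitivity + specificity; needs at least one item of each class."""
    n_pos = sum(1 for y in is_noise if y)
    n_neg = len(is_noise) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both noise and signal examples are required")
    best_cut, best_sum = None, -1.0
    for cut in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, is_noise) if y and s >= cut)
        tn = sum(1 for s, y in zip(scores, is_noise) if not y and s < cut)
        total = tp / n_pos + tn / n_neg
        if total > best_sum:
            best_cut, best_sum = cut, total
    return best_cut, best_sum


# Toy positions: variant alleles clustered at read ends, wild type spread out.
variant_pos = [1, 2, 3, 2, 1, 4, 3, 2]
wildtype_pos = [10, 25, 40, 55, 70, 85, 60, 45]
d_stat = ks_statistic(variant_pos, wildtype_pos)
p_value = ks_pvalue(d_stat, len(variant_pos), len(wildtype_pos))
```

A small p-value, as in the toy positions above, indicates that the variant allele sits at systematically different read positions than the wild-type allele, which is the positional bias the filter targets.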
Supporting information
S1 Fig [docx]
Location of SNV-centered palindromes (SCPs) on the read.
S1 Table [xlsx]
Sequence and number of filtered palindrome sequences.
References
1. Taber KAJ, Dickinson BD, Wilson M. The Promise and Challenges of Next-Generation Genome Sequencing for Clinical Care. JAMA Intern. Med. 2014 174 : 275–80. doi: 10.1001/jamainternmed.2013.12048 24217348
2. Ma X, Shao Y, Tian L, Flasch DA, Mulder HL, Edmonson MN, et al. Analysis of error profiles in deep next-generation sequencing data. Genome Biol. 2019 20 : 50. doi: 10.1186/s13059-019-1659-6 30867008
3. Head SR, Komori HK, LaMere AS, Whisenant T, Van Nieuwerburgh F, Salomon DR, et al. Library construction for next-generation sequencing: Overviews and challenges. BioTechniques 2014 56 : 61–77. doi: 10.2144/000114133 24502796
4. Robasky K, Lewis NE, Church GM. The role of replicates for error mitigation in next-generation sequencing. Nat. Rev. Genet. 2014 15 : 56–62. doi: 10.1038/nrg3655 24322726
5. Abnizova I, te Boekhorst R, Orlov YL. Computational Errors and Biases in Short Read Next Generation Sequencing. J Proteomics Bioinform. 2017 10(1):1–17.
6. Chen G, Mosier S, Gocke CD, Lin MT, Eshleman JR. Cytosine Deamination is a Major Cause of Baseline Noise in Next Generation Sequencing. Mol Diagn Ther. 2014 18(5): 587–593. doi: 10.1007/s40291-014-0115-2 25091469
7. The Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature 2012 487 : 330–7. doi: 10.1038/nature11252 22810696
8. The Cancer Genome Atlas Network. Comprehensive molecular profiling of lung adenocarcinoma. Nature 2014 511 : 543–50. doi: 10.1038/nature13385 25079552
9. Alexandrov LB, Nik-Zainal S, Wedge DC, Campbell PJ, Stratton MR. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 2013 3 : 246–59. doi: 10.1016/j.celrep.2012.12.008 23318258
10. Nik-Zainal S, Alexandrov LB, Wedge DC, Van Loo P, Greenman CD, Raine K, et al. Mutational processes molding the genomes of 21 breast cancers. Cell 2012 149 : 979–93. doi: 10.1016/j.cell.2012.04.024 22608084
11. Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SA, Behjati S, Biankin AV, et al. Signatures of mutational processes in human cancer. Nature 2013 500 : 415–21. doi: 10.1038/nature12477 23945592
12. Tate JG, Bamford S, Jubb HC, Sondka Z, Beare DM, Bindal N, et al. COSMIC: the Catalogue Of Somatic Mutations In Cancer. Nucleic Acids Res. 2019 47 : D941–7.
13. Costello M, Pugh TJ, Fennell TJ, Stewart C, Lichtenstein L, Meldrim JC, et al. Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res. 2013 41 : e67.
14. Knierim E, Lucke B, Schwarz JM, Schuelke M, Seelow D. Systematic comparison of three methods for fragmentation of long-range PCR products for next generation sequencing. PLoS One 2011 6:e28240. doi: 10.1371/journal.pone.0028240 22140562
15. He HH, Meyer CA, Hu SS, Chen MW, Zang C, Liu Y, et al. Refined DNase-seq protocol and data analysis reveals intrinsic bias in transcription factor footprint identification. Nat. Methods 2014 11 : 73–8. doi: 10.1038/nmeth.2762 24317252
16. Lazarovici A, Zhou T, Shafer A, Dantas Machado AC, Riley TR, Sandstrom R, et al. Probing DNA shape and methylation state on a genomic scale with DNase I. Proc. Natl. Acad. Sci. U. S. A. 2013 110 : 6376–81. doi: 10.1073/pnas.1216822110 23576721
17. Koohy H, Down TA, Hubbard TJ. Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme. PLoS One 2013 8:e69853. doi: 10.1371/journal.pone.0069853 23922824
18. Meyer CA, Liu XS. Identifying and mitigating bias in next-generation sequencing methods for chromatin biology. Nat. Rev. Genet. 2014 15 : 709–721.
19. Roberts RJ, Halford SE. Type II Restriction Enzymes. In Roberts RJ, Linn SM, Lloyd RS, editors, Nucleases, 2nd Ed. Cold Spring Harbor Laboratory Press. 1993 pp. 35–88.
20. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009 25 : 1754–60. doi: 10.1093/bioinformatics/btp324 19451168
21. DePristo MA, Banks E, Poplin RE, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 2011 43 : 491–8. doi: 10.1038/ng.806 21478889
22. Koboldt DC, Zhang Q, Larson DE, Shen D, McLellan MD, Lin L, et al. VarScan 2: Somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 2012 22 : 568–76. doi: 10.1101/gr.129684.111 22300766
23. Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 2013 31 : 213–9. doi: 10.1038/nbt.2514 23396013
24. Ueda H. Karkinos. Available from: https://github.com/genome-rcast/karkinos
25. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010 38 : e164.
This article was published in the journal PLOS One, 2020, Issue 1.
- Spectral characteristics of urine from patients with end-stage kidney disease analyzed using Raman Chemometric Urinalysis (Rametrix)
- Fast quantitative time lapse displacement imaging of endothelial cell invasion
- Two novel mutations in MSX1 causing oligodontia
- Dome-shaped macula in children and adolescents
- Targeted transcriptomic study of the implication of central metabolic pathways in mannosylerythritol lipids biosynthesis in Pseudozyma antarctica T-34
- Preliminary evidences of the presence of extracellular DNA single stranded forms in soil
- A comparison of quality of life between patients treated with different dialysis modalities in Taiwan
- Comparison of Monocyte Distribution Width (MDW) and Procalcitonin for early recognition of sepsis
- Morphological association between the muscles and bones in the craniofacial region
- Transcriptome analysis of Actinidia chinensis in response to Botryosphaeria dothidea infection
- Comparative study on skin protection activity of polyphenol-rich extract and polysaccharide-rich extract from Sargassum vachellianum
- Real-world data about emotional stress, disability and need for social care in a German IBD patient cohort
- The regenerative compatibility: A synergy between healthy ecosystems, environmental attitudes, and restorative experiences
- Antenatal depression and its association with adverse birth outcomes in low and middle-income countries: A systematic review and meta-analysis
- Perceptions of risk and influences of choice in pregnant women with obesity. An evidence synthesis of qualitative research
- The role of refugee and migrant migration status on medication adherence: Mediation through illness perceptions
- Sexual risk classes among youth experiencing homelessness: Relation to childhood adversities, current mental symptoms, substance use, and HIV testing
- Effects of CK2β subunit down-regulation on Akt signalling in HK-2 renal cells
- Novel broad-spectrum activity-based probes to profile malarial cysteine proteases
- Association between opioid analgesic therapy and initiation of buprenorphine management: An analysis of prescription drug monitoring program data
- Effect of a community-based approach of iron and folic acid supplementation on compliance by pregnant women in Kiambu County, Kenya: A quasi-experimental study
- Improvement project in higher education institutions: A BPEP-based model
- An updated evaluation of serum sHER2, CA15.3, and CEA levels as biomarkers for the response of patients with metastatic breast cancer to trastuzumab-based therapies
- Genome-wide association study of metabolic syndrome in Korean populations
- Drug therapy problems and treatment satisfaction among ambulatory patients with epilepsy in a specialized hospital in Ethiopia
- Plasma kynurenines and prognosis in patients with heart failure
- Occurrence and distribution of anthropogenic persistent organic pollutants in coastal sediments and mud shrimps from the wetland of central Taiwan
- Intensified visual clutter induces increased sympathetic signalling, poorer postural control, and faster torsional eye movements during visual rotation
- Gut microbiota composition alterations are associated with the onset of diabetes in kidney transplant recipients
- Shock index and TIMI risk index as valuable prognostic tools in patients with acute coronary syndrome complicated by cardiogenic shock
- Merit overrules theory of mind when young children share resources with others
- Metabolic analysis of amino acids and vitamin B6 pathways in lymphoma survivors with cancer related chronic fatigue
- Immunopathogenesis of canine chronic ulcerative stomatitis
- Generalizing findings from a randomized controlled trial to a real-world study of the iLookOut, an online education program to improve early childhood care and education providers’ knowledge and attitudes about reporting child maltreatment
- When and what to test for: A cost-effectiveness analysis of febrile illness test-and-treat strategies in the era of responsible antibiotic use
- Comparison of effects and safety in providing controlled hypotension during surgery between dexmedetomidine and magnesium sulphate: A meta-analysis of randomized controlled trials
- The gene encoding the ketogenic enzyme HMGCS2 displays a unique expression during gonad development in mice
- Efficacy of a mitochondrion-targeting agent for reducing the level of urinary protein in rats with puromycin aminonucleoside-induced minimal-change nephrotic syndrome
- Association of endothelial nitric oxide synthase (NOS3) gene polymorphisms with primary open-angle glaucoma in a Saudi cohort
- Antitrust analysis with upward pricing pressure and cost efficiencies
- Natural selection contributes to food web stability
- Pyramiding QTLs controlling tolerance against drought, salinity, and submergence in rice through marker assisted breeding
- Diversity and plant growth-promoting functions of diazotrophic/N-scavenging bacteria isolated from the soils and rhizospheres of two species of Solanum
- Sofosbuvir-based regimen for genotype 2 HCV infected patients in Taiwan: A real world experience
- The virulence domain of Shigella IcsA contains a subregion with specific host cell adhesion function
- Sequencing artifacts derived from a library preparation method using enzymatic fragmentation
- Quantitative analysis of adsorption and desorption of volatile organic compounds on reusable zeolite filters using gas chromatography
- Quo vadis Pantanal? Expected precipitation extremes and drought dynamics from changing sea surface temperature
- Cloud-computing and machine learning in support of country-level land cover and ecosystem extent mapping in Liberia and Gabon
- The Brief Measure of Emotional Preoperative Stress (B-MEPS) as a new predictive tool for postoperative pain: A prospective observational cohort study
- The impact of diabetes mellitus medication on the incidence of endogenous endophthalmitis
- Correction: Chl1 DNA helicase and Scc2 function in chromosome condensation through cohesin deposition
- Clinical and pathological features of thrombotic microangiopathy influencing long-term kidney transplant outcomes
- Occupational exposure to particulate matter from air pollution in the outdoor workplaces in Almaty during the cold season
- Morphological adjustment in free-living Steinernema feltiae infective juveniles to increasing concentration of Nemafric-BL phytonematicide
- Key necroptotic proteins are required for Smac mimetic-mediated sensitization of cholangiocarcinoma cells to TNF-α and chemotherapeutic gemcitabine-induced necroptosis
- Concurrent lipidomics and proteomics on malignant plasma cells from multiple myeloma patients: Probing the lipid metabolome
- Retraction: SDR9C7 Promotes Lymph Node Metastases in Patients with Esophageal Squamous Cell Carcinoma
- Association between tuberculosis and depression on negative outcomes of tuberculosis treatment: A systematic review and meta-analysis
- Bioluminescent imaging of Arabidopsis thaliana using an enhanced Nano-lantern luminescence reporter system
- Biosynthetic pathway of indole-3-acetic acid in ectomycorrhizal fungi collected from northern Thailand
- Sex-specific and opposite modulatory aspects revealed by PPI network and pathway analysis of ischemic stroke in humans
- Control of the microsporidian parasite Nosema ceranae in honey bees (Apis mellifera) using nutraceutical and immuno-stimulatory compounds
- Role of donor genotype in RT-QuIC seeding activity of chronic wasting disease prions using human and bank vole substrates
- Oral magnesium supplementation for leg cramps in pregnancy—An observational controlled trial
- Health care professionals’ knowledge of commonly used sedative, analgesic and neuromuscular drugs: A single center (Rambam Health Care Campus), prospective, observational survey
- Campylobacter portucalensis sp. nov., a new species of Campylobacter isolated from the preputial mucosa of bulls
- Transgenic interleukin 11 expression causes cross-tissue fibro-inflammation and an inflammatory bowel phenotype in mice
- Sleep quality and sex modify the relationships between trait energy and fatigue on state energy and fatigue
- The role of peer, parental, and school norms in predicting adolescents’ attitudes and behaviours of majority and different minority ethnic groups in Croatia
- Availability, prices and affordability of selected antibiotics and medicines against non-communicable diseases in western Cameroon and northeast DR Congo
- The effect of mutations derived from mouse-adapted H3N2 seasonal influenza A virus to pathogenicity and host adaptation
- Detection of posttraumatic pneumothorax using electrical impedance tomography—An observer-blinded study in pigs with blunt chest trauma
- Educators’ perceptions of organisational readiness for implementation of a pre-adolescent transdisciplinary school health intervention for inter-generational outcomes
- Beyond the heterodimer model for mineralocorticoid and glucocorticoid receptor interactions in nuclei and at DNA
- The effects of sport expertise and shot results on basketball players’ action anticipation
- Framework and algorithms for identifying honest blocks in blockchain
- Exploring the impact of terminology differences in blood and organ donor decision making
- Platelet indices significantly correlate with liver fibrosis in HCV-infected patients
- The nitrate content of fresh and cooked vegetables and their health-related risks
- Bioreactor for mobilization of mesenchymal stem/stromal cells into scaffolds under mechanical stimulation: Preliminary results
- Non-gradient and genotype-dependent patterns of RSV gene expression
- Multiplex real-time PCR for the detection of Clavibacter michiganensis subsp. michiganensis, Pseudomonas syringae pv. tomato and pathogenic Xanthomonas species on tomato plants
- The 24-hour urinary cortisol in post-traumatic stress disorder: A meta-analysis
- Drug-eluting versus bare-metal stents for first myocardial infarction in patients with atrial fibrillation: A nationwide population-based cohort study
- Health-related quality of life among patients with type 2 diabetes mellitus in Eastern Province, Saudi Arabia: A cross-sectional study
- “I like the way I am, but I feel like I could get a little bit bigger”: Perceptions of body image among adolescents and youth living with HIV in Durban, South Africa
- Nanoparticle-based ‘turn-on’ scattering and post-sample fluorescence for ultrasensitive detection of water pollution in wider window
- Insights into the strategy of micro-environmental adaptation: Transcriptomic analysis of two alvinocaridid shrimps at a hydrothermal vent
- Thirty-day readmission after medical-surgical hospitalization for people who experience imprisonment in Ontario, Canada: A retrospective cohort study
- Hyper-spectral response and estimation model of soil degradation in Kenli County, the Yellow River Delta
- The association of telomere length and telomerase activity with adverse outcomes in older patients with non-ST-elevation acute coronary syndrome
- Construction of a high-density genetic map and fine mapping of a candidate gene locus for a novel branched-spike mutant in barley
- Alterations of aqueous humor Aβ levels in Aβ-infused and transgenic mouse models of Alzheimer disease
- Natural hybridization between Phyllagathis and Sporoxeia species produces a hybrid without reproductive organs
- The impact of peer attachment on prosocial behavior, emotional difficulties and conduct problems in adolescence: The mediating role of empathy
- Diagnostic performance of serum interferon gamma, matrix metalloproteinases, and periostin measurements for pulmonary tuberculosis in Japanese patients with pneumonia
- Characterization of black patina from the Tiber River embankments using Next-Generation Sequencing
- Problem gambling, associations with comorbid health conditions, substance use, and behavioural addictions: Opportunities for pathways to treatment
- Nanosheet wrapping-assisted coverslip-free imaging for looking deeper into a tissue at high resolution
- Validity of cerebrovascular ICD-9-CM codes in healthcare administrative databases. The Umbria Data-Value Project
- Torque teno virus viral load is related to age, CMV infection and HLA type but not to Alzheimer's disease
- Associations of cigarette smoking and burden of thoracic aortic calcification in asymptomatic individuals: A dose-response relationship
- Transforming assessment of speech in children with cleft palate via online crowdsourcing
- Human-raptor conflict in rural settlements of Colombia
- Assessment of peritoneal microbial features and tumor marker levels as potential diagnostic tools for ovarian cancer
- Deficiency syndromes in top predators associated with large-scale changes in the Baltic Sea ecosystem
- Perceived relative social status and cognitive load influence acceptance of unfair offers in the Ultimatum Game
- Hepatitis B and C virus infection among HIV patients within the public and private healthcare systems in Chile: A cross-sectional serosurvey
- Retraction: Oncogenic Fibulin-5 Promotes Nasopharyngeal Carcinoma Cell Metastasis through the FLJ10540/AKT Pathway and Correlates with Poor Prognosis
- From seed to flour: Sowing sustainability in the use of cantaloupe melon residue (Cucumis melo L. var. reticulatus)
- Core Scientific Dataset Model: A lightweight and portable model and file format for multi-dimensional scientific data
- Accounting for measurement error to assess the effect of air pollution on omic signals
- Leucine zipper transcription factor-like 1 binds adaptor protein complex-1 and 2 and participates in trafficking of transferrin receptor 1
- Barriers for tuberculosis case finding in Southwest Ethiopia: A qualitative study
- Genetic predisposition to celiac disease in Kazakhstan: Potential impact on the clinical practice in Central Asia
- A lower psoas muscle volume was associated with a higher rate of recurrence in male clear cell renal cell carcinoma
- Two angles of overqualification-the deviant behavior and creative performance: The role of career and survival job
- Cost-utility analysis of de-escalating biological disease-modifying anti-rheumatic drugs in patients with rheumatoid arthritis
- Efficient estimation of stereo thresholds: What slope should be assumed for the psychometric function?
- Learning efficient haptic shape exploration with a rigid tactile sensor array
- Effects of dietary supplementation with a microalga (Schizochytrium sp.) on the hemato-immunological, and intestinal histological parameters and gut microbiota of Nile tilapia in net cages
- Regional versus local wind speed and direction at a narrow beach with a high and steep foredune
- Fragmented QRS complex in patients with systemic lupus erythematosus at the time of diagnosis and its relationship with disease activity
- Severe thiamine deficiency in eastern Baltic cod (Gadus morhua)
- Transfer entropy as a variable selection methodology of cryptocurrencies in the framework of a high dimensional predictive model
- Psychometric validation of Czech version of the Sport Motivation Scale
- Correction: Multiple innate antibacterial immune defense elements are correlated in diverse ungulate species
- Recognition of personality disorder and anxiety disorder comorbidity in patients treated for depression in secondary psychiatric care
- Correction: Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform