Liao et al., 2019 - Google Patents

Current challenges and solutions of de novo assembly

Document ID: 15067389215724397186
Authors: Liao X, Li M, Zou Y, Wu F, Wang J
Publication year: 2019
Publication venue: Quantitative Biology

Snippet

Background: Next‐generation sequencing (NGS) technologies have fostered an unprecedented proliferation of high‐throughput sequencing projects and a concomitant development of novel algorithms for the assembly of short reads. However, numerous …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/22 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286 Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30312 Storage and indexing structures; Management thereof
    • G06F17/30321 Indexing structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/24 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/18 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for functional genomics or proteomics, e.g. genotype-phenotype associations, linkage disequilibrium, population genetics, binding site identification, mutagenesis, genotyping or genome annotation, protein-protein interactions or protein-nucleic acid interactions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286 Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30289 Database design, administration or maintenance
    • G06F17/30303 Improving data quality; Data cleansing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/28 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/14 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for phylogeny or evolution, e.g. evolutionarily conserved regions determination or phylogenetic tree construction
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00 Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • G06F19/16 Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for molecular structure, e.g. structure alignment, structural or functional relations, protein folding, domain topologies, drug targeting using structure data, involving two-dimensional or three-dimensional structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20 Handling natural language data
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES OR MICRO-ORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or micro-organisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or micro-organisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869 Methods for sequencing

Similar Documents

Publication | Title
Liao et al. | Current challenges and solutions of de novo assembly
Sohn et al. | The present and future of de novo whole-genome assembly
Li | Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences
Schulz et al. | Fiona: a parallel and automatic strategy for read error correction
Li | Toward better understanding of artifacts in variant calling from high-coverage samples
Yang et al. | A survey of error-correction methods for next-generation sequencing
Heo et al. | BLESS: bloom filter-based error correction solution for high-throughput sequencing reads
Berlin et al. | Assembling large genomes with single-molecule sequencing and locality-sensitive hashing
Narzisi et al. | Comparing de novo genome assembly: the long and short of it
Schubert et al. | Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX
Selvaraj et al. | Whole-genome haplotype reconstruction using proximity-ligation and shotgun sequencing
Nagarajan et al. | Sequence assembly demystified
Töpfer et al. | Viral quasispecies assembly via maximal clique enumeration
Homer et al. | BFAST: an alignment tool for large scale genome resequencing
Chen et al. | TIGRA: a targeted iterative graph routing assembler for breakpoint assembly
Wee et al. | The bioinformatics tools for the genome assembly and analysis based on third-generation sequencing
Ronen et al. | SEQuel: improving the accuracy of genome assemblies
Zhou et al. | Prevention, diagnosis and treatment of high‐throughput sequencing data pathologies
Wan et al. | VirAmp: a galaxy-based viral genome assembly pipeline
Sarmashghi et al. | Estimating repeat spectra and genome length from low-coverage genome skims with RESPECT
Orabi et al. | Alignment-free clustering of UMI tagged DNA molecules
EP3482329B1 (en) | A computer-implemented and reference-free method for identifying variants in nucleic acid sequences
Klein et al. | LOCAS–a low coverage assembly tool for resequencing projects
Howison et al. | Toward a statistically explicit understanding of de novo sequence assembly
Chen et al. | Recent advances in sequence assembly: principles and applications