English
Español
Valencià
De Novo Studies
data-analysis image

At Biotechvana we are pleased to offer our users de novo data analysis services aimed at assembling and annotating new genomes and transcriptomes, both prokaryotic and eukaryotic, for which no prior reference is available. These services also allow the study of microbial metagenomes and metatranscriptomes for the characterization of complex communities and non-model species.

Within the scope of de novo studies, we offer different types of analyses tailored to the needs and objectives of each project, including:

  • Genomes and metagenomes
  • Transcriptomes and metatranscriptomes
  • Viruses and Mobile Genetic Elements

Each project is approached in a personalized manner, adapting the workflow to the experimental design.

Genomes, transcriptomes, metagenomes, metatranscriptomes

We perform assembly and analysis of complex sequencing data to obtain reliable genomic and transcriptomic representations in the absence of a reference. This approach allows work at both individual-organism and community levels, enabling the study of gene content and expression profiles as a basis for subsequent functional analyses.

1
Raw Data Preprocessing
  • Quality analysis of sequencing reads (sff, fastq, sam, fasta, etc.).
  • Read preprocessing, including demultiplexing and removal of low-quality sequences, primer/adapter remnants, and artifacts.
2
De Novo Assembly and Scaffolding
  • Assembly of processed reads (into contigs) and scaffolding of contigs.
  • Gap filling and re-scaffolding in genomic studies.
  • Consensus reference reconstruction by merging two or more assemblies.
  • Isoform reconstruction (for transcriptome-oriented studies).
  • ORF prediction and extraction (prokaryotic) or exon–intron structure prediction (eukaryotic).
  • Inference of assembly metrics.
3
Annotation and Functional Analysis
  • Repeat masking where appropriate.
  • Automatic annotation of coding and non-coding genes.
  • Functional analysis and metabolic pathway characterization.
  • Detection of regulatory elements such as start/stop codons, promoters, etc.
  • Data integration.
4
Downstream Analyses Oriented to Discovery
  • Correction of homopolymers and artifactual frame shifts in coding sequences.
  • Characterization of paralogous and orthologous genes.
  • Phylome annotation.
  • Data mining oriented to knowledge discovery.
  • Comparative analysis.
  • Curation and post-processing of sequences.
  • Database implementation.
Characterization of complete or partial genomes of viruses and mobile genetic elements

We address the identification, assembly, and structural characterization of viral genomes and mobile genetic elements present in sequencing data. These analyses allow description of their genomic organization, evaluation of their diversity, and study of their role in the genetic and evolutionary dynamics of the systems analyzed.

1
Raw Data Preprocessing
  • Quality analysis of sequencing reads (sff, fastq, sam, fasta, etc.).
  • Read preprocessing, including demultiplexing and removal of low-quality sequences, primer/adapter remnants, and artifacts.
  • Curation of reads when appropriate.
2
De Novo Assembly
  • De Novo assembly of processed reads.
  • Genome circularization of the mobile element, if applicable.
  • Inference of assembly metrics.
3
Annotation
  • Characterization of LTR and TIR where applicable.
  • ORF annotation for genes and other regulatory elements of the viral or mobile element genome.
  • Phylogenetic analysis.
Annotation of Viromes / Mobilomes

We carry out functional and taxonomic annotation of viromes and mobilomes to interpret the genetic content of viruses and mobile elements present in a sample. This analysis enables identification of functions, families, and genes of interest, as well as exploration of processes related to genetic transfer, adaptation, and evolution.

1
Mapping and Annotation
  • Masking of repeats and transposons using comparative analysis with various reference databases.
  • De Novo identification of repeats by self-comparison of the characterized genome.
  • Search for tandem repeats in the genome.
  • Characterization of tight junctions.
  • Characterization of LTRs and TIRs.
  • ORF annotation for genes and other features.
  • Functional analysis using biological vocabularies such as Gene Ontology (GO).
  • Characterization of metabolic pathways.
  • Annotation and reconstruction of the viral phylome.
2
Integrative and Downstream Analyses
  • Complete reconstruction of a mobile element or viral genome.
  • Analysis of orthologous mobile elements or viruses.
  • Analysis of differential insertion in the host genome.
  • Data integration.
  • Other statistical analyses.
If you are interested in obtaining more information about the services we offer for data analysis from de Novo experiments, please contact us at biotechvana@biotechvana.com. We will send you a quote for your study, or, if you prefer, we can arrange a meeting with you and your team without any obligation.
Sign in to your account
Username or email
Password